How Tabnine protects your code privacy

TL;DR:

Visit Tabnine’s Trust Center to learn more

Your code always remains private

Tabnine NEVER stores or shares any of your code. Any action that shares your code with the Tabnine servers for the purpose of training team models requires explicit opt-in. Tabnine does not retain any user code beyond the immediate time frame required for training models. Private code models created by Tabnine Enterprise are only accessible by your team members.

Trained on open-source code with permissive licenses

Tabnine’s generative AI only uses open-source code with permissive licenses for our Public Code trained AI model (MIT, Apache 2.0, BSD-2-Clause, BSD-3-Clause). Whether you’re using Tabnine’s Pro plan or our Basic plan, your code, and AI data are NEVER used to train any models other than private code models.

Tabnine privacy deep dive

What is Tabnine?

Tabnine provides AI code completions using two sources: (1) A Public Code model that has been trained on open-source code. (2) Private code models trained on your and your team’s code and local interactions with Tabnine.

How do we build our models?

Training the Open-Source model

  • The Open-Source model is trained on public code from GitHub.
  • It only uses repositories with permissive open-source licenses (MIT, Apache 2.0, BSD-2-Clause, BSD-3-Clause).
  • Trained code is filtered to ensure quality and avoid outdated code, esoteric code, auto-generated code, and other edge cases.
  • The model is updated regularly to capture recent developments.

Training the Private code Models (available with Tabnine Enterprise)

  • Private code models are trained on your code and interactions with Tabnine.
    The model and AI training data are never shared with anyone outside your team.
  • Private code models capture patterns that are specific to your team.
  • Private code models require an explicit opt-in from the team administrator.
  • Every private code model is trained in a secure and separate environment.
How do we use our models?

Tabnine uses different models to deliver the best possible experience and quality.

Local machine mode (available on Tabnine Pro)

Allowing you to run Tabnine Pro's AI models locally, the models are downloaded to your machine and the data is queried locally. You receive code completions continuously and quickly in this mode.

Cloud mode

AI models will run on Tabnine Cloud, which requires a good network in order to function.

For both models, your code and AI training data are only used to provide the service, and your code is never shared nor used as part of Tabnine’s open-source trained AI.

Due to their inherent benefits, as of June 1, 2022 both modes will be enabled by default for all new installations, allowing for the best quality and smoothest user experience (you can opt-out of either mode, at any time).

Tabnine for teams

  • The Public Code AI model is downloaded from the Tabnine server.
  • Inference uses two models: The Public Code model and your private code model.
  • The private code models are trained continuously on your code and your team’s interactions with code completions.
  • The cumulative interactions are used to fine-tune The private code model, which in turn improves code completions for the entire team by identifying the most relevant patterns.

Your code’s privacy is our highest priority. We at Tabnine want to reassure you that your code stays private when using our products. If you still have questions or reservations, please contact support@tabnine.com