Typically, Databricks recommends utilizing OAuth as an alternative of Private Entry Tokens (PATs) for authentication with Databricks to boost safety. We are actually extending this suggestion to Databricks Git credentials and encourage using OAuth over Git suppliers’ PATs when authenticating along with your Git suppliers.
At this time, we’re excited to announce the Common Availability of OAuth Git credential help for Service Principals with GitHub and Azure DevOps, enhancing Git connection safety for automated workloads.
Databricks Git integration initially supported solely PATs for authentication. Customers created private entry tokens with their Git supplier and saved the tokens in Databricks. This method is not really helpful for just a few causes, together with:
- [Long lifetimes] PATs supply longer entry durations (weeks/months) than short-lived tokens (hours/days). Though directors can implement shorter PAT lifespans, this creates operational challenges as customers should often replace their Databricks Git credentials to keep away from workflow failures upon expiration.
- [Insecure storage and transfer] Customers typically manually copy PATs, which may depart traces in clipboards and paperwork.
- [Wide scopes] Some PATs, akin to GitHub Traditional PATs, apply to each repo the consumer can entry. This behaviour can simply result in unintended privilege escalation and permit for lateral motion.
- [Missing service principal support] Some Git suppliers, akin to Azure DevOps, don’t help producing PATs for service principals.
Our hottest Git suppliers discourage using PATs: GitHub and Azure DevOps don’t suggest utilizing PAT for long-lasting integrations. Bitbucket recommends Bitbucket Cloud integration or app builders use OAuth for consumer authentication as an alternative of entry tokens.
Databricks has supported OAuth 2.0-based consumer authentication with GitHub and Azure DevOps for a number of years, however this help was beforehand restricted to interactive consumer periods.
Now that Service Principal help is usually accessible, our suggestion is to make use of OAuth as an alternative of PATs when integrating with these Git suppliers for each interactive and automatic workflows. What are the advantages? Take our GitHub App integration for example:
- OAuth tokens are robotically refreshed by default. Customers not encounter errors when their PAT token expires.
- OAuth presents enhanced administrative management, particularly concerning the viewing and entry of built-in repos.
- OAuth lets you configure entry to particular GitHub repos.
- Entry tokens have a brief lifespan (on this case, 8 hours), which reduces the chance of credential publicity.
Some clients have requested SSH authentication and GPG commit signing. Nevertheless, we selected to put money into OAuth help as an alternative, as SSH and GPG would require customers to add non-public keys to Databricks, just like storing a PAT, resulting in the identical drawbacks: long-lived credentials and guide rotation. Furthermore, if an improperly scoped SSH key have been compromised, it may grant an attacker direct entry to the Git server host, considerably rising the chance of exploitation.
Getting Began
For GitHub, you’ll be able to configure the Service Principal GitHub App connection on the Service Principal’s settings web page, following an identical course of as a consumer’s configuration. For Azure DevOps, we now help OAuth connections for service principals utilizing federated credentials primarily based on OpenID Join (OIDC). OIDC is an authentication protocol constructed on prime of OAuth 2.0 that gives login and profile details about the logged-in consumer. OIDC permits safe and user-friendly login experiences by permitting customers to authenticate as soon as with a trusted identification supplier (IdP, on this case, Microsoft EntraID) and be remembered while not having to re-enter credentials. This new characteristic replaces the sooner scripting-based method described on this weblog, considerably simplifying and shortening this essential consumer journey from hours to just some minutes.