Data Collection
We design our data collection to be purpose-built, minimal, and secure, with a focus on collecting only the metadata required to generate accurate metrics:
- Metadata only: Our Git integrations collect metadata about your organization’s activity across repositories, pull requests, and deployments. We never read, transmit, or store source code.
- Minimal PII: We store minimal personally identifiable information (PII) for organization members, such as name and email address.
- Data encryption: Sensitive data, such as branch names and pull request titles, are always encrypted.
We partner with Drata, a continuous compliance platform, to monitor our security posture on an ongoing basis. Our SOC 2 Type 2 report, latest security and policy documents, continuous monitoring status, and list of subprocessors can all be found at our Trust Center .
Git Data
Our Git integrations collect only the metadata required to measure engineering activity and delivery. We request only the necessary permissions to analyze Git metadata. You can learn more about the scopes requested for GitHub, GitLab, Bitbucket, and Azure DevOps.
Organizations and Teams
To enable aggregation of metrics at the organization, team, or group level, we collect data about your organization structure:
- Organization metadata, such as name, ID, and URL
- Team metadata, such as name, ID, and URL
Members
For attributing Git activity to individuals, we collect limited member metadata:
- Name, username, ID, and URL
- Organization roles (e.g. admin, member)
- Team memberships (e.g., GitHub Teams)
- Publicly available location information from a user’s profile
Repositories
To understand where work occurs and how activity is distributed across projects, we collect repository metadata:
- Name, URL
- Visibility (public, private)
- Branch names
- Branch rules
Pull Requests
To analyze development activity, we collect pull request metadata:
- Pull request title, ID, number, URL, status, labels, head branch, base branch, author, and change counts
- Commit SHAs, timestamps, filenames, and change counts
- Events (e.g. created, updated, closed, merged)
- Review activity (e.g. commenting, assigned reviewers)
Deployments
For deployment tracking, we collect event metadata about workflows/pipelines, check runs, deployments, commit statuses, releases, and tags, including:
- Name
- Status
- Completion and run time
AI Tools
We use AI tool metadata to understand adoption of AI-assisted development across your team and measure its impact on productivity when combined with Git data.
Usage Reports
We collect usage data from AI tools to measure adoption and engagement. This does not include the contents of AI interactions, such as chat prompts, responses, or code suggestions. While the data available varies by AI tool, common types of data we collect include:
- Suggestion and acceptance metrics (counts, rates, lines)
- Programming language and file type
- Feature usage (e.g., autocomplete, chat, agent)
- Model usage
- Seat assignments
Team Members
For attribution and reporting, we collect limited user metadata:
- Name
- Role (e.g., owner, member)
Cost Data
For tools that provide billing information, we collect license-level cost data:
- License cost (GitHub Copilot only)
Planning Data
Planning data is imported through our integrations with project tracking tools such as Jira. We collect issue- and project-level metadata to help analyze work patterns, planning accuracy, and delivery trends. We do not access attachments, comments, or other free-form content beyond standard issue fields.
We subscribe to webhooks to stay in sync with changes to project data, such as issue updates or status transitions (for example, moving an issue from In Progress to Done).
Projects
To identify and organize work across project spaces, we collect basic project metadata:
- Workspace name
- Project name and key
Issues
To understand how work is planned, tracked, and completed, we collect issue metadata:
- Issue name and key
- Status, type, labels, and priority
- Assignee
- Story points
- Associated project and sprint
Sprints
To analyze planning cycles and delivery timelines, we collect sprint-level metadata:
- Sprint name and key
- Status and target date
- Associated project
Team Members
For attribution and reporting purposes, we collect limited user metadata:
- Name
User Enrichment
You can import additional information about team members to help create and manage group memberships, enabling team-level insights and cohort-based analysis (for example, comparing AI usage across teams or roles). You can choose which fields to upload, and providing this data is entirely optional.
Currently supported fields include:
- Name
- Role
- Manager
- Team
- Department
- Company
- Location
- Product Line