Data/WorkingGroups/GleanDictionary
Contents
Quick Reference
Production URL | dictionary.telemetry.mozilla.org |
Development URL | glean-dictionary-dev.netlify.app |
Source code | mozilla/glean-dictionary on GitHub |
Matrix Channel | #glean-dictionary on chat.mozilla.org |
Meetings | No longer occurring (Glean Dictionary is occasionally discussed in the Glean Platform meetings). For legacy minutes, see: Meeting Notes |
Charter
The focus of this working group is to produce a usable version of the Glean Dictionary, a data dictionary for applications written using Glean SDK. This includes Firefox for Android and Firefox for iOS. As Firefox on Glean comes together, the Glean Dictionary will also index the metadata sent by Firefox Desktop.
There are currently four planned phases for the project:
- (done) Index Glean metric and ping data produced by Glean applications, providing links to their representation in BigQuery
- (done) Index derived datasets produced by bigquery-etl, creating generated documentation at bigquery-etl
- (done) Add facilities to annotate glean metrics and pings with additional data and commentary useful for data scientists and other data practitioners at Mozilla
- (in progress) Represent and link the dataset documentation outlined above in the Glean Dictionary
We are currently discussing what comes after this work is complete. Possible directions include integrations with Mozilla's Looker deploy and better support for Firefox Desktop (as it moves over to Glean).
Stakeholders
Glean team, Data science, the Data Taxonomy Effort, other consumers of data.
Getting involved
We welcome your feedback and involvement! We work in the open and anyone from the Mozilla community is welcome to join this group. This project involves a variety of pieces emphasizing different technologies, including:
- Building out the frontend (JavaScript, svelte)
- Working on the data infrastructure pieces to gather metadata (python, BigQuery)
- Improving data documentation and metadata definitions through Mozilla (markdown, yaml, python)
If you want to contribute, but aren't sure where to start, join our #glean-dictionary matrix channel and say hi! Someone can probably find an initial task for you to work on.
Communication
There are two primary communication channels for the group:
- The #glean-dictionary channel on Matrix for quick questions
- Discussions in GitHub issues for design discussions and all other questions
Generally, discussions in GitHub issues is preferred since it can happen asynchronously and leaves a historical record that we can refer back to. For larger changes, consider writing a proposal (see below).
All project communications should follow the Mozilla Community Participation Guidelines.
Coordination
This group is currently being coordinated by Will Lachance (wlach on Github, Matrix, and Mozilla Slack). Feel free to get in touch if you have feedback or questions!
Proposals
Proposal | Date | Status |
---|---|---|
Glean Metric Annotation Repository | 22 February 2021 | Implemented |
Working Group Proposal | 20 November 2020 | Accepted |
Initial Proposal | 10 August 2020 | Accepted |