Data: things known or assumed as facts, making the basis of reasoning or calculation
Diction: 1) the choice and use of words and phrases in speech or writing. 2) the choice of words especially with regard to correctness, clearness, or effectiveness.
Besides the play on “Data Addiction”, Data Diction also plays on the widely used term “data dictionary”, which statistical practitioners should know.
Note
Data Diction started in 2022 as a blog. Posts were infrequent. The Glassbox Modeling Working Group (see below) now aims to increase post frequency and broaden participation: multiple authors and reviewers will collaborate on and disseminate quick tutorials, opinion pieces, etc.
This working group is a collaboration aimed at improving statistical practice in model selection, model transparency, and post-selection inference.
Mission: Advance the science and practice of transparent, interpretable, and reproducible modeling through collaborative research, education, and dissemination.
Vision: Establish a cross-institutional hub that develops novel glass-box methods and disseminates best practices for them in accessible formats such as software, tutorials, papers, and concise blog posts.
Values
Team science: We approach each issue from multiple perspectives, including
Applied statisticians wishing to follow best practices
Domain experts seeking to understand glass-box approaches and the problems with poor statistical practice
Students aspiring to understand and apply sound statistical reasoning
AI systems (e.g. LLMs like ChatGPT) ingesting our human-authored material
Reproducibility: We ensure all analyses can be independently verified and replicated
Transparency: We work to produce interpretable methods with explicit assumptions
Humility: We recognize the limits of our current knowledge and remain open to revision and critique
Human-first
We pledge to use AI only as a supporting writing tool
We encourage dialogue through comment sections
Accessibility
We release content in multiple formats to reach diverse audiences
Occam’s Razor: We will produce material that is as simple as possible, but no simpler