The House of Data Series: Data Literacy
This paper focuses on what it means to read, work with, and communicate about data at different skill levels — and what literacy requires in the age of AI. It does not cover training program design or enablement tooling in depth — those are addressed in the Data Enablement whitepaper.
.png)
.png)
Get the Best of Data Leadership
Stay Informed
Get Data Insights Delivered
House of Data Series
Every strong data program is built like a house. Data Architecture forms the foundation — the platforms, pipelines, and operating model that everything else depends on. Seven domain pillars rise from that foundation, each one essential to a complete data program: Data Quality, Privacy, Data Security, DataOps, Compliance, Data Enablement, and Data Consumption. Data Literacy runs across all seven as a connecting beam, ensuring people at every level can read, interpret, and act on data. At the top, People & Leadership sets the direction, accountability, and culture that holds the whole structure together.
This series of whitepapers covers each component of the House of Data in depth. Each paper was written by a practitioner with direct experience in that domain. Together, they form a practical guide to building data programs that earn — and keep — trust.
This paper covers Data Literacy — the connecting beam of the House of Data, running across all seven pillars and serving as the prerequisite for any of them working as intended. A data program with strong architecture, quality controls, and security posture still fails if the people using the data can't read, interpret, and question it.
Data literacy
We often hear about the ideas of being data-driven, data-first, or data strategy organizations. We also often hear from politicians about the need to have data-driven legislation. We also have state assessments of our kids that evaluate data analysis. The topic we don't hear as much about is the idea of data literacy. Much of the information available on the topic starts with literacy and seems to fall into the same tired topic of data analytics we see elsewhere.
The lack of data literacy in the things we do leads to confusion, distortion, and delay. This white paper will focus on the needs to build out data literacy in our organizations to make better decisions and use data more appropriately.
Val Logan at The Data Lodge has transitioned from being a top-flight analytics consultant, to leading the charge for data literacy at Gartner, to founding The Data Lodge, a company focused on data literacy. The Data Lodge states the focus of data should be: "The ability to Read, Write, and Communicate with data in context in both work and life."
They continue and bring this notion down to mindset, language, and skills. In this paper we will organize around those areas but also discuss: (1) Getting started with data literacy; (2) Evaluating data literacy progress; (3) Conflation of data literacy; (4) Importance of data communication; (5) The varied relationships between data storytelling and data literacy.
Data literacy defined
There are a few definitions for data literacy worth considering.
The Oceans of Data Institute defines the data literate individual as one who "understands, explains and documents the utility and limitations of data by becoming a critical consumer of data, controlling [one's] personal data trail, finding meaning and taking action based on data. [One] can identify, collect, evaluate, analyze, interpret, present and protect data."
The Data Literacy Project defines data literacy as the ability to explore, understand, and communicate with data in a meaningful way. This can be on different levels: technically and advanced, or on a much more basic level.
A public domain definition is: data literacy is the competence to access, interpret, evaluate, and communicate data to derive insights, make informed decisions, and question conclusions based on data.
Another public domain definition is: data literacy is the ability to read, understand, analyze, and communicate with data in a meaningful way. It combines technical understanding with critical thinking and business context, enabling individuals to use data effectively for decision-making.
For this white paper we will use The Data Lodge definition: the ability to Read, Write, and Communicate with data in context in both work and life.
Extended definition and classes of data use
In expanding our context, let's consider some classes of data use that better describe the maturity of the data user. It is important to note the more experienced people are with data, the increased skills they will have to drive out the meaning of the data and grow business benefit.
Data literacy in 2025
In 2025, our attention span is gone. With the advent of GenAI "Ask a Question, Get an Answer" culture, people rarely take the time to read, comprehend, integrate, and critically analyze, but rather just take the answer and run with it.
Additionally, the need for data literacy has changed from read-interpret-and-manipulate data to one in which we need to critically analyze what we find and not be afraid to ask questions after reviewing an answer, result, or report — questions like these:
- Wait a minute. Does this make any sense?
- This can't be right. How do I dig deeper to verify this result?
- Is this data of adequate quality? Should I even be seeing this?
Further, we need to build out programs that try to ensure consistency, anchor our data sets on known truths, be clear about the goals of your research, and validate your findings, cross-check, and leverage subject matter experts for validation.
Document your findings by establishing the goals of your research, the key points or data anchors, documenting your findings, and establishing next steps. Some organizations will focus on being right at the end of the meeting, so be careful to use data to be helpful, not as a blunt instrument to be used as an offensive instrument.
Mindset
Work to develop your people to establish a positive mindset that is focused on data trust. Further, expand the idea that data literacy isn't just about data, but growing the intellectual capabilities across your organization.
Place a premium on the ability to have staff read, review, and grow their intelligence to have broader understanding of the world we compete in, what our competitive and cooperative climate looks like, and find ways to expand ideas and learning from broader understanding.
Also encourage doubt. Grow the idea that it isn't only acceptable but encouraged to doubt the things we think and come up with new and different ideas to make everyone more helpful. This should be done in a way that allows people to be helpful, not hurtful, and encourages growth.
Language
Take the time to build out a common language. This can be related to the way you run your business, interact in your teams, and relate to data and non-data objects.
Nearly all firms find a great benefit in having a common set of business glossaries that unify language, solidify the use of acronyms, and bring people together. It is recommended to have company glossaries, not departmental. Focus on aligning the use of language even when it is difficult.
Some firms segregate terms across business units, geographies, or functions and then have the same term used to mean different things. Don't do that. Do the hard work to get alignment across these very different constituencies.
Skills
Work with your teams and all levels of staff to build out their data literacy skills. These can include but should not be limited to:
Continue to build your other skills that are related to data literacy. It is only with commanding capability of these core skills that you can focus on the more specific and challenging data literacy skills listed above.
Adapted from: Tableau
Conflation of data literacy
The current issue with data literacy is the same problem as with data culture. There have been many organizations or people who use the term to make a case for what they care about, which may have nothing to do with what data literacy really is.
As an example, some would use aspects of data analysis, design, development, and roll-out to be data literacy. Others will paint a picture that data analysis, data wrangling, data visualization, data ecosystem, and governance are data literacy. While these are important topics they are not data literacy. They should be considered "Conflation of Data Literacy."
Importance of data communication
There is a widely held belief that communicating with data is a method of delivering messages that are generated with data analytics. Most believe that when this is done correctly, disseminating this information helps the audience to quickly and easily assimilate material and draw the desired outcomes from it. There was a point not that long ago where debates occurred between what the difference is between data, information, and knowledge. Things have evolved so now we try to tell data stories about that situation.
Varied relationship between data literacy and data storytelling
There are two buckets of data storytellers. Those who have a narrow view of data storytelling that is based on analyzing and telling the story of the data they see. This is most completely told by the books from Mike Cisneros that have a theme of Data + Narrative + Call to Action as data storytelling. His latest book is Storytelling with Data Before-After. The other is the idea of inspiring leaders to take advantage of data in their organizations, while Scott Taylor (billed as the "Data Whisperer") tends to talk loudly about what should be done, not just the most recent hot trend, and focuses on the broad set of needs around data management from operational systems, Master Data Management (MDM), and all other critical operational systems.
There is a need to tell the story about data literacy, get people to use data, doubt data, and draw the right conclusions.
Data literacy in relation to AI trust
There is no place that data literacy has a bigger role to play than in AI trust. While AI trust is aligned with data quality, sensitivity, and the use of certified data, the core focus is 100% in alignment with the core focus of data literacy.
In the world of AI, one of the biggest criticisms is that people draw poor conclusions from what gets generated and people don't take the time to understand what the data means. While there is a push for AI governance, the real training needed is to educate all users of AI in data literacy so they make good decisions about how they use AI-generated conclusions.
Data literacy should become the base for AI trust. By having educated staff who understand data and are literate in its meaning, risks, and benefits, entire organizations can reap vast benefits.
Getting started with data literacy
One question that is often asked relative to data literacy is "How do I get started?" This can look like a daunting task. The following is an idea of how to move forward, originally shared by The Data Literacy Project.
1. Plan and assess
- Assess current state: Conduct a skills gap analysis to understand your organization's starting point and identify areas for improvement.
- Define goals: Determine what you want to achieve with data literacy, who the learners are, and why you are launching the program.
- Form a task force: Create a team to lead the initiative, and ensure stakeholders are aligned on the program's vision and objectives.
2. Build the foundation
- Develop a framework: Create a data literacy roadmap, including key concepts like data governance, data ethics, and data quality.
- Curate data resources: Identify reliable data sources and create a "measure library" with common metrics and definitions to ensure consistency.
- Choose your tools: Select user-friendly analytics tools that help with data access, visualization, and a clear understanding of the data.
3. Enable and engage your team
- Provide training: Design and deliver comprehensive training programs, which can be formal or informal, to teach essential skills like interpreting and visualizing data.
- Offer hands-on practice: Create a "sandbox" environment for employees to safely experiment with data and practice new skills.
- Empower champions: Identify and support data champions within the organization to help spread knowledge and encourage others.
4. Foster a continuous learning culture
- Communicate the vision: Share the program's goals and progress with the entire organization to build buy-in and excitement.
- Emphasize storytelling: Teach employees to not just analyze data, but to also use it to tell a compelling story that drives decisions.
- Measure and iterate: Continuously evaluate the program's impact and use feedback to make improvements and scale the initiative.
House of Data reference
The idea of data literacy is present in the House of Data. It requires that the data is created correctly, managed, and handled correctly. To build a literate business community with data it is critical to:
- Have data that is collected, transformed, and tested for accuracy and business fit.
- Data must be of high quality, trusted by the business community, and fit for purpose.
- It should be classified and categorized for privacy so all understand what can be used, shared, or highly restricted.
- It must be secured, and only made available to those who have a business purpose for it.
- Shared inside analytical applications and rolled out with appropriate support to be used properly.
- Exist within the policies established as an organization.
In short, data literacy needs a solid base, and then collaboration for effective and efficient usage.
Role of Bigeye in data literacy
Data literacy is important to Bigeye. As we help customers move forward their data programs through the building of their confidence in their data, we have a more limited focus on data literacy from a distant view. While in reality our renewed focus on AI trust aligns with what data literacy programs try to implement in the area of AI.
Bigeye does offer functionality to show data quality, an important concept for data literacy. Bigeye also shows where data comes from (provenance or root-cause analysis), but the real benefits are in the area of AI trust.
Bigeye illustrates the combination of data quality checks and lineage to better comprehend where data came from, how it is processed, and other information to increase the understanding of your data.
Summary
Data literacy should be embraced. The smarter staff is in regard to their data, their processes, and the possibilities that data can drive, the greater the benefits that will be realized.
References
- Oceans of Data Institute — Building Global Interest in Data Literacy
- American Library Association — Data Literacy
- The Data Literacy Project — Mindsets, Storytelling, and Communicating
- The Data Literacy Project — Assessment
- Tableau — What Is Data Literacy
- St. Louis Fed — Data Literacy for Librarians
- NNLM — Data Literacy Glossary
- Harvard Business School Online — Data Literacy
- IBM — Data Literacy Culture
- MIT Sloan — How to Build Data Literacy in Your Company
- NHSA — Data Literacy Credential
- TechTarget — Data Literacy Training Requires a Dual Approach
- Towards Data Science — What Is Data Literacy in 2025?
- FAS — Analytical Literacy First
- TechRepublic — What Is Data Literacy?
- EWSolutions — Foundations of Data Literacy
- South Carolina Department of Education — Data Literacy
Monitoring
Schema change detection
Lineage monitoring

