How MLIS Graduates Can Build Careers in Data Science

A practical guide to the skills, certifications, job titles, and salary expectations for library professionals entering data science roles.

By Meredith SimmonsReviewed by MLIS Academic Advisory TeamUpdated June 24, 202622 min read
Data Science Careers for MLIS Graduates: Skills & Paths

What you’ll learn in this article…

  • Data scientist roles are projected to grow 34% from 2024 to 2034, creating roughly 23,400 new openings each year.
  • MLIS graduates already possess communication, metadata, and research skills that employers in data science positions actively seek.
  • Several ALA-accredited MLIS programs now offer data science coursework, letting students build analytical skills within their library degree.
  • National median total pay for data scientists reaches $156,000, and librarians can pursue certifications to qualify for these roles.

The U.S. Bureau of Labor Statistics projects data scientist employment will grow 34% between 2024 and 2034, adding roughly 23,400 new positions each year. For MLIS graduates, that statistic carries particular weight: the competencies at the core of library science, including metadata management, information architecture, research data curation, and source evaluation, are the same foundational skills that distinguish strong data scientists from technically proficient but contextually weak ones.

The practical tension most MLIS professionals face is not whether data science is relevant to their field. It is how much technical depth to develop, and through which path, given that library roles and data science roles still require overlapping but distinct credential stacks. A data librarian at a research university and a data scientist at a financial services firm both work with structured datasets, but the job requirements, hiring pipelines, and salary ceilings differ substantially. Understanding what MLIS degree skills employers actually seek helps clarify where targeted data science training delivers the most value.

Glassdoor placed the median total pay for U.S. data scientists at $156,000 as of May 2026. That figure sits well above median salaries for most traditional library roles, which means the skills gap between librarianship and data science also represents a meaningful earnings gap for professionals willing to close it.

What Is a Data Science Librarian?

What exactly does a data science librarian do, and how does it differ from a traditional librarian role? A data science librarian bridges the fields of library and information science and computational data analysis, helping researchers manage, analyze, and preserve data. Unlike traditional librarians who focus on collections of published works, these professionals curate datasets, build data infrastructure, and teach data literacy skills. They often sit at the intersection of library services and campus research computing, providing hands-on support for everything from scripting to statistical modeling.

Common Role Variations

Job titles vary by setting, but three common paths illustrate the range:

  • Data Services Librarian: Typically an entry-level role requiring an ALA-accredited MLIS and experience with tools like R, Python, SPSS, or ArcGIS.1 These librarians advise on data management plans, help with statistical software, and collaborate with research offices and IT units. They apply knowledge of the research data lifecycle and RDM best practices.1
  • Research Data Manager: Often requiring an advanced degree in MLIS or information science, this role develops institutional policies for research data management and ensures compliance with funding mandates.4 It emphasizes infrastructure and long-term curation over public-facing reference, working behind the scenes to build systems that support data discovery and preservation.
  • Data Science Librarian: Sometimes listed without an explicit MLIS requirement but preferring one, this librarian acts as a campus data science educator.2 They provide expert consultative support in data science and visualization, using tools from the R and Python ecosystems.3 Their work often includes teaching workshops, building data applications, and embedding within data-intensive research teams.

In corporate and government contexts, similar roles may carry titles like information architect or knowledge manager, with a focus on big data pipelines or AI governance.

How MLIS Competencies Map to Data Science Workflows

Core library and information science competencies translate directly into data science tasks. Metadata standards and controlled vocabularies become the foundation for dataset documentation and discoverability. Digital preservation principles guide long-term storage and versioning of research data. Information retrieval expertise helps design search algorithms and data catalogs. User-centered design shapes intuitive data portals and self-service analytics tools. These library science skills make MLIS graduates uniquely prepared to bridge technical data work and user needs.

Why Libraries Need These Hybrid Roles

The World Economic Forum's Future of Jobs Report 2025 identifies AI and big data as the fastest-growing skill areas from 2025 to 2030, with over 90% of employers expecting increased demand.2 Libraries are feeling this urgency as researchers produce more data, seek help with machine learning applications, and need guidance on ethical data use. Data-focused librarians fill a critical gap, ensuring that academic and public institutions remain relevant in a data-driven world.

Why Data Science Skills Matter for LIS Professionals

The numbers make a compelling case for LIS professionals to take data science seriously. According to Bureau of Labor Statistics projections, data scientist roles are expected to grow roughly 34% between 2024 and 2034, far outpacing the national average of about 3% across all occupations.1 That translates to approximately 23,400 new positions opening each year through the decade, with total employment in the field expected to climb by more than 82,000 jobs.3 Data science currently ranks among the fastest-growing occupations in the entire U.S. economy.

Librarianship, by contrast, follows a more modest growth trajectory. While the profession remains stable, it does not share the same explosive demand curve. For library professionals who want to stay ahead of a shifting job market, building data science competencies offers a genuine career multiplier rather than a peripheral credential. MLIS career outcomes span a wide range of sectors, and data skills are increasingly appearing at the top of employer wish lists across many of them.

A Global Signal, Not Just a U.S. Trend

The demand for data skills extends well beyond American borders. The World Economic Forum's Future of Jobs Report 2025 found that 87% of employers globally see AI and big data as increasingly critical capabilities. Over 90% of survey respondents anticipated that demand for these skills would rise through 2030, spanning sectors from healthcare and financial services to government, energy, and telecommunications. AI and big data rank as the single fastest-growing skills cluster projected for the second half of this decade.

For information professionals, this is not an abstract trend. Academic libraries, research institutions, and government agencies are all navigating data-intensive environments, and they increasingly need staff who can bridge the gap between raw data and meaningful decisions.

The LIS Competitive Advantage

What separates MLIS graduates from purely technical data science candidates is a foundation that most computer science programs do not teach. MLIS vs. computer science degrees differ most visibly here: library professionals are trained in information architecture, metadata standards, controlled vocabularies, and ethical data stewardship, the very competencies that data teams frequently struggle to source.

A data scientist who can build a predictive model is valuable. One who can also structure a research data repository, document provenance, and apply privacy frameworks to a sensitive dataset is considerably harder to replace. MLIS graduates who layer technical data skills onto their existing knowledge base do not simply compete in the data job market; they offer a profile that pure technologists cannot easily replicate.

Essential Data Science Skills for Librarians

MLIS graduates already bring strong foundations in information organization, metadata, and research support. Building targeted data science competencies on top of those strengths opens doors to roles such as data librarian, information architect, and research data manager. The table below maps core data science skill areas to practical library applications, drawing on both technical and workplace competencies recognized across the field.

Skill CategorySpecific SkillsLibrary Application Example
ProgrammingPython, R, SQLWriting scripts to clean, transform, and analyze research datasets for institutional repositories
Statistics and MathematicsDescriptive statistics, probability, regression analysisEvaluating collection usage patterns or analyzing survey data for library assessment reports
Data VisualizationTableau, matplotlib, ggplot2, dashboard designCreating interactive dashboards that display circulation trends, database usage, or patron demographics
Machine LearningSupervised and unsupervised learning, model evaluationBuilding recommendation systems that suggest resources to patrons based on borrowing history
Data Curation and ManagementMetadata standards, data documentation, FAIR principlesOrganizing and preserving research datasets so they remain discoverable, accessible, and reusable over time
Big Data ToolsHadoop, Spark, SASProcessing large-scale digitized text collections or web-harvested archival materials for computational analysis
Communication and StorytellingData-driven presentations, report writing, audience adaptationTranslating complex usage analytics into clear narratives for library board meetings or grant proposals
Critical Thinking and Problem SolvingHypothesis formation, root cause analysis, analytical reasoningIdentifying gaps in digital collections by examining access logs and user feedback to guide acquisition decisions
Business Acumen and StrategyStakeholder analysis, project scoping, ROI assessmentMaking the case for new data services by connecting proposed initiatives to institutional research priorities
Teamwork and CollaborationCross-functional coordination, version control (Git), documentationPartnering with data engineers, faculty researchers, and IT staff on campus-wide research data infrastructure projects
Adaptability and CuriosityContinuous learning, emerging technology evaluationStaying current with AI and big data developments to integrate new tools into library instruction and reference services

How Data Science Fits Into an MLIS Degree

An increasing number of ALA-accredited MLIS programs now embed data science directly into their curricula, making it possible to graduate with both library expertise and analytical skills employers demand.

Programs That Bridge LIS and Data Science

At the University of Denver, the MLIS offers a Research Data Management concentration, with courses like Data Visualization that prepare students to curate and communicate complex data.1 The OU MLIS program provides a full Data Science concentration within its MLIS, equipping graduates for roles at the intersection of information and analytics.2 Wayne State University features a Data Analytics concentration, while the University of Kentucky's MSLS program includes a Data Science Elective Cluster, allowing students to tailor their degree with coursework in areas such as programming and machine learning.2 Rutgers University's Master of Information program goes further with a dedicated Data Science concentration, and the University of Michigan's MSI highlights Data Science and Analytics as a special strength.3 These pathways share a common goal: blending traditional LIS skills with computational methods.

Typical Course Sequence

Most data-focused MLIS tracks begin with foundational courses in metadata, information organization, and research methods. From there, students move into data-specific electives such as Python for information professionals, data visualization, and introductory machine learning. A capstone project or practicum typically applies these skills to a real-world information challenge, often in a library, archive, or research setting, solidifying the connection between theory and practice.

Dual Degrees and Certificates

For deeper technical training, several schools offer formal dual-degree options. The University of North Texas, for example, pairs its MS in Library Science with a master's in Data Science, producing graduates ready for data stewardship roles.2 Other institutions allow students to add a graduate certificate in data analytics or applied statistics alongside the MLIS, creating a stackable credential without extending time to degree substantially. If you are still weighing program options, how to choose a library science program can help you evaluate concentrations and electives against your career goals.

A Direct Path to Data-Focused Roles

For anyone asking whether an information science degree can lead to data-oriented library positions, the answer is clearly yes. The ALA-accredited programs described here explicitly brand their data pathways to prepare students for jobs such as data services librarian, research data manager, and information architect. As libraries and cultural heritage organizations increasingly rely on data-driven decision making, these integrated curricula ensure that MLIS graduates are not only eligible but competitive.

Data Science Certifications and Training for Librarians

Building data science competencies does not always require a full degree program. A growing number of certifications and training opportunities cater specifically to library and information professionals who want to expand their skill sets without returning to graduate school. These programs range from free, community-driven workshops to paid professional certificates backed by major technology companies.

Library Carpentry Workshops

Library Carpentry offers hands-on workshops designed to introduce library staff to core data science skills, including working with structured data, using version control systems like Git, and automating repetitive tasks with scripting languages.1 These workshops are typically two-day events offered in both in-person and virtual formats, and they are free to participants. The curriculum emphasizes practical, task-based learning tailored to the workflows and challenges librarians encounter daily, such as cleaning bibliographic data or managing digital collections. Library Carpentry is a community initiative under The Carpentries umbrella, and its sessions are widely recognized in academic and research library settings as valuable professional development.

For those interested in leading their own workshops, Library Carpentry also offers a free instructor certification program.2 This online training takes approximately three days to complete, including a combination of video lessons and practical tasks. Certified instructors join a global network and gain the credentials to teach Library Carpentry workshops at their own institutions or conferences. This pathway is particularly appealing for librarians who want to build capacity in their departments or regions.

Professional Certificates from Online Platforms

Beyond library-specific offerings, several online learning platforms provide data science certificates that translate well to library roles. The Google Data Analytics Professional Certificate and the IBM Data Science Professional Certificate are among the most widely recognized. Both programs cost approximately $49 per month and can be completed in three to six months depending on weekly time commitment. These certificates cover foundational topics such as data cleaning, visualization with tools like Tableau or R, and introductory programming in Python or SQL. Employers in academic and corporate libraries increasingly value these credentials as evidence of applied technical skills.

DataCamp is another popular option, offering monthly subscription access (around $25 per month) to hundreds of short courses in data science, machine learning, and programming. While not a formal certificate program, DataCamp allows librarians to build targeted skills at their own pace, making it a flexible choice for busy professionals.

University Micro-Credentials and Stackable Certificates

Some universities now offer micro-credentials or stackable certificates in data science that can be completed alongside or after an MLIS degree. These programs, often priced between $1,000 and $5,000, provide focused training in areas like data visualization, statistical analysis, or research data management. Universities such as Syracuse, Illinois, and North Carolina have begun piloting such offerings for their library science students, recognizing the demand for hybrid skill sets in the job market. These credentials are especially valuable when paired with an ALA-accredited MLIS, as they signal both information expertise and technical fluency to prospective employers.

Questions to Ask Yourself

Do you already use tools like Excel or OpenRefine for data cleaning in your current role?
If you routinely wrangle messy metadata, deduplicate records, or normalize datasets, you have foundational data wrangling experience that translates directly to data science workflows. Recognizing these existing skills helps you gauge how much additional training you actually need.
Have you supported researchers with data management plans or institutional repository submissions?
Experience guiding faculty through data lifecycle practices means you already understand research data governance, a competency many data scientists lack. This background positions you for roles like research data manager or data services librarian without starting from scratch.
Are you comfortable committing to learning a programming language such as Python or R if you had a structured, three month roadmap?
Data science careers increasingly require at least basic scripting ability. Honestly assessing your willingness to invest focused study time will help you choose between certificate programs that teach coding and those that emphasize no code analytics platforms.

Data Science Career Paths and Job Titles for MLIS Graduates

MLIS graduates can follow a clear career ladder in data science, whether they remain in academic or research libraries or pivot to corporate, government, or technology roles. The pathway below outlines typical progression, but career paths can diverge at any stage. Some professionals deepen their expertise within library systems, while others leverage their information science background to move into industry data roles such as data officer, systems analyst, or UX researcher.

Three-stage career progression for MLIS graduates in data science, from entry-level data services librarian through mid-career research data manager to senior head of data services, with credentials and salary bands at each level

Data Scientist Salary Overview: National, State, and Metro Benchmarks

The table below presents national salary benchmarks for data scientists (SOC 15-2051) drawn from the Occupational Employment and Wage Statistics program published by the U.S. Bureau of Labor Statistics, using 2024 data. These figures reflect the broader data scientist occupation, not exclusively MLIS-trained professionals, but they represent the realistic pay range that MLIS graduates can reach when they move into data science roles. For additional context, Glassdoor reported a median total pay of $156,000 for U.S. data scientists as of May 2026, a figure that includes base salary plus bonuses and other compensation. The difference between the BLS median annual wage of $112,590 and the Glassdoor total pay figure suggests that supplemental compensation (bonuses, equity, profit sharing) can meaningfully boost overall earnings in this field.

MetricValue
Total National Employment233,440
Mean Annual Wage$124,590
25th Percentile Annual Wage$82,630
Median Annual Wage$112,590
75th Percentile Annual Wage$155,810
Glassdoor Median Total Pay (May 2026)$156,000

Data Scientist Salary by State

Salaries for data scientists vary widely depending on location, with coastal tech hubs and major metro areas commanding significantly higher pay. The table below draws from the most recent Occupational Employment and Wage Statistics published by the U.S. Bureau of Labor Statistics (2024 data) and covers selected states across the pay spectrum. MLIS graduates considering data science roles should weigh these figures against local cost of living when evaluating opportunities.

StateTotal Employment25th PercentileMedian Salary75th PercentileMean Salary
Washington7,930$108,530$158,760$199,620$162,730
District of Columbia3,580$102,440$137,120$208,600$148,170
California36,850$98,460$136,800$177,480$155,450
Massachusetts9,990$103,130$132,250$167,710$139,670
Michigan6,450$78,410$99,470$124,070$102,030
New Hampshire870$78,550$98,970$124,520$102,230
Ohio5,510$71,660$98,620$126,470$100,260
Iowa2,330$68,910$97,980$126,310$97,300
Nebraska1,850$81,400$96,470$108,940$97,720
North Dakota320$74,690$96,230$120,270$94,130
West Virginia220$72,250$95,760$118,820$102,770
Maine1,040$76,020$94,350$127,230$101,890
Kentucky1,700$74,110$93,490$123,150$100,410
Nevada1,230$67,520$93,310$122,200$98,820
South Dakota270$78,590$92,000$109,940$92,460
South Carolina2,810$73,050$90,660$108,610$96,970
Missouri4,350$61,910$85,570$123,180$95,310
New Mexico510$62,310$85,040$104,290$86,520
Indiana3,120$66,800$84,050$110,770$92,620
Oklahoma2,160$66,500$80,380$107,520$89,940
Alaska210$56,020$77,400$116,470$88,690
Louisiana1,120$56,070$70,530$80,940$71,740
Mississippi270$53,980$69,430$97,150$77,830

The American Library Association's Privacy: An Interpretation of the Library Bill of Rights establishes that libraries must require informed, opt-in consent before collecting patron data. This longstanding commitment to privacy and ethical data handling positions MLIS graduates uniquely in data science careers, where algorithmic transparency and responsible research data stewardship are increasingly valued.

How to Transition From Traditional Librarianship to Data Science

Moving from traditional library work into data science roles does not require abandoning your profession or starting from scratch. Many academic and research libraries are actively creating data-focused positions, and your existing skills in information organization, research support, and ethical information handling provide a strong foundation. The key is building technical competencies strategically while leveraging what you already bring to the table.

Building a 6 to 12 Month Skills Roadmap

A practical transition plan begins with foundational data literacy and progresses toward project-based learning. Start with Library Carpentry workshops, which teach data skills specifically for library and information professionals. These free, community-driven sessions cover shell scripting, version control with Git, and data cleaning techniques tailored to library contexts.

After completing introductory workshops, add a structured Python or R course. Both languages are widely used in data science, and either will serve you well. Python tends to be more versatile for general data science applications, while R is particularly strong for statistical analysis and research data work common in academic settings.

The final phase involves building two or three portfolio projects using real library data. Consider analyzing circulation patterns to identify collection development opportunities, conducting metadata quality audits to demonstrate data cleaning skills, or creating research data visualizations that showcase your ability to communicate findings effectively. These projects give you concrete evidence of your capabilities when applying for data-focused roles.

Gaining Experience Within Your Current Position

You do not need to wait for a new job to start building relevant experience. Volunteer to review data management plans for researchers at your institution, as this work combines your information expertise with data lifecycle knowledge that employers value. Propose a pilot data visualization dashboard for your library, perhaps tracking reference transactions, database usage, or interlibrary loan trends. Contributing to open-source library data tools through platforms like GitHub demonstrates both technical skills and collaborative work habits.

Positioning Your Ethics Training as a Differentiator

Most data science job candidates lack formal training in information ethics, which gives MLIS graduates a significant advantage. Your familiarity with ALA privacy guidelines, patron confidentiality principles, and frameworks like GDPR awareness positions you to address growing concerns about responsible data use. Employers increasingly recognize that technical skills without ethical grounding create organizational risk, making your background particularly valuable in healthcare, government, and educational settings where data sensitivity is paramount. Understanding ethics of AI in libraries only deepens that advantage as machine learning tools become embedded in institutional workflows.

Connecting With the Data Librarianship Community

Networking accelerates your transition by connecting you with professionals who have navigated similar paths. IASSIST (the International Association for Social Science Information Services and Technology) hosts conferences and maintains active discussion forums for data professionals. Joining the right library associations for MLIS students can surface job leads, mentorship connections, and committee work that builds your professional profile quickly. The Research Data Alliance brings together practitioners working on research data challenges globally. Data science librarian Slack communities offer informal spaces to ask questions and share resources. Regional library data interest groups, often organized through state library associations, provide local networking opportunities and may lead to collaborative projects or job leads.

Frequently Asked Questions About Data Science for Librarians

Below are answers to some of the most common questions prospective and current librarians ask about breaking into data science. These responses reflect current labor market data, program trends, and the practical realities of blending library and information science expertise with data science competencies.

How much does a data librarian make?
Salaries vary by employer type, location, and experience level. Data librarians in academic settings typically earn in the range reported for specialized librarian roles, while those who move into full data scientist positions can earn considerably more. Glassdoor reported a median total pay of $156,000 for data scientists in the U.S. as of May 2026. Librarians who add data science skills often see meaningful salary gains compared to traditional library roles.
Can you be a librarian with an information science degree?
Yes. Many library positions accept degrees in information science, and an ALA-accredited MLIS (Master of Library and Information Science) program qualifies graduates for most professional librarian roles. Because information science curricula emphasize data organization, metadata, and information retrieval, graduates are well positioned for data-intensive library work such as research data management and digital preservation.
What skills do librarians need for data science?
Core technical skills include statistics, programming in Python or R, data visualization, and machine learning fundamentals. Equally important are workplace skills such as communication, critical thinking, storytelling with data, and problem solving. Librarians already bring strong foundations in information organization and research support, so the transition often centers on building quantitative and coding proficiency.
What is the job outlook for data scientists?
The outlook is exceptionally strong. The U.S. Bureau of Labor Statistics projects 34% job growth for data scientists from 2024 to 2034, translating to roughly 23,400 new positions each year. The World Economic Forum's Future of Jobs Report 2025 ranks AI and big data as the fastest growing skills for 2025 to 2030, with over 90% of surveyed employers expecting rising demand across sectors including healthcare, government, and information technology.
What certifications help librarians learn data science?
Popular options include the Google Data Analytics Professional Certificate, the IBM Data Science Professional Certificate, and the Coursera Data Science Specialization from Johns Hopkins. Certifications in Python, R, SQL, or machine learning through platforms like edX or DataCamp can also strengthen a librarian's profile. These credentials complement an MLIS by adding verifiable technical skills without requiring a second full degree.
How does data science fit into an MLIS degree?
Many ALA-accredited MLIS programs now offer elective tracks or concentrations in data science, data analytics, or digital scholarship. Coursework may cover data curation, research data management, text mining, and introductory programming. Some programs partner with computer science or statistics departments to provide cross-listed courses. Students interested in data science careers should look for programs that include hands-on projects with real datasets.
Do I need to know how to code to become a data science librarian?
Some coding ability is increasingly expected, though you do not need to be a software engineer. Familiarity with Python or R is the most common requirement, especially for tasks like data cleaning, analysis, and visualization. Many librarians learn these skills through online courses, workshops, or certificate programs while working. Starting with a single language and building proficiency over time is a practical approach.

Recent News

Recent Articles