Data use in the library has specific characteristics and common problems. Data Clean-up and Management addresses these, and provides methods to clean up frequently-occurring data problems using readily-available applications. The authors highlight the importance and methods of data analysis and presentation, and offer guidelines and recommendations for a data quality policy. The book gives step-by-step how-to directions for common dirty data issues.

Key Features

  • Focused towards libraries and practicing librarians
  • Deals with practical, real-life issues and addresses common problems that all libraries face
  • Offers cradle-to-grave treatment for preparing and using data, including download, clean-up, management, analysis and presentation


Practitioners and students of Library and Information Science

Table of Contents

  • List of figures
  • List of tables
  • About the authors
  • Chapter 1: Introduction (why this book is needed)
    • Abstract:
    • What makes this book unique?
    • Why library data is important
    • The book’s outline
  • Chapter 2: Commonalities
    • Abstract:
    • Microsoft Office Excel
    • MarcEdit
    • Microsoft Access
    • XML
    • Commonalities
    • Capture and use
    • Standardization
    • Data import issues
    • Technical skills
    • Project management challenges
  • Chapter 3: Defining data
    • Abstract:
    • Rule 1: define data points
    • Rule 2: apply data point definitions
    • Rule 3: count the right apples
    • Rule 4: avoid capturing redundant data
  • Chapter 4: Types of data issues
    • Abstract:
    • Microsoft Excel vs Microsoft Access
    • General data-handling edicts
    • Data issues: importing data
  • Chapter 5: Microsoft Excel techniques
    • Abstract:
    • Creating datasheets
    • Selecting cells
    • Copying
    • Sorting
    • Filter
    • AutoSum
    • Sum
    • Fill
  • Chapter 6: Data clean-up in Excel
    • Abstract:
    • Common dirty data scenarios
    • The usefulness of delimiting
    • System limitations
    • Removing extra characters
  • Chapter 7: Excel: combining data
    • Abstract:
    • IF statements
    • The TEXT function
    • PivotTables and filtering
    • MATCH
  • Chapter 8: Additional tools
    • Abstract:
    • PDFs
    • Notepad
    • Microsoft Word
    • Global update in an integrated library system
    • Regular expressions
    • Excel
    • Access
    • Macros
    • XML
    • MarcEdi


No. of pages:
© 2012
Chandos Publishing
Print ISBN:
Electronic ISBN:

About the editors

Margaret Hogarth

Margaret Hogarth is Electronic Resources Coordinator and Subject Specialist for Environmental Sciences, Water and Soils for the University of California, Riverside Libraries. She has a B.A. in English from the University of California, Santa Barbara, an MLIS from San Jose State University and an M.S. in Environmental Studies from California State University, Fullerton. She has been a librarian since 1999.

Kenneth Furuta

Kenneth Furuta, a Reference/Information Technology librarian for the University of California, Riverside Libraries. His B.A. was a double major in Psychology & Music from the University of California, Riverside. In addition to his MLS from the University of Arizona he has a Masters of Administration (emphasis in Management Information Systems) from UC Riverside. He has been a librarian since 1990.

Affiliations and Expertise

University of California, USA


"This excellent book is highly recommended; it will be keenly sought out and become a well-used tool. Whilst a key text for students of this subject, it is also particularly suited for staff using Excel, Access and other applications to improve the quality of the information available for decision-making."--The Australian Library Journal, Vol. 63, No. 1, 2014