Secure CheckoutPersonal information is secured with SSL technology.
Free ShippingFree global shipping
No minimum order.
- Author Biography
- Chapter 1. Introduction
- 1.1 Why Bother?
- 1.2 What Is Data Repurposing?
- 1.3 Data Worth Preserving
- 1.4 Basic Data Repurposing Tools
- 1.5 Personal Attributes of Data Repurposers
- Chapter 2. Learning from the Masters
- 2.1 New Physics from Old Data
- 2.2 Repurposing the Physical and Abstract Property of Uniqueness
- 2.3 Repurposing a 2,000-Year-Old Classification
- 2.4 Decoding the Past
- 2.5 What Makes Data Useful for Repurposing Projects?
- Chapter 3. Dealing with Text
- 3.1 Thus It Is Written
- 3.2 Search and Retrieval
- 3.3 Indexing Text
- 3.4 Coding Text
- Chapter 4. New Life for Old Data
- 4.1 New Algorithms
- 4.2 Taking Closer Looks
- 4.3 Crossing Data Domains
- Chapter 5. The Purpose of Data Analysis Is to Enable Data Reanalysis
- 5.1 Every Initial Data Analysis on Complex Datasets Is Flawed
- 5.2 Unrepeatability of Complex Analyses
- 5.3 Obligation to Verify and Validate
- 5.4 Asking What the Data Really Means
- Chapter 6. Dark Legacy: Making Sense of Someone Else’s Data
- 6.1 Excavating Treasures from Lost and Abandoned Data Mines
- 6.2 Nonstandard Standards
- 6.3 Specifications, Not Standards
- 6.4 Classifications and Ontologies
- 6.5 Identity and Uniqueness
- 6.6 When to Terminate (or Reconsider) a Data Repurposing Project
- Chapter 7. Social and Economic Issues
- 7.1 Data Sharing and Reproducible Research
- 7.2 Acquiring and Storing Data
- 7.3 Keeping Your Data Forever
- 7.4 Data Immutability
- 7.5 Privacy and Confidentiality
- 7.6 The Economics of Data Repurposing
- Appendix A. Index of Case Studies
- Appendix B. Glossary
Repurposing Legacy Data: Innovative Case Studies takes a look at how data scientists have re-purposed legacy data, whether their own, or legacy data that has been donated to the public domain.
Most of the data stored worldwide is legacy data—data created some time in the past, for a particular purpose, and left in obsolete formats. As with keepsakes in an attic, we retain this information thinking it may have value in the future, though we have no current use for it.
The case studies in this book, from such diverse fields as cosmology, quantum physics, high-energy physics, microbiology, psychiatry, medicine, and hospital administration, all serve to demonstrate how innovative people draw value from legacy data. By following the case examples, readers will learn how legacy data is restored, merged, and analyzed for purposes that were never imagined by the original data creators.
- Discusses how combining existing data with other data sets of the same kind can produce an aggregate data set that serves to answer questions that could not be answered with any of the original data
- Presents a method for re-analyzing original data sets using alternate or improved methods that can provide outcomes more precise and reliable than those produced in the original analysis
- Explains how to integrate heterogeneous data sets for the purpose of answering questions or developing concepts that span several different scientific fields
Primary Market Data scientists, Big Data curators, Statisticians, Researchers; Secondary Market: graduate level students in computer science, statistics, information sciences
- No. of pages:
- © Elsevier 2015
- 17th March 2015
- Paperback ISBN:
- eBook ISBN:
Jules Berman holds two bachelor of science degrees from MIT (Mathematics, and Earth and Planetary Sciences), a PhD from Temple University, and an MD, from the University of Miami. He was a graduate researcher in the Fels Cancer Research Institute, at Temple University, and at the American Health Foundation in Valhalla, New York. His post-doctoral studies were completed at the U.S. National Institutes of Health, and his residency was completed at the George Washington University Medical Center in Washington, D.C. Dr. Berman served as Chief of Anatomic Pathology, Surgical Pathology and Cytopathology at the Veterans Administration Medical Center in Baltimore, Maryland, where he held joint appointments at the University of Maryland Medical Center and at the Johns Hopkins Medical Institutions. In 1998, he transferred to the U.S. National Institutes of Health, as a Medical Officer, and as the Program Director for Pathology Informatics in the Cancer Diagnosis Program at the National Cancer Institute. Dr. Berman is a past President of the Association for Pathology Informatics, and the 2011 recipient of the association's Lifetime Achievement Award. He is a listed author on over 200 scientific publications and has written more than a dozen books in his three areas of expertise: informatics, computer programming, and cancer biology. Dr. Berman is currently a free-lance writer.
Freelance author with expertise in informatics, computer programming, and cancer biology
Elsevier.com visitor survey
We are always looking for ways to improve customer experience on Elsevier.com.
We would like to ask you for a moment of your time to fill in a short questionnaire, at the end of your visit.
If you decide to participate, a new browser tab will open so you can complete the survey after you have completed your visit to this website.
Thanks in advance for your time.