Database linking

About database linking

How Elsevier is connecting data and research articles on ScienceDirect

At Elsevier we are keen to work with researchers and data repositories to ensure that data that is relevant for scientific, technical and medical research can be easily discovered and accessed.

One of the ways in which we are doing this is by creating bidirectional links between data repositories and online articles on ScienceDirect. This provides ScienceDirect's readers with one-click access to relevant, trusted data that may help to validate research or drive further investigations.

Benefits for authors: Linking makes your articles and data more discoverable, attracting more usage. Sharing the data that underpins your conclusions is not only good scientific practice; it is also increasingly required by funding bodies.

Benefits for readers: Linking provides essential context for data sets and improves the reproducibility of published research.

Depositing data at a repository

Elsevier encourages authors to deposit raw experimental data at relevant data repositories. Instructions for authors depend on the data repository: in some cases curators extract data from the article, while in other cases authors need to upload their data manually. Detailed information is available from the individual data repositories named in the listing of supported databases.

How data and articles are linked

There are several ways in which we support interlinking of articles and data:

  • Referencing data in your article through tagging identifiers or accession numbers: If your article contains relevant unique identifiers or accession numbers linking to information on genes, proteins, diseases, etc. or structures deposited in public databases, and you would like your article to link to that data, please identify these entities in the following way:

       database abbreviation: data identifier

    For example, "PDB: 1TUP" to identify the protein with accession number "1TUP" in the Protein Data Bank (PDB). Please bear in mind that an error in a letter or number will result in a dead link in the article. Database abbreviations and further examples can be found in the listing of supported databases.

  • Data DOIs: Elsevier supports data DOIs as persistent identifiers for scientific data. If you include a data DOI in your article, it will automatically turn into a link to your data on ScienceDirect.

  • Linked data repository banners on ScienceDirect: Elsevier collaborates with selected data repositories to show banner links next to relevant articles on ScienceDirect. This linking system requires that the data repository maintains accurate records of associations between articles and data sets. What you need to do as an author to support this type of linking depends on the data repository; see links to more information in the supported databases section.

  • Data visualization and integration applications: In close collaboration with selected data repositories, Elsevier has developed a number of data-integration and visualization applications that are shown next to the article on ScienceDirect, e.g. the Protein Viewer (with PDB), the PANGAEA data visualization tool, and the Genome Viewer (with NCBI). These applications build further on tagged entities or banner links to visualize data and integrate it into the online reading experience.
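
The tagging convention and the data DOI convention described above can be illustrated with a short Python sketch. It is not how ScienceDirect processes manuscripts; it only shows how text following the "database abbreviation: data identifier" pattern could be checked before submission. The names `KNOWN_ABBREVIATIONS`, `find_tags` and `find_doi_links` are hypothetical, the abbreviation set is a small subset taken from the examples in this document, and the DOI in the sample text is made up.

```python
import re

# Illustrative subset of abbreviations from the examples in this document;
# the authoritative list is the publisher's listing of supported databases.
KNOWN_ABBREVIATIONS = {"PDB", "GEO", "GenBank", "OMIM", "UniProt", "MGI", "CRAN"}

# "abbreviation: identifier", e.g. "PDB: 1TUP". Simplification: single-word
# abbreviations only, so "NCBI Taxonomy: 48184" is not handled here.
TAG_PATTERN = re.compile(r"\b([A-Za-z]+):\s?([A-Za-z0-9_-]+)")

# Loose DOI shape: "10.", a registrant code, a slash, then a suffix.
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")


def find_tags(text):
    """Return (abbreviation, identifier) pairs for known databases."""
    return [
        (m.group(1), m.group(2))
        for m in TAG_PATTERN.finditer(text)
        if m.group(1) in KNOWN_ABBREVIATIONS
    ]


def find_doi_links(text):
    """Return doi.org resolver URLs for DOI-like strings found in the text."""
    links = []
    for m in DOI_PATTERN.finditer(text):
        doi = m.group(0).rstrip(".,;)")  # drop trailing sentence punctuation
        links.append("https://doi.org/" + doi)
    return links


if __name__ == "__main__":
    sample = (
        "The structure (PDB: 1TUP) and expression data (GEO: GSE27196) "
        "were used. Data: 10.1234/example.5678."  # hypothetical DOI
    )
    print(find_tags(sample))
    print(find_doi_links(sample))
```

A check like this only confirms that a tag is well formed. It cannot tell whether the accession number itself is correct, so a mistyped letter or digit would still produce a dead link.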

Further reading

For more information on Elsevier's data-linking program, please see:

Supported data repositories

If you are a data repository manager interested in setting up bidirectional linking with Elsevier publications, please contact us at articleofthefuture@elsevier.com.


Crystallography

Data Repository | How articles and data are linked
Cambridge Crystallographic Data Centre (CCDC) | CCDC banners will be shown on ScienceDirect when the repository has data for the article.

Clinical Trials

Data Repository | How articles and data are linked
ClinicalTrials.gov (NCT) | Authors should specify NCT accession numbers, e.g. NCT: NCT00222573.

Earth, Environmental & Oceanographic Data

Data Repository | How articles and data are linked
Atmospheric Radiation Measurement (ARM) Data Archive | Authors should include data DOIs in their manuscript, following the repository's citation instructions.
BGS GeoScenic | Authors should specify BGS GeoScenic numbers, e.g. GeoScenic: P603281.
EarthChem | EarthChem banners will be shown on ScienceDirect when the repository has data for the article. Data is extracted from the literature by curators.
Marine Geoscience Data System (MGDS) | MGDS banners will be shown on ScienceDirect when the repository has data for the article.
Natural Environment Research Council (NERC), including BADC, BODC, EIDC and NGDC | Authors should include data DOIs in their manuscript.
Oak Ridge National Laboratory (ORNL) Distributed Active Archive Center (DAAC) | ORNL-DAAC banners will be shown on ScienceDirect when the repository has data for the article.
PANGAEA | A data-integration application opens automatically on ScienceDirect for relevant articles.
System for Earth Sample Registration (SESAR), registry for International Geo Sample Numbers (IGSN) | Authors should specify IGSN numbers, e.g. IGSN: HRV003M16.
Woods Hole Open Access Server (WHOAS) | WHOAS banners will be shown on ScienceDirect when the repository has data for the article.

Engineering & Materials Standards

Data Repository | How articles and data are linked
ASTM (formerly American Society for Testing and Materials) | Authors should specify ASTM identifiers, e.g. ASTM: G63.

Genes & Gene Expression

Data Repository | How articles and data are linked
GenBank | Authors should specify GenBank accession numbers, e.g. GenBank: BA123456.
Gene Expression Omnibus (GEO) | Authors should specify GEO accession numbers, e.g. GEO: GSE27196; GEO: GPL5366; GEO: GSM9853.

Genes & Medicine

Data Repository | How articles and data are linked
Online Mendelian Inheritance in Man (OMIM) | Authors should specify OMIM accession numbers, e.g. OMIM: 601240.

Life Sciences

Data Repository | How articles and data are linked
DRYAD | DRYAD banners will be shown on ScienceDirect when the repository has data for the article.
EMBL-EBI OLS Molecular Interaction Ontology (MI) | Authors should specify EMBL-EBI OLS accession numbers, e.g. MI: 0218.
NCBI Taxonomy | Authors should specify NCBI Taxonomy accession numbers, e.g. NCBI Taxonomy: 48184.

Model Organisms

Data Repository | How articles and data are linked
The Arabidopsis Information Resource database (TAIR) | Authors should specify TAIR accession numbers, e.g. TAIR: AT1G01020.
EMAGE | EMAGE banners will be shown on ScienceDirect when the repository has data for the article.
FlyBase | Authors should specify FlyBase accession numbers, e.g. FlyBase: FBgn0036925.
Mouse Genome Informatics (MGI) | Authors should specify MGI accession numbers, e.g. MGI: 2448567. In addition, MGI banners will be shown on ScienceDirect when the repository has data for the article.
Rat Genome Database (RGD) | Authors should specify RGD accession numbers, e.g. RGD: 67383. In addition, RGD banners will be shown on ScienceDirect when the repository has data for the article.
Saccharomyces Genome Database (SGD) | SGD banners will be shown on ScienceDirect when the repository has data for the article.
WormBase | Authors should specify WormBase accession numbers, e.g. WB Gene: WBGene00010049, WB Strain: RB877, WB Protein: WP:CE02807, WB Protein: CE09620.
The Zebrafish Model Organism Database (ZFIN) | Authors should specify ZFIN accession numbers, e.g. ZFIN: ZDB-GENO-960809-7, ZFIN: ZDB-GENE-040426-2439, ZFIN: ZDB-PUB-010810-1.

Physics & Astronomy

Data Repository | How articles and data are linked
NASA/IPAC Extragalactic Database (NED) | NED banners will be shown on ScienceDirect when the repository has data for the article.
HepData (Durham Reaction Database) | HepData banners will be shown on ScienceDirect when the repository has data for the article.
SIMBAD Astronomical Database | SIMBAD banners will be shown on ScienceDirect when the repository has data for the article. Data is extracted from the literature by curators.
ThermoML (NIST) | ThermoML text links will be shown on ScienceDirect when the repository has data for the article. Data is extracted from the literature by curators.

Proteins

Data Repository | How articles and data are linked
Molecular Interactions Database (MINT) | Authors should specify MINT accession numbers, e.g. MINT: 6166710.
Protein Data Bank (PDB) | Authors should specify PDB accession numbers, e.g. PDB: 1TUP. Protein structures are visualized using a Protein Viewer application.
Universal Protein Resource Knowledgebase (UniProt) | Authors should specify UniProt accession numbers, e.g. UniProt: Q9H0H5.

Reagents

Data Repository | How articles and data are linked
Antibody Registry | Authors should specify Antibody Registry accession numbers, e.g. AntibodyRegistry: AB_1015010.

Software & Computer Code

Data Repository | How articles and data are linked
Comprehensive R Archive Network (CRAN) | Authors should specify CRAN library names, e.g. CRAN: pso.
ModelDB | ModelDB banners will be shown on ScienceDirect when the repository has data for the article.
RunMyCode | RunMyCode banners will be shown on ScienceDirect when the repository has data for the article.