Dataset

Chemical

(properties of chemical or drugs)
**//**PubChem_Compound**//** (compound information)
ChEBI (compound information)
ChemDB

Chemogenomic (Interacting)

(Interacting between compound and proteins or compound and biological system )
**PubChem_BioAssay** (compound, assay, target, binding affinity)
**DrugBank**(drug, target) (existing rdf)
BindingDB (drug, target, binding affinity)
PharmGKB (gene, drug, disease, relation, pathway)
STITCH (chemical protein interaction)
CTD (comparative toxicogenomics, chemical-gene-disease network)
SuperTarget (MATADOR, protein-chemical interactions)
KEGG Ligand & Drug
TTD (Therapeutic Target Database)
GLIDA (GPCR)
QSAR dataset 1 (from cheminformatics.org, including various small sets)
QSAR dataset 2 (from QSAR world, including various small sets)
PDSP (ki)
Chembl

Chembank (chem & cell line)
CEBS (chemical effects in biological system, Microarray)
ARRAYEXPRESS ( functional genomic data, Microarray)
CEO(Gene Eexpression Omnibus, Microarray)
HMDB(Human Metabolome Database)

**//Key properties//**
data structure

Biology

(protein or gene informations)
GeneCards (gene)
**SWISSPROT** (protein)
PDB (protein)
UniProt
GO (Gene Ontology)
HGNC (Human Gene Symbol)

Systems

(biological network or pathway)
**HPRD** (protein protein interaction)
DIP (protein protein interaction)
STRING (protein protein interaction)
**KEGG** (pathway)
Reactome (pathway)
PID (Pathway interaction)

Phenotype

(disease, adverse drug reaction, toxicity)
**OMIM** (human disease)
HGMD (Human gene mutation database)
ADRES (Adverse drug reaction from FDA)
SIDER (side effects)
DiDB (Metabolism & Transport Drug Interaction Database)

PubMed(used for literature validation)

Toxicology

opentox (a starting effort)

Glycomics

Cazy

Others

bio2rdf rdfs
LODD dataset
pathguide(a comprehensive path resource list)
LOD (linked open data)
Linked Life Data