DATABASE Group member: Nurul Aisyah Afifah Liyana Noor Atiqah Norfariza Ronalizah

Importants of Databases

Uniprot

important to find out the sequences of our protein

analyse the functional information of the protein

KEGG

to find out the detail of biological activities of our protein

study the whole function our protein

Pfam

contain the family tree our protein

can study the structure and alignment of protein

CAZy

determine in detailed the activity and metabolism of our protein towards carbohydrate

Gene3D

to study the 3D structure of protein and each sequences of the structure

Criteria that need to be fulfilled to be inserted into NAR database

discuss the topic of interest

emerging or specialized subject areas

single cell gene regulation studies

nuclear architecture and functional consequences

gene targeting and Genome Engineering

genome studies using massive parallel(Deep) sequencing

molecur machines and complex molecular assemblages

single molecule studies of macromolecular function

synthetic Biology and Chemistry

Computational biology

Description of a new algorithm that represents a substantial improvement over current methodology and has direct biological relevance

Describe the use of existing computational method to generate significant novel biological information

Gene regulation,chromatin and epigenetics

genome integrity,repair and replication

genomics

nucleic acid enzymes

RNA

structure assembly,mechanism of action and regulation of ribosomes,snRNPs and other stable ribonucleoprotein particles

Structure,biogenesis,cellular roles and regulation of non-coding RNAs

new information about the structure and biochemistry of nucleic acid binding proteins or enzymes that function in RNA metabolism

Structural biology

Molecular biology

must update regularly

Can be access without interface or paywall

New and Updated NAR Database

Nucleic Acid Sequence and Structure, Transcriptional Regulation

microRNA Databases

miRBase

miRNEST

mirTarBase

PolymiRTS

starBASE

NONCODE Database

Various type of non-coding RNA

Transcription Factor Binding Sites (TFBSs)

JASPAR

YEASTRACT

Protein Sequences and Structure, Motifs and Domains, Protein-protein Interactions

annual update

uniprot and KEGG

popular database

Pfam

eggNOG

ELM

2 new database

HRAP

Repeats database

structural database

NDB

RNA bricks

protein structure database

RDBe

visualization

analysis of the electron microscopy derived structure

NCBI (MMDB)

calculated using VAST +

protein-protein interaction

Negatome database

Protein SCOP Database

SCOPe

SCOP Hierarcy

SCOP2

Metabolics and Signalling Pathways, Enzymes, Protein Modification

Metabolics Pathway Databases

MetaCyc

Reactome

Small Molecule Pathway

Catalytic Site Atlas

Structure-Function Linkage Database (SFLD)

Carbohydrate-Active enzyme Database (CAZy)

Merops

EKPD

MultiTaskDB

Viruses, Bacteria, Protozoa and Fungi

Human Microbiome Project

Genome Annotation

Human Pathogen

IMG

PATRIC

SEED

Free-living Microorganism

CynoBase

PortEco

Rhizobase

SubtiWiki

Microbial Diversity in Natural Enviroment

JGI'S IMG/M

EBI metagenomics resources

Sequence of Microbial Genome

Ribosomal Database Project (RDP)

the SILVA/LTP project

BacDive

Taxonomic

MetaRef

Human Genome, Model Organisms, Comparative Genomics

Human Genome

dbGaP

database of genotyping result

related to clinically relevant phenotype

Consensus CDS Project

collaborative effort to identify a core set of human protein region

Model Organism Databases

Saccharomyce Genome Database (SGD)

WormBase

FlyBase

Mouse Genome Database (MGD)

Mouse Gene Expression Database

Mouse Phenome Database

Vertebrate Genome Annotation (VEGA)

International Mouse Phenotyping Consortium (IMPC)

Genomic Variation, Diseases and Drugs

NCBI's ClinVar

database documenting clinically relevant sequences variant

NHGRI GWAS Catalog

a curated resources of SNP-Treat Association

Sanger institute's DECIPHER

database of pathogenic single nucleotide variant, indels and copy-number variant

Database of Genomic Variant (DGV)

canSAR

DriverDB

FINDbase

HbVar

Lynx

NECTAR

Progenetix

Plant Databases

Others Molecular Biology Databases

Organization of Databases in NAR

Nucleotide Sequence Databases

International Nucleotide Sequence Database Collaboration

BioSample

GenBank

Coding and Non-coding DNA

Dfam

Patome

Gene Structure, introns and exon, Splice side

GeneTack

ECgene

Transcriptional Regulator Sites and Transcription Factors

JASPAR

3D-Footprint

RNA sequence Databases

BPS

MeRNA

snOPY

Protein Sequence Databases

General Sequence Databases

Patome

UniProt

Protein Properties

REFOLD

Cybase

Structure Databases

BARD

Small Molecule

SuperDrug

DrugBank

Protein Structure

DBAli

Genome3D

Gene3D

Genomic Databases ( Non-Vertebrate)

MGD-mouse genomic database

The Gene Indices

Viral Genome Databases

HFV Database

ViralZone

Metabolic and Signalling Pathway

ChemProt

Protein-protein Interaction

IBIS

GeneNet

Enzyme and Enzyme nomeclature

BRENDA

ORENZA

Human and Other Veterbrate Genomes

Model Organisms, Comparative Genomics

AgBase

GeneSpeed

Human Genome Database, Maps Viewers

GeneAnnot

HOWDY

Human Genes and Diseases

CancerResource

Protein Mutant Database

Cancer Gene Databases

MethyCancer

CancerGene

Microarray Data and Other Gene Expression Databases

Gene Expression Barcode

HemBase

Sebida

Proteomics Resourses

GELBANK

MOPED

MAPU

Molecular Biology Databases

Network Portal

PubMed

CellFinder

Drug and Drug Design

PharmGed

SuperDrug

Molecular Probe and Primer

PrimerBank

VirOligo

Organelle Databases

Organelle Genome

GOBASE

Metochondrial Genes and Protein

MITOMAP

HMPD

Plant Databases

Chloroplast Genome Database

General Plant Databases

GeneFarm

PGDD

Immunological Databases

Epitome

MUJEN MOUSE DATABASE

Cell Biology

NCBI BookShelf

CloneDB

Different Type of Databases

UniProt

Database of protein sequences and functional information

UniProtKB

UniRef

UniParc

KEGG

Database resources for understanding high level function and utilities of the biological system

KEGG GENOME

KEGG GENES

KEGG DISEASES

KEGG DRUG

Pfam

Database of protein families that include annotation and multiple sequences aligments generated using hidden Markov models

CAZy

Database of Carbohydrate-Active enzymes which contains classification and associated information about enzymes involved in the systhesis, metabolism and transport of carbohydrates.

CAZypedia

Gene3D

To find out the protein domain, sequences features and 3D structures

CATH

Genome3D

InterPro