Download DualSeqDB data
This file contains all Dual-Sequencing Database entries, as described in the About section.
DualSeqDB version 1 (2020-08)
dualseqdb_v1.zip (47.4 MB, contains a 236.2 MB tab-separated text file)
This table contains more than 300,000 entries, each corresponding to a bacterial or host gene whose expression level was determined during infection using Dual-Seq.
Each row includes:
- A semi-stable unique internal identifier (or "DualID") ,
- Pathogen name ,
- Pathogen NCIB taxon ID ,
- Host name ,
- Host NCBI taxon ID ,
- Organism ,
- Tissue infected ,
- GenBank Locus Tag ,
- GenBank Protein ID ,
- UniProt accession (where available) ,
- Gene symbol (where available) ,
- Time post infection ,
- Description of the gene product (protein) ,
- A log2 expression fold change ,
- A p-value ,
- PubMed ID of the original study ,
- Protein sequence .
For information on how the expression fold change values were obtained, please see the About tab.
Our own work is licenced under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Licence . Please also see the CRG's legal notice.