README iRefIndex Feedback 2.0
Revision as of 17:24, 12 December 2008
Last edited: October 13th, 2008
Applies to iRefIndex release: 1.1 beta.
Release date: July 11th, 2008
Authors: Ian Donaldson and Sabry Razick
Database: iRefIndex (http://irefindex.uio.no)
Organization: Biotechnology Centre of Oslo, University of Oslo (http://www.biotek.uio.no/)
Contents
- 1 Description
- 2 Directory contents
- 3 Changes from last version
- 4 Known Issues
- 5 License
- 6 Citation
- 7 Disclaimer
- 8 Understanding the Feedback file
- 9 Description of Feedback file
- 9.1 Column number: 1
- 9.2 Column number: 2
- 9.3 Column number: 3
- 9.4 Column number: 4
- 9.5 Column number: 5
- 9.6 Column number: 6
- 9.7 Column number: 7
- 9.8 Column number: 8
- 9.9 Column number: 9
- 9.10 Column number: 10
- 9.11 Column number: 11
- 9.12 Column number: 12
- 9.13 Column number: 13
- 9.14 Column number: 14
- 9.15 Column number: 15
- 10 Not found file
Description
This file describes the contents of the xxx/feedback directory and the format of the tab-delimited text files contained within.
Directory contents
README | this file |
xxxx.feedback.y.y.txt.zip | feedback for some database (xxxx) for release y.y |
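Since the feedback files are distributed as zip archives of tab-delimited text, a reader might unpack them on the fly rather than extracting them by hand. The following is a minimal sketch in Python, not an official iRefIndex tool. It assumes that each archive holds a single text file and that the text is UTF-8 encoded; neither is stated in this README. The function name `read_feedback_zip` is hypothetical.

```python
import io
import zipfile


def read_feedback_zip(zip_path):
    """Yield text lines from the first member of a feedback zip archive."""
    with zipfile.ZipFile(zip_path) as archive:
        # Assumption: the archive contains exactly one tab-delimited text file.
        member = archive.namelist()[0]
        with archive.open(member) as raw:
            # Assumption: the file is UTF-8 text.
            for line in io.TextIOWrapper(raw, encoding="utf-8"):
                yield line.rstrip("\n")
```

Each yielded line can then be split on tab characters to obtain the columns described later in this file.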
Changes from last version
None. First release of this file.
Known Issues
None.
License
This directory is private and only released to invited source databases. These data are released under the Creative Commons Attribution License http://creativecommons.org/licenses/by/2.5/. This means that you are free to use, modify and redistribute these data for personal or commercial use so long as you provide appropriate credit. See next section.
Copyright © 2008 Ian Donaldson
Citation
Razick, S., Magklaras, G., Donaldson, IM. iRefIndex: A consolidated protein interaction database with provenance. BMC Bioinformatics. 2008. 9(1):405. PMID 18823568.
Disclaimer
Data is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
Understanding the Feedback file
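Each character in the rogscore column (column 14) is one of the score features listed below. Together, the features describe how the protein reference in the interaction record was mapped to a sequence.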
Score | Description of feature |
P | The interaction record's primary (P) reference for the protein was used to make the assignment. |
D | The source database (D) listed in the interaction record is different than what is expected for the given accession for the protein. In specific cases, this difference is tolerated and the assignment is made. |
T | The taxonomy (T) identifier for the protein (as supplied by the interaction record) differed from what was found in the protein sequence record. This discrepancy was tolerated and the assignment was made. |
M | The protein reference listed by the interaction record was a typographical modification (M) of a known accession. In specific cases, this variation is tolerated and the assignment is made. |
V | The protein reference listed by the interaction record contained version (V) information that was ignored. For example, RefSeq accession.version NP_012420.1 was listed but treated as RefSeq accession NP_012420. |
Q | The protein reference used to make the assignment was of the type "see-also". See PSI-MI Path: entrySet/entry/interactorList/interactor/xref/primaryRef/refType = "see-also". |
U | The protein reference listed in the interaction record and used to make the assignment was a secondary UniProt accession and was updated (U) to a primary UniProt accession in order to make the assignment. |
E | The protein reference was a retired NCBI Identifier. NCBI's eUtils (E) were used to retrieve the current accession and/or sequence. |
I | The protein reference used was an NCBI GenInfo Identifier (I). |
G | The interaction record's reference for the protein was an EntrezGene (G) identifier. The corresponding products of the gene were used to make the assignment. |
S | One of the interaction record's secondary (S) references for the protein was used to make the assignment. |
+ | More than one assignment is possible (+). This case may arise in one of three ways. 1) The reference supplied by the interaction record requires updating but more than one possibility exists. For example, Q7XJL8 was found to be a secondary accession in three separate UniProt records (Q3EBZ2, Q6DR20, and Q8GWA9). 2) The secondary references supplied by the interaction record point to more than one unique protein sequence. 3) An EntrezGene identifier is provided in the interaction record as a protein reference. This identifier points to more than one protein product. An attempt is made to resolve this ambiguity as indicated by ROG score features O, X or L (see below). |
O | More than one assignment is possible (see + above). The assignment chosen has a SEGUID that is identical to the SEGUID of the original (O) sequence provided in the interaction record. |
X | More than one assignment is possible (see + above). The assignment chosen has the same taxonomy (X) identifier as listed in the interaction record. |
L | More than one assignment is possible (see + above). The assignment with the largest (L) SEGUID is arbitrarily chosen (see Methods). |
N | The protein reference, taxonomy identifier and sequence for the protein as provided in the interaction record are used to make a new entry in the SEGUID table. The protein interactor is assigned the newly (N) generated ROG identifier. |
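As an illustration, the score table above can be applied to a rogscore string such as PUTO+ (the example given for column 14). The sketch below is in Python. The short labels are abbreviated from the table and are our own wording, and the function name `decode_rogscore` is hypothetical rather than part of any iRefIndex tool.

```python
# Short labels abbreviated from the score table above (wording is ours).
SCORE_FEATURES = {
    "P": "primary reference used",
    "D": "source database differs from expected",
    "T": "taxonomy identifier differs from sequence record",
    "M": "typographical modification of a known accession",
    "V": "version information ignored",
    "Q": "see-also reference used",
    "U": "secondary UniProt accession updated to primary",
    "E": "retired NCBI identifier resolved via eUtils",
    "I": "NCBI GenInfo Identifier used",
    "G": "EntrezGene identifier used",
    "S": "secondary reference used",
    "+": "more than one assignment possible",
    "O": "chosen by SEGUID match to original sequence",
    "X": "chosen by matching taxonomy identifier",
    "L": "largest SEGUID chosen arbitrarily",
    "N": "new SEGUID table entry created",
}


def decode_rogscore(rogscore):
    """Return (letter, label) pairs for each character of a rogscore string."""
    decoded = []
    for letter in rogscore:
        # Characters not in the table are kept and flagged rather than dropped.
        decoded.append((letter, SCORE_FEATURES.get(letter, "unknown feature")))
    return decoded


if __name__ == "__main__":
    for letter, label in decode_rogscore("PUTO+"):
        print(letter, label)
```

Keeping unknown characters in the output, instead of raising an error, makes it easier to spot score letters that are not yet documented in the table.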
Description of Feedback file
Each line in this file represents
- a protein interactor found in some source db record and
- the iRefIndex mapping to a current protein sequence record.
Column number: 1
- Note (Sabry): I moved the db to the first column and the acc to the second column. I think it is better to state where the interaction was located, although the file name also tells this. Further, when there is no interaction identifier (-10), it does not look good.
Column name: | int_db |
Column type: | string |
Description: | name of interaction db |
Example: | intact |
Notes
Possible values in this field are:
bind | biomolecular interaction network db |
biogrid | the biogrid db |
dip | db of interacting proteins |
hprd | human protein reference db |
intact | ebi interaction db |
mint | molecular interaction db |
mpact | mips yeast protein interaction db |
mppi | mips mammalian protein interaction db |
ophid | online predicted human interaction db |
Column number: 2
Column name: | int_acc |
Column type: | string |
Description: | accession for interaction record |
Example: | intact |
Notes
Column number: 3
- I recommend having this column. It would allow the sources to locate their files and avoid questions like 'Where did you get this information from?'
Column name: | source_file |
Column type: | string |
Description: | source file of interaction record |
Example: | pmid_2006_14691232.xml |
Notes
Column number: 4
Column name: | primary_db |
Column type: | string |
Description: | source db for accession listed in column 5 |
Example: | uniprotkb |
Notes
This is the primary protein sequence database referenced in the interaction record.
Column number: 5
Column name: | primary_acc |
Column type: | string |
Description: | An accession for a protein interactor in some database as supplied in the interaction record (see columns 1-2) |
Example: | Q9Y6Q9 |
Notes
Column number: 6
Column name: | primary_taxid |
Column type: | integer |
Description: | taxonomy of protein interactor as listed in the source interaction record |
Example: | 9606 |
Notes
Column number: 7
Column name: | used_db |
Column type: | string |
Description: | source db for accession listed in column 8 |
Example: | uniprotkb |
Notes
This is the primary protein sequence database referenced in the interaction record.
Column number: 8
Column name: | used_acc |
Column type: | string |
Description: | An accession for a protein interactor in some database as supplied in the interaction record (see columns 1-2) |
Example: | Q9Y6Q9 |
Notes
Column number: 9
Column name: | used_taxid |
Column type: | integer |
Description: | taxonomy of protein interactor as listed in the source interaction record |
Example: | 9606 |
Notes
Column number: 10
Column name: | mapped_db |
Column type: | string |
Description: | the source protein db that this interactor was mapped to by iRefIndex |
Example: | uniprot |
Notes
This will most likely be the same as the db listed in column 7 (used_db) unless:
reason | example | see scores with... |
the db name is not valid or is a variation of a cv db | uniprot in place of "protein database" | D |
Column number: 11
Column name: | mapped_acc |
Column type: | string |
Description: | the accession that this interactor was mapped to by iRefIndex |
Example: | Q9Y6Q9 |
Notes
This will most likely be the same accession as listed in column 8 (used_acc) unless:
reason | example | see scores with... |
a modified version of the accession has been used | NP_0001 in place of NP 0001 | M |
an updated version of the accession has been used | xxx in place of xxx | U or E |
Column number: 12
Column name: | mapped_taxid |
Column type: | integer |
Description: | Taxonomy identifier for the interactor as found in the source protein db for the record specified in columns 10 and 11. |
Example: | 9606 |
Notes
This will most likely be the same as the taxid listed in column 9 (used_taxid) unless:
reason | example | see scores with... |
the listed taxid is different from that found in the mapped record | xxx in place of xxx | T |
Column number: 13
Column name: | rogid |
Column type: | string |
Description: | rogid of the interactor assigned by iRefIndex |
Example: | HWcRyNPgZ0dLD9cb5iuiarsGG8E9606 |
Notes
See iRefIndex paper. PMID 18823568.
Column number: 14
Column name: | rogscore |
Column type: | string |
Description: | description of assignment score for this interactor |
Example: | PUTO+ |
Notes
See iRefIndex paper. PMID 18823568.
Column number: 15
Column name: | score_type |
Column type: | integer |
Description: | assignment score type 1-6 |
Example: | 1 |
Notes
See Table 4, column 1 of the iRefIndex paper. PMID 18823568.
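The fifteen columns described above can be read into named fields with a few lines of code. The following is a rough Python sketch, not a reference parser. The column names and types are taken from this README. The names `FeedbackRow` and `parse_feedback` are hypothetical. It assumes that the file has no header line and that the integer columns always hold integers, neither of which is confirmed here. The interaction accession in the sample line is a made-up placeholder.

```python
import csv
import io
from collections import namedtuple

# Column names in the order documented above (columns 1-15).
FEEDBACK_COLUMNS = [
    "int_db", "int_acc", "source_file",
    "primary_db", "primary_acc", "primary_taxid",
    "used_db", "used_acc", "used_taxid",
    "mapped_db", "mapped_acc", "mapped_taxid",
    "rogid", "rogscore", "score_type",
]
# Columns documented with type "integer".
INTEGER_COLUMNS = {"primary_taxid", "used_taxid", "mapped_taxid", "score_type"}

FeedbackRow = namedtuple("FeedbackRow", FEEDBACK_COLUMNS)


def parse_feedback(handle):
    """Yield one FeedbackRow per tab-delimited line read from a text handle."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for fields in reader:
        if not fields:
            continue  # skip blank lines
        if len(fields) != len(FEEDBACK_COLUMNS):
            raise ValueError("expected 15 columns, got %d" % len(fields))
        values = []
        for name, value in zip(FEEDBACK_COLUMNS, fields):
            # Assumption: integer columns never hold placeholders such as '-'.
            values.append(int(value) if name in INTEGER_COLUMNS else value)
        yield FeedbackRow(*values)


# Sample line built from the examples in this README; the interaction
# accession "EXAMPLE_ACC" is a hypothetical placeholder.
SAMPLE_LINE = "\t".join([
    "intact", "EXAMPLE_ACC", "pmid_2006_14691232.xml",
    "uniprotkb", "Q9Y6Q9", "9606",
    "uniprotkb", "Q9Y6Q9", "9606",
    "uniprot", "Q9Y6Q9", "9606",
    "HWcRyNPgZ0dLD9cb5iuiarsGG8E9606", "PUTO+", "1",
])

if __name__ == "__main__":
    for row in parse_feedback(io.StringIO(SAMPLE_LINE + "\n")):
        print(row.int_db, row.mapped_acc, row.mapped_taxid, row.rogscore)
```

Named fields make it straightforward to compare, for example, `used_taxid` with `mapped_taxid` for each row without relying on column positions.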
Not found file
Same format as the feedback file, but the scores are all '-' (need to talk to Ian, Sabry).
Each row in this table represents an interactor reference which we were unable to map to a sequence.
Column number: 1
Column name: | int_acc |
Column type: | string |
Description: | accession for interaction record |
Example: | intact |
Notes
Column number: 2
Column name: | int_db |
Column type: | string |
Description: | name of interaction db |
Example: | intact |
Notes
Possible values in this field are:
bind | biomolecular interaction network db |
biogrid | the biogrid db |
dip | db of interacting proteins |
hprd | human protein reference db |
intact | ebi interaction db |
mint | molecular interaction db |
mpact | mips yeast protein interaction db |
mppi | mips mammalian protein interaction db |
ophid | online predicted human interaction db |
Column number: 3
Column name: | unfound_acc |
Column type: | string |
Description: | An accession for a protein interactor in some database as supplied in the interaction record (see columns 1-2) |
Example: | Q9Y6Q9 |
Notes
Column number: 4
Column name: | unfound_db |
Column type: | string |
Description: | source db for accession listed in column 3 |
Example: | uniprotkb |
Notes
This is the primary protein sequence database referenced in the interaction record.
Column number: 5
Column name: | unfound_taxid |
Column type: | integer |
Description: | taxonomy of protein interactor as listed in the source interaction record |
Example: | 9606 |
Notes
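A source database might want a quick summary of which protein databases its unmapped references point to. The sketch below, in Python, counts rows of the not found file per `unfound_db`. It assumes the five-column layout listed in this section with no header line, and the function name `count_unfound_by_db` is hypothetical.

```python
import csv
import io
from collections import Counter

# Column names in the order listed in this section (columns 1-5).
NOT_FOUND_COLUMNS = [
    "int_acc", "int_db", "unfound_acc", "unfound_db", "unfound_taxid",
]


def count_unfound_by_db(handle):
    """Count unmapped interactor references per protein database."""
    counts = Counter()
    reader = csv.DictReader(
        handle,
        fieldnames=NOT_FOUND_COLUMNS,
        delimiter="\t",
        quoting=csv.QUOTE_NONE,
    )
    for row in reader:
        counts[row["unfound_db"]] += 1
    return counts
```

The result is a `Counter`, so `count_unfound_by_db(handle).most_common()` lists the protein databases with the most unmapped references first.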