Difference between revisions of "iRefIndex Testing 7.0"

From irefindex
Line 3: Line 3:
 
==Cross check with output of element counter==
 
==Cross check with output of element counter==
 
=== Program to use : biotek.uio.no.XML.Element_Counter (SaxValidator package)===
 
=== Program to use : biotek.uio.no.XML.Element_Counter (SaxValidator package)===
* For each interaction source  </interactor> count should match the UID count  
+
* For each interaction source  </interactor> count should match the UID count int_object (select (select name from int_db where int_db.id=source) as intSource, count(uid) from int_object group by source; ).
int_object (select (select name from int_db where int_db.id=source) as
+
* For each interaction source  </interactor>  count should match the UID count int_source (select (select name from int_db where int_db.id=source) as intSource, count(uid) from int_source group by source;).
intSource, count(uid) from int_object group by source; ).
+
* When </interactor> is not usable to count distinct objects (when this occurs as part of interaction and repeated in interactorList) some other suitable element has to be used (e.g </participant>)
* For each interaction source  </interactor>  count should match the UID count  
+
* Why count the closing elements in the above cases (e.g. </interactor> , instead of <interaction> or </interaction ). The reason is interaction elements may have attributes  and elements starting with interaction may be ambiguous. This program uses text matching (to be independent of any XML parsing).
int_source (select (select name from int_db where int_db.id=source) as
+
 
intSource, count(uid) from int_source group by source;).
+
 
* When </interactor> is not usable to count distinct objects (when this occurs
+
==Check SEGUID. Check one record each to very the process worked==
as part of interaction and repeated in interactorList) some other suitable
+
===Test SEGUID updating process===
element has to be used (e.g </participant>)
+
*SQL query = select orid, count(distinct rog) as rog_C from seguid where orid<0 group by orid;
* Why count the closing elements in the above cases (e.g. </interactor> ,
+
{| {{table}}
instead of <interaction> or </interaction ). The reason is interaction elements
+
| align="center" style="background:#f0f0f0;"|''' orid'''
may have attributes  and elements starting with interaction may be ambiguous.
+
| align="center" style="background:#f0f0f0;"|'''Record_count'''
This program uses text matching (to be independent of any XML parsing).
+
|-
 +
| -30||16983
 +
|-
 +
| -26||2
 +
|-
 +
| -24||78
 +
|-
 +
| -23||14
 +
|-
 +
| -22||1043258
 +
|-
 +
| -21||669761
 +
|-
 +
| -12||2679
 +
|-
 +
| -11||1665
 +
|-
 +
| -8||6525
 +
|-
 +
| -7||6547
 +
|-
 +
| -6||5235
 +
|-
 +
| -5||50305
 +
|-
 +
| -3||10853842
 +
|-
 +
| -2||11972291
 +
|-
 +
|
 +
|}
 +
 
 +
*All entries with orid<0 are altered during update. All interies with orid>=0 are original entries from seguid annotation file.
 +
{| {{table}}
 +
| align="center" style="background:#f0f0f0;"|'''ORID'''
 +
| align="center" style="background:#f0f0f0;"|'''Description'''
 +
|-
 +
| -30 || This is a iRefIndex Complex (RIGID used as ROGID), included in a previous process
 +
|-
 +
| -26 ||Is a OLN dead yeast_acc mapped using UniProt cross reference
 +
|-
 +
| -25 ||Is a SGD acc dead yeast_acc mapped using UniProt cross reference
 +
|-
 +
| -24 ||Is a dead fly_acc mapped using UniProt cross reference
 +
|-
 +
| -23||  Is a dead PDB
 +
|-
 +
| -22 ||Is a dead RefSeq
 +
|-
 +
| -21 || Is a dead UniProtKB
 +
|-
 +
| -12 || Added to SEGUID from original sequence record (N-Scores) in a previous process
 +
|-
 +
| -11 || Added to SEGUID using Eutils in a previous process
 +
|-
 +
| -8 ||Is a live OLN acc yeast_acc mapped using UniProt cross reference
 +
|-
 +
|  -7 ||Is a live SGD acc yeast_acc mapped using UniProt cross reference
 +
|-
 +
|  -6 || Is a live fly_acc mapped using UniProt cross reference
 +
|-
 +
| -5 || Is a alive PDB
 +
|-
 +
|  -3 || Is a alive RefSeq
 +
|-
 +
|  -2 || Is a alive UniProtKB
 +
|-
 +
|
 +
|}
  
 
Follow this link for a listing of all iRefIndex related pages (archived and current).
 
Follow this link for a listing of all iRefIndex related pages (archived and current).
 
[[Category:iRefIndex]]
 
[[Category:iRefIndex]]

Revision as of 09:27, 28 November 2010

The testing procedure for iRefIndex

Cross check with output of element counter

Program to use : biotek.uio.no.XML.Element_Counter (SaxValidator package)

  • For each interaction source </interactor> count should match the UID count int_object (select (select name from int_db where int_db.id=source) as intSource, count(uid) from int_object group by source; ).
  • For each interaction source </interactor> count should match the UID count int_source (select (select name from int_db where int_db.id=source) as intSource, count(uid) from int_source group by source;).
  • When </interactor> is not usable to count distinct objects (when this occurs as part of interaction and repeated in interactorList) some other suitable element has to be used (e.g </participant>)
  • Why count the closing elements in the above cases (e.g. </interactor> , instead of <interaction> or </interaction ). The reason is interaction elements may have attributes and elements starting with interaction may be ambiguous. This program uses text matching (to be independent of any XML parsing).


Check SEGUID. Check one record each to very the process worked

Test SEGUID updating process

*SQL query = select orid, count(distinct rog) as rog_C from seguid where orid<0 group by orid;
orid Record_count
-30 16983
-26 2
-24 78
-23 14
-22 1043258
-21 669761
-12 2679
-11 1665
-8 6525
-7 6547
-6 5235
-5 50305
-3 10853842
-2 11972291
  • All entries with orid<0 are altered during update. All interies with orid>=0 are original entries from seguid annotation file.
ORID Description
-30 This is a iRefIndex Complex (RIGID used as ROGID), included in a previous process
-26 Is a OLN dead yeast_acc mapped using UniProt cross reference
-25 Is a SGD acc dead yeast_acc mapped using UniProt cross reference
-24 Is a dead fly_acc mapped using UniProt cross reference
-23 Is a dead PDB
-22 Is a dead RefSeq
-21 Is a dead UniProtKB
-12 Added to SEGUID from original sequence record (N-Scores) in a previous process
-11 Added to SEGUID using Eutils in a previous process
-8 Is a live OLN acc yeast_acc mapped using UniProt cross reference
-7 Is a live SGD acc yeast_acc mapped using UniProt cross reference
-6 Is a live fly_acc mapped using UniProt cross reference
-5 Is a alive PDB
-3 Is a alive RefSeq
-2 Is a alive UniProtKB

Follow this link for a listing of all iRefIndex related pages (archived and current).