Displaying Bioscape Search Results

From irefindex
Revision as of 16:56, 26 January 2010 by PaulBoddie (talk | contribs) (Initial notes.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Although it might be acceptable for some applications to merely list the supposed presence of bioentity mentions in a given document, potentially with information about the location of such mentions, it is much more convenient to produce a display which integrates such results with the original text in which they have been found. To achieve this, the Bioscape Web application employs a number of techniques:

  1. The selection of results for a particular region of text, typically a sentence.
  2. The traversal of such results in order, assigning each result to one or more fragments of the text.
  3. The production of XML output which can then be transformed into a format readable by Web browsers.

Consider this example region of text:

"single-stranded DNA-binding protein"

Now consider the following occurrences of bioentities corresponding to regions of this text:

  • A - single-stranded DNA-binding
  • B - binding
  • C - DNA-binding protein

Upon processing these two input, the fragments should be defined as follows:

Fragments single-stranded DNA- binding protein
Occurrences A A, C A, B, C C