Bioscape Workflow
From irefindex
The preparation of a working Bioscape system involves a number of activities in a workflow or schedule. These activities are performed in the following general order (with annotations referring to functions in the bsindex_quickstart.py script, found in the scripts directory of the bsindex distribution):
- Initialise basic resources (quickstart, init_database).
- Import essential data and initialise data sources (update_sources, update_source).
- Update derived information such as lexicon tables, scores for essential data (update_derived_sources).
- Import textual data and initialise textual data sources (update_text_source).
- Update text search results and related information such as result scores (update_text).
- Initialise the Web database in order to present a coherent view of the system (init_web_database).
The bioscape/sql directory (in the bsadmin distribution) provides a reasonable overview of the different activities, containing activity-specific directories which each contain templates for manipulating the database. The activities involved include the following:
- Basic resource initialisation: dictionaries, score
- Data source initialisation: chebi, gene, go, taxonomy
- Derived data preparation: bioentities, searchscore, termscore
- Textual data source initialisation: text
- Search results and related data preparation: search, sentencescore, results, resultscore, evidence, evidencescore
- Web data preparation: web-bioentities, web-index, web-search, web-results, web-evidence, web-resultscore