- To write a wrapper for new data source
- To gain a clear understanding of how mediated schemas are developed and how BioMediator translates between user queries, the BioMediator ontology and data source schemas.
The process of adding a new datasource to Bio-Mediator involves the following steps:
- Identify the datasource and figure out how to query via URL
- Write a wrapper that queries the database via URL and returns valid XML that can be consumed by the metawrapper
- Create relationships in the Source Knowledge Base
- Write directives that map XML returned by the wrappers to the XML of the mediated schema
During the course of this Independent Study, I wrote two wrappers to query the InterPro protein database. The first wrapper queries by InterPro's internal ID or accession number, while the second wrapper queries by a genetic sequence. The results are returned in XML, which is then passed to the meta-wrappers.
After completing the wrappers, the Bio-Mediator team made relevant entity and relationship connections in the Protege Source Knowledge Base.
These metawrappers use "directives" to translate from the source-specific XML to the XML of Bio-Mediator's mediated schema. Once the results have been translated into Bio-Mediator XML, the data is then displayed.
|