Subcellular Anatomy Ontology (SAO)
The Subcellular Anatomy Ontology (SAO)
NEW! To view the CCDB / SAO Wiki, click here
NEW! To read a published article that describes the SAO from a neuroscience perspective, click here
Larson SD, Fong LL, Gupta A, Condit C, Bug WJ, Martone ME (2007) "A formal ontology of subcellular neuroanatomy", Front. Neuroinform. 1:3. doi:10.3389/neuro.11/003.2007
To view the SAO 1.2.5 on the NCBO Bioportal, click here.
To read a published article that describes SAO and its utilization, click here.
Fong LL, Larson SD, Gupta A, Condit C, Bug WJ, Chen L, West R, Lamont S, Terada M, Martone ME (2007) "An Ontology-Driven Knowledge Environment for Subcellular Neuroanatomy", OWL: Experiences and Directions, Innsbruck, Austria, CEUR Workshop Proceedings, ISSN 1613-0073, http://CEUR-WS.org/Vol-258/, June 6-7, 2007
To read a published article that uses SAO to perform rule-based reasoning, click here.
Larson SD, Martone ME (2007) "Rule-Based Reasoning with a Multi-Scale Neuroanatomical Ontology", OWL: Experiences and Directions, Innsbruck, Austria, CEUR Workshop Proceedings, ISSN 1613-0073, http://CEUR-WS.org/Vol-258/, June 6-7, 2007
Additional materials for the paper on rule-based reasoning are available here.
We have developed the Subcellular Anatomy Ontology (SAO) for the nervous system to provide a formal ontology to describe structures from the dimensional range known as the âmesoscale,â encompassing cellular and subcellular structure, supracellular domains, and macromolecules.
The development of ontologies for neuroscience data is a key objective of neuroinformatic. An ontology consists of a set of concepts, or entities, within a domain linked by relationships such as âis aâand âhas part,â e.g., â neuron is a cellâ and âcell has part plasma membrane.â Ontologies are highly valuable in that they provide a formalization of knowledge within a domain in a machine-readable form. Ontologies include a much wider scope of information than taxonomies, which are simply hierarchical representations of the concepts but lack formal descriptions of their properties and the types of relationships they have with one another. Ontologies have been used in many business and scientific environments to share, reuse, and process domain knowledge. A well constructed ontology supports the use of machine-based reasoning to derive new knowledge based on encoded relationships among individual data nodes.
Ontologies form one of the cornerstones of large scale data sharing projects such as the Cell Centered Database and Biomedical Informatics Research Network (BIRN) that are building infrastructure for amassing and integrating biological data across dimensional scales. Ontologies promote data integration both by providing a common terminology for data annotation and the means by which relationships among diverse data can be inferred (Gupta et al., 2001 link to PDF of paper by Gupta; should be on BIRN site).
As part of the Cell Centered Database project, we are developing informatics infrastructure for the dimensional range known as the âmesoscaleâ. This range roughly encompasses the structures that sit between gross morphology and molecular structure, e.g., cellular networks, cellular and subcellular microdomains along with their macromolecular constituents. These structures lie at the heart of information processing in the nervous system, providing the adaptive spatial framework in which processes giving rise to complex behaviors occur. The study of mesoscale structures continues to present a challenge to experimentalists, because their dimensions fall squarely between the capabilities of current imaging technologies. Investigations of physiology, structural dynamics, coarse molecular distributions and large-scale distributions of dendritic spines are typically accomplished by optical microscopies. Appreciation of the fine structural detail on internal structure, cytoskeletal organization, localization of molecular constituents, location of synaptic contacts, and detailed views of the immediate microdomain such as pre-synaptic boutons and glial processes require 3D electron microscopic imaging. To build a comprehensive understanding of the nervous system in this dimensional range requires the ability to aggregate data obtained by multiple researchers across techniques and spatial scales.
The Ontology for Subcellular Anatomy of the Nervous System (SAO) describes the parts of neurons and glia and how these parts come together to define supracellular structures such as synapses and neuropil (Fong et al., submitted). Molecular specializations of each compartment and cell type are identified. The SAO was designed with the goal of providing a means to annotate cellular and subcellular data obtained from light and electron microscopy, including assigning macromolecules to their approporiate subcellular domains. The SAO thus provides a bridge between ontologies that describe molecular species and those concerned with more gross anatomical scales. Because it is intended to integrate into ontological efforts at these other scales, particular care was taken to construct the ontology in a way that supports such integration.
Structure of the SAO
Cells are divided into regional parts, e.g., dendrite and axon, and component parts, e.g., mitochondrion, similar to the way the Foundational Model of Anatomy divides anatomical structures. Each of the parts of a cell can be further divided into regional parts and component parts, e.g., an axon can be divided into an initial segment and main axon. Each part of the cell is connected to its parent part through the relationships âcontinuous withâ. Thus a dendrite is continuous with the cell somata; the dendritic spine is continuous with the dendritic shaft.
Because the SAO is built on a model of the cell, both molecular constituents and anatomical location are assigned to the subparts of cells, rather than to the cell itself. The SAO utilizes the âlocated inâ relationship to situate cellular parts into higher order brain regions. The SAO was constructed using OWL (Web Ontology Language), a first order description logic that supports reasoning. We have constructed the SAO in a way that allows inferencing to be performed across scales so that molecules and higher order connectivity may be inferred from local interactions (Larson and Martone, submitted Ã link to PDF of rule based reasoning paper). Ideally, the observation that F-actin is located in the head of a dendritic spine from a Purkinje cell dendrite found in the molecular layer of the cerebellar cortex should allow for the following statements to be inferred.
The SAO directly builds upon several foundational ontologies recommended by the OBO Foundry Project, designed to promote the adoption of best practices in ontology construction to foster interoperability of ontologies with the broader biomedical community. At the most abstract level, SAO class structure follows the Basic Formal Ontology.
Cell: The SAO contains a list of cell types found in the nervous system. Top level cell classes such as neuron and glia were taken from the Cell Type Ontology. The SAO does not contain a comprehensive list of neuron types, because these entities fall under the scope of other ontologies. Rather, because the SAO is designed as an application ontology for annotation of biological data, the parent cell types are expected to be added to the SAO as they are encountered. The SAO lists neurons according to common names that reflect a mixture of classification criteria, e.g., morphology (``pyramidal neuron''), proper names (``Purkinje neuron''). The SAO utilizes these names merely as labels and does not further classify cell types into subtrees, except in instances where the hierarchy is fairly straightforward, e.g., layer 3 cortical pyramidal neuron is a cortical pyramidal neuron. We deliberately kept the cell classification flat because the SAO can be used to classify neurons along multiple dimensions through their specific properties, e.g., primary neurotransmitter, number of processes, anatomical location of cell parts.
Part of Cell: The SAO comprises two main classes of cell parts: regional part and component part. Regional part of cell is elaborated under the BFO concept Fiat Object Part defined as a "part of an object not demarcated by any physical discontinuities ." Regional parts of neurons include dendrites, axons, the cell soma and protrusions such as dendritic spines. Regional parts of glia include the cell soma and processes such as astrocytic endfeet and myelinating processes. Each of these regional parts may be further subdivided into finer parcellations. Component parts are considered to be BFO independent objects and represent the building blocks common to all cells, e.g., organelles. Components are largely adapted from the Gene Ontology cell component hierarchy and cross referenced to the GO ID where possible, with additional neuron-specific components added where necessary. Macromolecules may be considered to be component parts, but are listed under a separate class in the SAO. Just as with cell types, the SAO does not contain an exhaustive list of macromolecules, because these entities are covered in other resources. As with cell types, we intend to keep SAO application-driven, and as such as molecules are encountered in biological data, they may be added to the SAO.
Supracellular structures: The SAO also includes a class termed ÂŽsupracellular structureÂ¡. While somewhat paradoxical in light of use of ÂŽsubcellularÂ¡ in the name of the ontology, this class refers to multicellular domains defined by subcellular parts of neurons and glia such as neuropil, synapses, and the Node of Ranvier. The supracellular designation refers to the fact that the subcellular parts are derived from at least two different cells. To classify supracellular domains according to the BFO, we used both the aggregate object and site classes. We consider some supracellular domains as aggregate objects because they represent a somewhat ad hoc grouping of cell parts into a higher order structures. which we believe best fits the BFO:ObjectAggregate defintion: "a mereological sum of separate objects possessing non-connected boundaries.". For example, the neuropil is a term applied to regions of the nervous system characterized by a dense tangle of intertwined cell processes each of which have distinct non-connected boundaries. Other supracellular structures are better characterized as sites, because they are believed to be the locations at which a particular function occurs. For example, the synapse is the site at which neurotransmission occurs. The location of that function is inferred because of the presence of one of more molecules or cell components involved in these processes. For some aggregate structures, we create an aggregate object and then a site where the object is located, e.g., the chemical synapse may be considered an abstract aggregate entity that consists of a pre-synaptic part, a post-synaptic part and a junctional part. Each of these parts have cell components, e.g., synaptic vesicles, located within them that define the extents of these parts. The site of the synapse is that part of the cell or cell-cell apposition where the parts of the synapse are localized. In this way, we restrict the synaptic site to that portion of the cell part occupied by synaptic components which certainly fits the BFO:Site definition as an entity possessing "a characteristic spatial shape in relation to some arrangement of other (entities) and ... (which) can be occupied by other (entities)".
SAO Properties: Properties in the SAO are grouped into part of, morphological and spatial relationships. Regional parts are assigned to each cell class using restrictions, e.g., neurons may only have neuronal regional parts. Each regional part is assumed to belong to a parent cell; geometrical relationships among cell parts are specified by relationships such as continuous with, e.g., dendrites are continuous with the cell somata. Although some properties are assigned at the level of cell class, e.g., morphological type, most are assigned at the level of regional cell part. In this way, cell components and macromolecules are assigned to the particular part of the nerve cell in which they are found. Similarly, because nerve cells are large and may span many brain regions, the property has anatomical location, designed to situate the cell within a regional part of the nervous system, is assigned separately to each part of the cell. Regional parts of brain will be drawn from other resources, e.g., BAMS