Skip to content

Library Generation Schema

The Library Generation schema is designed to represent types and relationships of samples and digital data assets generated during processes that generate multimodal genomic data.

URI: https://identifiers.org/brain-bican/library-generation-schema

Name: library-generation-schema

Classes

Class Description
Annotation Biolink Model root class for entity annotations.
        QuantityValue A value of an attribute that is quantitative and measurable, expressed as a combination of a unit and a numeric value
ChemicalEntityOrGeneOrGeneProduct A union of chemical entities and children, and gene or gene product. This mixin is helpful to use when searching across chemical entities that must include genes and their children as chemical entities.
Entity Root Biolink Model class for all things and informational relationships, real or imagined.
        Checksum Checksum values associated with digital entities.
        DissectionRoiPolygon A polygon annotated on a brain slab image delineating a region of interest (ROI) for a tissue sample dissectioning.
        NamedThing a databased entity or concept/class
                Activity An activity is something that occurs over a period of time and acts upon or with entities; it may include consuming, processing, transforming, modifying, relocating, using, or generating entities.
                Attribute A property or characteristic of an entity. For example, an apple may have properties such as color, shape, age, crispiness. An environmental sample may have attributes such as depth, lat, long, material.
                BiologicalEntity None
                        Gene A region (or regions) that includes all of the sequence elements necessary to encode a functional transcript. A gene locus may include regulatory regions, transcribed regions and/or other functional sequence regions.
                        Genome A genome is the sum of genetic material within a cell or virion.
                InformationContentEntity a piece of information that typically describes some topic of discourse or is used as support.
                        Dataset an item that refers to a collection of data from a data source.
                                DigitalAsset None
                OrganismTaxon A classification of a set of organisms. Example instances: NCBITaxon:9606 (Homo sapiens), NCBITaxon:2 (Bacteria). Can also be used to represent strains or subspecies.
                PhysicalEntity An entity that has material reality (a.k.a. physical essence).
                        Donor A person or organism that is the source of a biological sample for scientific study. Many biological samples are generated from a single donor.
                        MaterialSample A sample is a limited quantity of something (e.g. an individual or set of individuals from a population, or a portion of a substance) to be used for testing, analysis, inspection, investigation, demonstration, or trial use. [SIO]
                                AmplifiedCdna A collection of cDNA molecules derived and amplified from an input barcoded cell sample. These cDNA molecules represent the gene expression of each cell, with all cDNA molecules from a given cell retaining that cell's unique barcode from the cell barcoding step. This is a necessary step for GEX methods but is not used for ATAC methods.
                                BarcodedCellSample A collection of molecularly barcoded cells. Input will be either dissociated cell sample or enriched cell sample. Cell barcodes are only guaranteed to be unique within this one collection. One dissociated cell sample or enriched cell sample can lead to multiple barcoded cell samples. The sequences of the molecular barcodes are revealed during alignment of the resulting fastq files for the barcoded cell sample. The barcoded cell sample name and the cell level molecular barcode together uniquely identify a single cell.
                                BrainSlab A thick flat piece of brain tissue obtained by slicing a whole brain, brain hemisphere or subdivision with a blade at regular interval. When multiple brain slabs are obtained from the slicing process, an ordinal is assigned to provide information about the relative positioning of the slabs.
                                DissociatedCellSample A collection of dissociated cells or nuclei derived from dissociation of a tissue sample.
                                EnrichedCellSample A collection of enriched cells or nuclei after enrichment process, usually via fluorescence-activated cell sorting (FACS) using the enrichment plan, is applied to dissociated cell sample.
                                Library A collection of fragmented and barcode-indexed DNA molecules for sequencing. An index or barcode is typically introduced to enable identification of library origin to allow libraries to be pooled together for sequencing.
                                LibraryAliquot One library in the library pool. Each library aliquot in a library pool will have a unique R1/R2 index to allow for sequencing together then separating the sequencing output by originating library aliquot through the process of demultiplexing. The resulting demultiplexed fastq files will include the library aliquot name. A given library may produce multiple library aliquots, which is done in the case of resequencing. Each library aliquot will produce a set of fastq files.
                                LibraryPool A library pool is made up of library aliquots from multiple libraries. Each library aliquot in a library pool will have a unique R1/R2 index to allow for sequencing together then separating the sequencing output by originating library aliquot through the process of demultiplexing.
                                TissueSample The final intact piece of tissue before cell or nuclei prep. This piece of tissue will be used in dissociation and has an region of interest polygon (ROI) associated with it.
                Procedure A series of actions conducted in a certain order or manner
                        CdnaAmplification The process of creating a collection of cDNA molecules derived and amplified from an input barcoded cell sample. A large amount of cDNA is needed to have accurate and reliable sequencing detection of gene expression. This process generates multiple copies of each mRNA transcript (expressed gene) within each cell while retaining the cell's unique barcode from the barcoding step. This is a necessary step for GEX methods but is not used for ATAC methods.
                        CellBarcoding The process of adding a molecular barcode to individual cells in a sample. The input will be either dissociated cell sample or enriched cell sample. Cell barcodes are only guaranteed to be unique within this one collection. One dissociated cell sample or enriched cell sample can lead to multiple barcoded cell samples.
                        CellDissociation The process of generating dissociated cells from an input tissue sample. This process could also introduce a tissue-source barcode (eg cell hashing), allowing mixing of cell dissociation samples at the cell barcoding step.
                        CellEnrichment The process of enriching a dissociated cell sample by including or excluding cells of different types based on an enrichment plan using techniques such as fluorescence-activated cell sorting (FACS). This process could also introduce a tissue-source barcode (eg cell hashing), allowing mixing of cell enriched samples at the cell barcoding step.
                        DissectionRoiDelineation The process of outlining a region of interest on a brain slab image to guide the dissection and generation of a tissue sample.
                        EnrichedCellSampleSplitting The process of splitting an enriched cell sample into several portions. Each portion may be used by the same or different groups for different scientific studies.
                        LibraryConstruction The process of constructing a library from input material (such as amplified cDNA or barcoded cell sample) derived from one or more cell samples. cDNA is fragmented into smaller pieces appropriate for sequencing and at the same time a library index barcode is incorporated to enable identification of library origin, allowing libraries to be pooled together for sequencing.
                        LibraryPooling The process of constructing of a libray pool by combining library aliquots from a set of input libraries. Each library aliquot in a library pool will have a unique R1/R2 index to allow for sequencing together then separating the sequencing output by originating library aliquot through the process of demultiplexing.
                        TissueDissection The process of dissecting a tissue sample from a brain slab guided by a dissection region of interest (ROI) delineation.
GenomicEntity None
MacromolecularMachineMixin A union of gene locus, gene product, and macromolecular complex. These are the basic units of function in a cell. They either carry out individual biological activities, or they encode molecules which do this.
        GeneOrGeneProduct A union of gene loci or gene products. Frequently an identifier for one will be used as proxy for another
OntologyClass a concept or class in an ontology, vocabulary or thesaurus. Note that nodes in a biolink compatible KG can be considered both instances of biolink classes, and OWL classes in their own right. In general you should not need to use this class directly. Instead, use the appropriate biolink class. For example, for the GO concept of endocytosis (GO:0006897), use bl:BiologicalProcess as the type.
        TaxonomicRank A descriptor for the rank within a taxonomic classification. Example instance: TAXRANK:0000017 (kingdom)
PhysicalEssenceOrOccurrent Either a physical or processual entity.
        Occurrent A processual entity.
                ActivityAndBehavior Activity or behavior of any independent integral living, organization or mechanical actor in the world
        PhysicalEssence Semantic mixin concept. Pertains to entities that have physical properties such as mass, volume, or charge.
ProvActivity An activity is something that occurs over a period of time and acts upon or with entities; it may include consuming, processing, transforming, modifying, relocating, using, or generating entities.
ProvEntity An entity is a physical, digital, conceptual, or other kind of thing with some fixed aspects; entities may be real or imaginary.
SubjectOfInvestigation An entity that has the role of being studied in an investigation, study, or experiment
ThingWithTaxon A mixin that can be used on any entity that can be taxonomically classified. This includes individual organisms; genes, their products and other molecular entities; body parts; biological processes

Slots

Slot Description
age_at_death_description Text description of the age of death following typical scientific convention ...
age_at_death_reference_point The reference point for an age interval; for example, birth or conception
age_at_death_unit The unit used for representing the donor age from the reference point
age_at_death_value The value representing the donor age from the reference point
annotates The brain slab that was annotated by the delineation process
attribute_name The human-readable 'attribute name' can be set to a string which reflects its...
average_size_bp Average size of the library in terms of base pairs
biological_sex Biological sex of donor at birth
category Name of the high level ontology class in which this entity is categorized
cell_prep_type The type of cell preparation
cell_source_oligo_name Name of cell source oligo used in cell plexing
checksum_algorithm The type of cryptographic hash function used to calculate the checksum value
concentration_nm Concentration of library in terms of nM (nMol/L)
content_url
creation_date date on which an entity was created
data_type The type of data in the file
deprecated A boolean flag indicating that an entity is no longer considered current or v...
description a human-readable description of an entity
digest Stores checksum information
dissection_was_guided_by The dissection ROI polygon that was used to guide the dissection
enrichment_population Actual percentage of cells as a result of using set of fluorescent marker lab...
format
full_name a long-form human readable name for a thing
has_attribute connects any entity to an attribute
has_attribute_type connects an attribute to a class that describes it
has_biological_sequence connects a genomic feature to its sequence
has_numeric_value connects a quantity value to a number
has_qualitative_value connects an attribute to a value
has_quantitative_value connects an attribute to a value
has_taxonomic_rank
has_unit connects a quantity value to a unit
histone_modification_marker Histone modification marker antibodies (eg H3K27ac, H3K27me3, H3K9me3) used i...
id A unique identifier for an entity
in_taxon connects an entity to its taxonomic classification
in_taxon_label The human readable scientific name for the taxon of the entity
input_quantity Number of enriched or dissociated cells/nuclei going into the barcoding proce...
input_quantity_ng Amount of cDNA going into library construction in nanograms
iri An IRI for an entity
license
local_tube_id Library Pool Tube local name
method Standardized nomenclature to describe the general barcoding method used
name A human-readable name for an attribute or entity
node_property A grouping for any property that holds between a node and a value
number_of_expected_cells Expected number of cells/nuclei of a barcoded_cell_sample that will be barcod...
pass_fail_result Pass or Fail result based on qualitative assessment of cDNA yield and size
pcr_cycles Number of PCR cycles used during cDNA amplification for this cDNA
percent_cdna_longer_than_400bp QC metric to measure mRNA degradation of cDNA
port_well Specific position of the loaded port of the 10x chip
process_date Date of cell dissociation process
provided_by The value in this node property represents the knowledge provider that create...
quantity_fmol Amount of library generated in terms of femtomoles
quantity_ng Amount of cDNA produced after cDNA amplification measured in nanograms
r1_r2_index Name of the pair of library indexes used for sequencing
related_to A relationship that is asserted between two named things
related_to_at_instance_level Represents a relationship held between two instances of a data classes
rights
set cDNA amplification set, containing multiple amplified_cDNA_names that were pr...
species Species of donor
structure Strucure of tissue sample
symbol Symbol for a particular thing
synonym Alternate human-readable names for a thing
type
used Usage is the beginning of utilizing an entity by an activity
value The checksum value obtained from a specific cryotographic hash function
version
was_derived_from A derivation is a transformation of an entity into another, an update of an e...
was_generated_by Generation is the completion of production of a new entity by an activity
was_guided_by The dissection ROI polygon which was used to guide the tissue dissection
xref A database cross reference or alternative identifier for a NamedThing or edge...

Enumerations

Enumeration Description
AgeAtDeathReferencePoint
AgeAtDeathUnit
AmplifiedCdnaRnaAmplificationPassFail
BarcodedCellSampleTechnique
DigestType
DissociatedCellSampleCellLabelBarcode
DissociatedCellSampleCellPrepType
LibraryPrepPassFail
LibraryR1R2Index
LibraryTechnique
Sex

Types

Type Description
BiologicalSequence
Boolean A binary (true or false) value
ChemicalFormulaValue A chemical formula
Curie a compact URI
Date a date (year, month and day) in an idealized calendar
DateOrDatetime Either a date or a datetime
Datetime The combination of a date and time
Decimal A real number with arbitrary precision that conforms to the xsd:decimal speci...
Double A real number that conforms to the xsd:double specification
Float A real number that conforms to the xsd:float specification
FrequencyValue
Integer An integer
IriType An IRI
Jsonpath A string encoding a JSON Path
Jsonpointer A string encoding a JSON Pointer
LabelType A string that provides a human-readable name for an entity
NarrativeText A string that provides a human-readable description of something
Ncname Prefix part of CURIE
Nodeidentifier A URI, CURIE or BNODE that represents a node in a model
Objectidentifier A URI or CURIE that represents an object in the model
PercentageFrequencyValue
PredicateType A CURIE from the biolink related_to hierarchy
Quotient
Sparqlpath A string encoding a SPARQL Property Path
String A character string
SymbolType
Time A time object represents a (local) time of day, independent of any particular...
TimeType
Unit
Uri a complete URI
Uriorcurie a URI or a CURIE

Subsets

Subset Description
Alignment A subset of slots/attributes that are required for alignment
Analysis A subset of slots/attributes that are required for analysis
Bican A subset of classes that are associated with BICAN
LibraryGeneration A subset of classes that are associated with library generation
ModelOrganismDatabase Subset that is relevant for a typical Model Organism Database (MOD)
ProcessingElements A subset of classes that are associated with processing
Samples Sample/biosample datamodel
SequencingElements A subset of classes that are associated with sequencing
Testing TBD
TissueSpecimen A subset of classes that are associated with tissue specimens
Tracking A subset of slots/attributes that are required for tracking
TranslatorMinimal Minimum subset of translator work