Clinical Genomics Resource Incubator
0.1.0-ci-build - CI Build International flag

Clinical Genomics Resource Incubator, published by HL7 International / Clinical Genomics. This guide is not an authorized publication; it is the continuous build for version 0.1.0-ci-build built by the FHIR (HL7® FHIR® Standard) CI Build. This version is based on the current content of https://github.com/HL7/cg-incubator/ and changes regularly. See the Directory of published versions

: Genomic Study Data Format

Page standards status: Draft Maturity Level: 1

Raw json | Download

{
  "resourceType" : "CodeSystem",
  "id" : "genomicstudy-dataformat",
  "meta" : {
    "lastUpdated" : "2021-01-05T10:01:24.148+11:00",
    "profile" : [
      🔗 "http://hl7.org/fhir/StructureDefinition/shareablecodesystem"
    ]
  },
  "language" : "en",
  "text" : {
    "status" : "generated",
    "div" : "<div xmlns=\"http://www.w3.org/1999/xhtml\"><p class=\"res-header-id\"><b>Generated Narrative: CodeSystem genomicstudy-dataformat</b></p><a name=\"genomicstudy-dataformat\"> </a><a name=\"hcgenomicstudy-dataformat\"> </a><div style=\"display: inline-block; background-color: #d9e0e7; padding: 6px; margin: 4px; border: 1px solid #8da1b4; border-radius: 5px; line-height: 60%\"><p style=\"margin-bottom: 0px\">Last updated: 2021-01-05 10:01:24+1100</p><p style=\"margin-bottom: 0px\">Profile: <a href=\"http://hl7.org/fhir/R5/shareablecodesystem.html\">Shareable CodeSystem</a></p></div><p>This case-sensitive code system <code>http://hl7.org/fhir/uv/cg-incubator/CodeSystem/genomicstudy-dataformat</code> defines the following codes:</p><table class=\"codes\"><tr><td style=\"white-space:nowrap\"><b>Code</b></td><td><b>Display</b></td><td><b>Definition</b></td></tr><tr><td style=\"white-space:nowrap\">bam<a name=\"genomicstudy-dataformat-bam\"> </a></td><td>BAM</td><td>Binary Alignment/Map format for storing read alignments against reference sequences.</td></tr><tr><td style=\"white-space:nowrap\">bed<a name=\"genomicstudy-dataformat-bed\"> </a></td><td>BED</td><td>Browser Extensible Data format for representing genomic regions and associated annotations.</td></tr><tr><td style=\"white-space:nowrap\">bedpe<a name=\"genomicstudy-dataformat-bedpe\"> </a></td><td>BEDPE</td><td>Paired-End BED format for representing pairwise genomic interactions.</td></tr><tr><td style=\"white-space:nowrap\">bedgraph<a name=\"genomicstudy-dataformat-bedgraph\"> </a></td><td>BedGraph</td><td>BED Graph format for representing genomic signals as continuous-valued data.</td></tr><tr><td style=\"white-space:nowrap\">bigbed<a name=\"genomicstudy-dataformat-bigbed\"> </a></td><td>bigBed</td><td>Binary indexed BED format for efficiently storing large amounts of genomic region data.</td></tr><tr><td style=\"white-space:nowrap\">bigWig<a name=\"genomicstudy-dataformat-bigWig\"> </a></td><td>bigWig</td><td>Binary indexed Wig format for efficiently storing large amounts of continuous-valued genomic data.</td></tr><tr><td style=\"white-space:nowrap\">birdsuite-files<a name=\"genomicstudy-dataformat-birdsuite-files\"> </a></td><td>Birdsuite-Files</td><td>File format used by the Birdsuite suite of software for SNP genotyping and copy number analysis.</td></tr><tr><td style=\"white-space:nowrap\">broadpeak<a name=\"genomicstudy-dataformat-broadpeak\"> </a></td><td>broadPeak</td><td>BED format variant for representing broad peaks in ChIP-Seq data.</td></tr><tr><td style=\"white-space:nowrap\">cbs<a name=\"genomicstudy-dataformat-cbs\"> </a></td><td>CBS</td><td>Copy number data format output by Circular Binary Segmentation analysis.</td></tr><tr><td style=\"white-space:nowrap\">chemical-reactivity-probing-profiles<a name=\"genomicstudy-dataformat-chemical-reactivity-probing-profiles\"> </a></td><td>Chemical-Reactivity-Probing-Profiles</td><td>Profiles of chemical reactivity for RNA structure analysis.</td></tr><tr><td style=\"white-space:nowrap\">chrom-sizes<a name=\"genomicstudy-dataformat-chrom-sizes\"> </a></td><td>chrom-sizes</td><td>File listing chromosome names and their sizes.</td></tr><tr><td style=\"white-space:nowrap\">cn<a name=\"genomicstudy-dataformat-cn\"> </a></td><td>CN</td><td>Copy number data format.</td></tr><tr><td style=\"white-space:nowrap\">custom-file-formats<a name=\"genomicstudy-dataformat-custom-file-formats\"> </a></td><td>Custom-File-Formats</td><td>User-defined or proprietary file formats for genomic data.</td></tr><tr><td style=\"white-space:nowrap\">cytoband<a name=\"genomicstudy-dataformat-cytoband\"> </a></td><td>Cytoband</td><td>Chromosome cytogenetic band locations and characteristics.</td></tr><tr><td style=\"white-space:nowrap\">fasta<a name=\"genomicstudy-dataformat-fasta\"> </a></td><td>FASTA</td><td>Format for representing sequences of nucleic acids or proteins using single letter codes.</td></tr><tr><td style=\"white-space:nowrap\">gct<a name=\"genomicstudy-dataformat-gct\"> </a></td><td>GCT</td><td>Gene Cluster Text format for storing gene expression data.</td></tr><tr><td style=\"white-space:nowrap\">cram<a name=\"genomicstudy-dataformat-cram\"> </a></td><td>CRAM</td><td>Compressed Reference-Aligned Map format for storing read alignments more compactly than BAM.</td></tr><tr><td style=\"white-space:nowrap\">genepred<a name=\"genomicstudy-dataformat-genepred\"> </a></td><td>genePred</td><td>Format for storing gene predictions with exon and CDS information.</td></tr><tr><td style=\"white-space:nowrap\">gff-gtf<a name=\"genomicstudy-dataformat-gff-gtf\"> </a></td><td>GFF/GTF</td><td>General Feature Format / Gene Transfer Format for storing genomic features and annotations.</td></tr><tr><td style=\"white-space:nowrap\">gistic<a name=\"genomicstudy-dataformat-gistic\"> </a></td><td>GISTIC</td><td>Genomic Identification of Significant Targets in Cancer output format for copy number analysis.</td></tr><tr><td style=\"white-space:nowrap\">goby<a name=\"genomicstudy-dataformat-goby\"> </a></td><td>Goby</td><td>Compact file format for storing read alignments, variations, and base quality information.</td></tr><tr><td style=\"white-space:nowrap\">gwas<a name=\"genomicstudy-dataformat-gwas\"> </a></td><td>GWAS</td><td>Genome-Wide Association Study format for storing association results.</td></tr><tr><td style=\"white-space:nowrap\">igv<a name=\"genomicstudy-dataformat-igv\"> </a></td><td>IGV</td><td>Integrative Genomics Viewer session or display format.</td></tr><tr><td style=\"white-space:nowrap\">loh<a name=\"genomicstudy-dataformat-loh\"> </a></td><td>LOH</td><td>Loss of Heterozygosity data format.</td></tr><tr><td style=\"white-space:nowrap\">maf-multiple-alignment-format<a name=\"genomicstudy-dataformat-maf-multiple-alignment-format\"> </a></td><td>MAF-Multiple Alignment Format</td><td>Multiple Alignment Format for storing aligned sequences.</td></tr><tr><td style=\"white-space:nowrap\">maf-mutation-annotation-format<a name=\"genomicstudy-dataformat-maf-mutation-annotation-format\"> </a></td><td>MAF-Mutation-Annotation-Format</td><td>Mutation Annotation Format for storing somatic mutation data.</td></tr><tr><td style=\"white-space:nowrap\">merged-bam-file<a name=\"genomicstudy-dataformat-merged-bam-file\"> </a></td><td>Merged BAM File</td><td>BAM file containing read alignments from multiple samples or lanes merged together.</td></tr><tr><td style=\"white-space:nowrap\">mut<a name=\"genomicstudy-dataformat-mut\"> </a></td><td>MUT</td><td>Mutation data format.</td></tr><tr><td style=\"white-space:nowrap\">narrowpeak<a name=\"genomicstudy-dataformat-narrowpeak\"> </a></td><td>narrowPeak</td><td>BED format variant for representing narrow peaks in ChIP-Seq data.</td></tr><tr><td style=\"white-space:nowrap\">psl<a name=\"genomicstudy-dataformat-psl\"> </a></td><td>PSL</td><td>Pattern Space Layout format for storing sequence alignments.</td></tr><tr><td style=\"white-space:nowrap\">res<a name=\"genomicstudy-dataformat-res\"> </a></td><td>RES</td><td>Resolution data format.</td></tr><tr><td style=\"white-space:nowrap\">rna-secondary-structure-formats<a name=\"genomicstudy-dataformat-rna-secondary-structure-formats\"> </a></td><td>RNA-Secondary-Structure-Formats</td><td>Formats for representing RNA secondary structure information.</td></tr><tr><td style=\"white-space:nowrap\">sam<a name=\"genomicstudy-dataformat-sam\"> </a></td><td>SAM</td><td>Sequence Alignment/Map format for storing read alignments, the uncompressed version of BAM.</td></tr><tr><td style=\"white-space:nowrap\">sample-info-attributes-file<a name=\"genomicstudy-dataformat-sample-info-attributes-file\"> </a></td><td>Sample-Info-Attributes-file</td><td>File containing sample information and attributes.</td></tr><tr><td style=\"white-space:nowrap\">seg<a name=\"genomicstudy-dataformat-seg\"> </a></td><td>SEG</td><td>Segmented data format for storing copy number or other segmented genomic data.</td></tr><tr><td style=\"white-space:nowrap\">tdf<a name=\"genomicstudy-dataformat-tdf\"> </a></td><td>TDF</td><td>Tiled Data Format for efficient storage and display of large genomic datasets.</td></tr><tr><td style=\"white-space:nowrap\">track-line<a name=\"genomicstudy-dataformat-track-line\"> </a></td><td>Track Line</td><td>UCSC Genome Browser track line header defining display properties for genomic data.</td></tr><tr><td style=\"white-space:nowrap\">type-line<a name=\"genomicstudy-dataformat-type-line\"> </a></td><td>Type Line</td><td>Type line header for defining genomic data track properties.</td></tr><tr><td style=\"white-space:nowrap\">vcf<a name=\"genomicstudy-dataformat-vcf\"> </a></td><td>VCF</td><td>Variant Call Format for storing variant information including SNPs, indels, and structural variations.</td></tr><tr><td style=\"white-space:nowrap\">wig<a name=\"genomicstudy-dataformat-wig\"> </a></td><td>WIG</td><td>Wiggle Track format for storing continuous-valued genomic data.</td></tr></table></div>"
  },
  "extension" : [
    {
      "url" : "http://hl7.org/fhir/StructureDefinition/structuredefinition-wg",
      "valueCode" : "cg"
    },
    {
      "url" : "http://hl7.org/fhir/StructureDefinition/structuredefinition-standards-status",
      "valueCode" : "draft"
    },
    {
      "url" : "http://hl7.org/fhir/StructureDefinition/structuredefinition-fmm",
      "valueInteger" : 1
    }
  ],
  "url" : "http://hl7.org/fhir/uv/cg-incubator/CodeSystem/genomicstudy-dataformat",
  "identifier" : [
    {
      "system" : "urn:ietf:rfc:3986",
      "value" : "urn:oid:2.16.840.1.113883.4.642.4.1978"
    }
  ],
  "version" : "0.1.0-ci-build",
  "name" : "GenomicStudyDataFormat",
  "title" : "Genomic Study Data Format",
  "status" : "draft",
  "experimental" : true,
  "date" : "2022-08-17T14:49:24-05:00",
  "publisher" : "HL7 International / Clinical Genomics",
  "contact" : [
    {
      "name" : "HL7 International / Clinical Genomics",
      "telecom" : [
        {
          "system" : "url",
          "value" : "http://www.hl7.org/Special/committees/clingenomics"
        },
        {
          "system" : "email",
          "value" : "clingenomics@lists.hl7.org"
        }
      ]
    }
  ],
  "description" : "The data format relevant to genomics. These formats and relevant codes were pulled from [Integrative Genomics Viewer Documentation](https://software.broadinstitute.org/software/igv/FileFormats) by Broad Institute.",
  "jurisdiction" : [
    {
      "coding" : [
        {
          "system" : "http://unstats.un.org/unsd/methods/m49/m49.htm",
          "code" : "001",
          "display" : "World"
        }
      ]
    }
  ],
  "caseSensitive" : true,
  "valueSet" : "http://hl7.org/fhir/uv/cg-incubator/ValueSet/genomicstudy-dataformat",
  "content" : "complete",
  "concept" : [
    {
      "code" : "bam",
      "display" : "BAM",
      "definition" : "Binary Alignment/Map format for storing read alignments against reference sequences."
    },
    {
      "code" : "bed",
      "display" : "BED",
      "definition" : "Browser Extensible Data format for representing genomic regions and associated annotations."
    },
    {
      "code" : "bedpe",
      "display" : "BEDPE",
      "definition" : "Paired-End BED format for representing pairwise genomic interactions."
    },
    {
      "code" : "bedgraph",
      "display" : "BedGraph",
      "definition" : "BED Graph format for representing genomic signals as continuous-valued data."
    },
    {
      "code" : "bigbed",
      "display" : "bigBed",
      "definition" : "Binary indexed BED format for efficiently storing large amounts of genomic region data."
    },
    {
      "code" : "bigWig",
      "display" : "bigWig",
      "definition" : "Binary indexed Wig format for efficiently storing large amounts of continuous-valued genomic data."
    },
    {
      "code" : "birdsuite-files",
      "display" : "Birdsuite-Files",
      "definition" : "File format used by the Birdsuite suite of software for SNP genotyping and copy number analysis."
    },
    {
      "code" : "broadpeak",
      "display" : "broadPeak",
      "definition" : "BED format variant for representing broad peaks in ChIP-Seq data."
    },
    {
      "code" : "cbs",
      "display" : "CBS",
      "definition" : "Copy number data format output by Circular Binary Segmentation analysis."
    },
    {
      "code" : "chemical-reactivity-probing-profiles",
      "display" : "Chemical-Reactivity-Probing-Profiles",
      "definition" : "Profiles of chemical reactivity for RNA structure analysis."
    },
    {
      "code" : "chrom-sizes",
      "display" : "chrom-sizes",
      "definition" : "File listing chromosome names and their sizes."
    },
    {
      "code" : "cn",
      "display" : "CN",
      "definition" : "Copy number data format."
    },
    {
      "code" : "custom-file-formats",
      "display" : "Custom-File-Formats",
      "definition" : "User-defined or proprietary file formats for genomic data."
    },
    {
      "code" : "cytoband",
      "display" : "Cytoband",
      "definition" : "Chromosome cytogenetic band locations and characteristics."
    },
    {
      "code" : "fasta",
      "display" : "FASTA",
      "definition" : "Format for representing sequences of nucleic acids or proteins using single letter codes."
    },
    {
      "code" : "gct",
      "display" : "GCT",
      "definition" : "Gene Cluster Text format for storing gene expression data."
    },
    {
      "code" : "cram",
      "display" : "CRAM",
      "definition" : "Compressed Reference-Aligned Map format for storing read alignments more compactly than BAM."
    },
    {
      "code" : "genepred",
      "display" : "genePred",
      "definition" : "Format for storing gene predictions with exon and CDS information."
    },
    {
      "code" : "gff-gtf",
      "display" : "GFF/GTF",
      "definition" : "General Feature Format / Gene Transfer Format for storing genomic features and annotations."
    },
    {
      "code" : "gistic",
      "display" : "GISTIC",
      "definition" : "Genomic Identification of Significant Targets in Cancer output format for copy number analysis."
    },
    {
      "code" : "goby",
      "display" : "Goby",
      "definition" : "Compact file format for storing read alignments, variations, and base quality information."
    },
    {
      "code" : "gwas",
      "display" : "GWAS",
      "definition" : "Genome-Wide Association Study format for storing association results."
    },
    {
      "code" : "igv",
      "display" : "IGV",
      "definition" : "Integrative Genomics Viewer session or display format."
    },
    {
      "code" : "loh",
      "display" : "LOH",
      "definition" : "Loss of Heterozygosity data format."
    },
    {
      "code" : "maf-multiple-alignment-format",
      "display" : "MAF-Multiple Alignment Format",
      "definition" : "Multiple Alignment Format for storing aligned sequences."
    },
    {
      "code" : "maf-mutation-annotation-format",
      "display" : "MAF-Mutation-Annotation-Format",
      "definition" : "Mutation Annotation Format for storing somatic mutation data."
    },
    {
      "code" : "merged-bam-file",
      "display" : "Merged BAM File",
      "definition" : "BAM file containing read alignments from multiple samples or lanes merged together."
    },
    {
      "code" : "mut",
      "display" : "MUT",
      "definition" : "Mutation data format."
    },
    {
      "code" : "narrowpeak",
      "display" : "narrowPeak",
      "definition" : "BED format variant for representing narrow peaks in ChIP-Seq data."
    },
    {
      "code" : "psl",
      "display" : "PSL",
      "definition" : "Pattern Space Layout format for storing sequence alignments."
    },
    {
      "code" : "res",
      "display" : "RES",
      "definition" : "Resolution data format."
    },
    {
      "code" : "rna-secondary-structure-formats",
      "display" : "RNA-Secondary-Structure-Formats",
      "definition" : "Formats for representing RNA secondary structure information."
    },
    {
      "code" : "sam",
      "display" : "SAM",
      "definition" : "Sequence Alignment/Map format for storing read alignments, the uncompressed version of BAM."
    },
    {
      "code" : "sample-info-attributes-file",
      "display" : "Sample-Info-Attributes-file",
      "definition" : "File containing sample information and attributes."
    },
    {
      "code" : "seg",
      "display" : "SEG",
      "definition" : "Segmented data format for storing copy number or other segmented genomic data."
    },
    {
      "code" : "tdf",
      "display" : "TDF",
      "definition" : "Tiled Data Format for efficient storage and display of large genomic datasets."
    },
    {
      "code" : "track-line",
      "display" : "Track Line",
      "definition" : "UCSC Genome Browser track line header defining display properties for genomic data."
    },
    {
      "code" : "type-line",
      "display" : "Type Line",
      "definition" : "Type line header for defining genomic data track properties."
    },
    {
      "code" : "vcf",
      "display" : "VCF",
      "definition" : "Variant Call Format for storing variant information including SNPs, indels, and structural variations."
    },
    {
      "code" : "wig",
      "display" : "WIG",
      "definition" : "Wiggle Track format for storing continuous-valued genomic data."
    }
  ]
}