FHIR Chat · transcript and amino acid variants · genomics

Stream: genomics

Topic: transcript and amino acid variants


view this post on Zulip Larry Babb (Sep 17 2019 at 12:37):

The ObsVariant profile provides a lot of levers to share different kinds of variation SNPs, InDels, CNVs, etc...
It appears as if a subset of the component fields aim to provide a computationally reliable representation of the genomic-level sequence, position, ref and alt states of the variant. As most folks know this set of fields is critical at accurately representing the genomic data that defines the variant change and can provide valuable computational utility of the information in downstream processes.

However, it does not appear as if the same component structure is available for a transcript only or amino acid only representation. If the ObsVariant is only meant to accurately reflect genomic sequence observations then this might make sense, but if the ObsVariant is meant to be able to pass just about any type of sequence variation regardless of the type of sequence molecule then you may want to consider how to handle the components that would correspond to attributes like transcript start-end, transcript ref, transcript alt, amino acid start-end, amino acid ref, amino acid alt.

This is one of the many forthcoming challenges with using ObsVariant to share definitional data accurately and precisely in a standard form within HL7's FHIR resources.

Suggestion: If there was a set of classes or data types to define a sequence allele with precise and/or imprecise end points such that the underlying sequence could be representative of any type of molecule that contained a sequence of residues, then this might provide a foundational structure for building the accurate genomic representations that are targeted in testing, linked to observed findings and linked to variant evidence and knowledge that is inherently represented in reporting and the upstream processes used to fulfill clinical assessments used by clinical reporting.

Having these critical definitional structures embedded in these higher-order observations that add additional context and transform the definitional representation into case-specific or instantiated forms of variants makes it virtually impossible to reasonable share the breadth of data structures and associated evidence/knowledge needed to share computationally useful genetic results.

view this post on Zulip Bret H (Sep 17 2019 at 13:42):

who actually sequences amino acids currently? is there really mass spec data being passed around? Consider the comments in ObsVaraint transcript/aa component.

view this post on Zulip Larry Babb (Sep 17 2019 at 14:07):

In the VICC they often get folks reporting amino acid forms of variation, on which they attach evidence, annotations and knowledge for sharing with labs that may or may not use it to report on results and findings for somatic testing.

My understanding is that while they may not actually sequence amino acids, the data and knowledge in these variant level databases may only be reported based on the amino acid sequence. If so, then wouldn't HL7 need to provide a means for representing these forms of variation?

Some reporting labs may choose to only report back the amino acid variation form. Without the basis of the amino acid sequence and its start/stop positions, one would have to rely solely on the p. HGVS expression (I presume). How would downstream systems and clinical research efforts ever reliably find and understand what that data is without a structured computational form that is consistent and alignable to some reference?


Last updated: Apr 12 2022 at 19:14 UTC