Stream: genomics
Topic: COVID19 viral sequences?
Bret H (Apr 10 2020 at 13:20):
Posted to Listserv but looking for input here
Bret H (Apr 10 2020 at 13:20):
To enable tracking of virus spread and potentially targeted therapies, sharing the genetic information of a positive COVID19 test is desirable. Right now, there is not great guidance on providing discrete genetic data for microorganisms. Given the urgency. Until there is more formal guidance, for the statement: "Virus observed in patient A using specimen X, the virus is SARS-CoV-2, and the genetic sequence as evidence of this is P." The following could be used with FHIR R4 and has the ability to be retrieved from a system with a FHIR query.
With a Lab observation of a positive COVID19 test, here are two options ObservationVariant or MolecularSequence. Use Observation.dervierdFrom
pointing to either ObservationVariant (http://hl7.org/fhir/uv/genomics-reporting/variant.html)
where reference, position, and change are used
CHANGE: component:ref-allele and component:alt-allele
REFERENCE: component:genomic-ref-seq (LONIC code 48013-7)
POSITION: component:coordinate-system, component:variant-length, and either component:exact-start-end or component:outer-start-end or component:inner-start-end
OR
pointing to MolecularSequence (https://www.hl7.org/fhir/molecularsequence.html)
where reference is used to be more expressive position, change can be used
SEQUENCING DATA: MolecularSequence.observedSeq allows for large chunks of sequence data (such as an entire viral genome) to be related
MINIMALLY for REFERENCE MolecularSequence.referenceSeq.referenceSeqId
Additionally: MolecularSequence.repository can be used to provide a link to a sequencing data file or repository.
A genomic reference for SARS-CoV-2 is code "NC_045512" with codesystem http://www.ncbi.nlm.nih.gov/refseq
*note a key is use a proper reference for SARS-CoV-2, the name of the coronavirus associated with COVID19.
I would recommend ObservationVariant for deviations from a reference virus genome. MolecularSequence when a large contiguous chunk of sequence data is being reported (as in the case with sequencing the entire viral genome).
To retrieve data, use a query on the parent Observation with an appropriate list of LOINC codes for COVID19 detection and followed the derviedFrom link to MolecularSequence or ObservationVaraint.
OR
use a query for Observations and MolecularSequence with a SARS-CoV-2 sequence reference value for the reference id )such as code "NC_045512" with codesystem "http://www.ncbi.nlm.nih.gov/refseq"). The field for reference is MolecularSequnece.referenceSeq.referenceSeqId OR Observation.component:genomic-ref-seq.code = LOINC code 48013-7 and observation.component:genomic-ref-seq.value.
Example ObservationVariant query
GET /Observation?subject:Patient=123&component-code-value-concept=http://loinc.org|48013-7$http://www.ncbi.nlm.nih.gov/refseq|NC_045512
Example MolecularSequence query
GET
/MolecularSequence:Pateint=123&MolecularSequence.referenceSeq.referenceSeqId=http://loinc.org|48013-7$http://www.ncbi.nlm.nih.gov/refseq|NC_045512
Bob Milius (Apr 10 2020 at 14:10):
I like MolecularSequence for describing a set of variants of one sequence. But EHRs don't support MS. Not sure if that's important. I can describe a set of variants in one resource instance. Otherwise I would have to have a separate Observation for each variant. So if my reference is the whole viral sequence, I can describe many variations in the the new sequence, e.g.,
{ "resourceType": "MolecularSequence", "type" : "dna", "coordinateSystem": 1, "referenceSeq": { "referenceSeqId": { "coding": [ { "system": "http://www.ncbi.nlm.nih.gov/refseq", "code": "NC_045512.2" } ]}}, "variant": [ { "start": 60, "end": 60, "observedAllele": "A", "referenceAllele": "T" }, { "start": 27421, "end": 27423, "observedAllele": "TAC", "referenceAllele": "ATA" }, ...etc... ] }
Patrick Werner (Apr 10 2020 at 18:15):
hmmmm. To me a positive covid test is an Observation. I didn't get the need to point to variant. If the virus was detected i maybe want to send the exact RNA pattern as a proof or for research. The actual Sequence is either a MolecularSequence or a Binary containing the sequence.
Patrick Werner (Apr 10 2020 at 18:15):
Just spotted the "from the virus genome", yes then Variant would be correct.
Patrick Werner (Apr 10 2020 at 18:17):
This reminds me that we still have to discuss the scope of MolecularSequence. I still don't like the redundancy to Variant. Maybe MolecularSequence could be scoped down, to actually represent a Sequence and using Variants to describe Variations to this Sequence.
Bob Milius (Apr 10 2020 at 18:20):
re MolecularSequence, I really like being able to list all the variants in a single resource, akin to a VCF file, which is what I can do in MS. Using Variant observation requires me to send a separate resource for each variant. This is overkill.
Jamie Jones (Apr 10 2020 at 18:21):
Sequence is currently potentially a 4th way for Obs-variant to communicate the definitional variant observed in a patient.
Patrick Werner (Apr 10 2020 at 18:22):
I get that point, but still think that it doesn't fit in the rest of our IG. For example: you can't point to this variants from Implications and other profiles.
Jamie Jones (Apr 10 2020 at 18:24):
I agree it has great potential use in being a more concise way to send multiple variants, especially if we give it some support. Also agree it doesn't match the rest of our approach very well
Patrick Werner (Apr 10 2020 at 18:24):
Also the example stated above replicates Variant usage with obs and ref allele + position + refSeq. It is the same, but instead of a Bundle it is a list stuck inside of MolSeq, not usable as individual Information objects. Somehow similar to contained resources.
Jamie Jones (Apr 10 2020 at 18:26):
it's closer to a VCF file. You could make multiple Observations based on it
Bret H (Apr 10 2020 at 20:57):
https://www.ncbi.nlm.nih.gov/bioproject/614995 is an example repository of genetic sequences. Right now the Utah Health Department is doing all our state's sequencing. They upload new virus samples as they go. And this is kinda cool as an end goal: https://nextstrain.org/ncov/global This works for treating things as isolates, but loses the clinical data associated with the patient - you don't know who or what test - which is what would come from a top level observation to is using derivedFrom to point to MolecularSequence or ObservationVariant.
Bret H (Apr 10 2020 at 20:59):
(deleted)
Jamie Jones (Apr 10 2020 at 20:59):
Any genomic elements in the covid IG others have been working on?
Bret H (Apr 10 2020 at 21:01):
I only know this one COVID19 IG from Logica https://covid-19-ig.logicahealth.org/?mc_cid=15d567de77&mc_eid=4a139423cc
Bret H (Apr 10 2020 at 21:02):
no advice yet in it
Jamie Jones (Apr 10 2020 at 21:02):
There's an audacious inquiry one somewhere too
Vassil Peytchev (Apr 10 2020 at 21:34):
Kevin Power (Apr 10 2020 at 21:47):
Probably would find it, but here is that IG:
http://build.fhir.org/ig/AudaciousInquiry/fhir-saner/branches/master/index.html
Bret H (Apr 11 2020 at 05:36):
@ Bob Milius suggested a variation on where to put variant which while different from how the current IG handles human sequences would limit the advice to using on MolecularSequence. Right now there is not a lot of sharing that includes clinical, patient level data, isolates of the virus are identified by the sequencing lab and the sequences posted to repositories. This leads me more towards MolecularSequence.
Bob Milius (Apr 11 2020 at 13:09):
I'm not sure what you mean by "limit the advice to using MolecularSequence." You can create an Observation with derivedFrom MolecularSequence and put the advice in that Obs, correct? And put both in a bundle.
Patrick Werner (Apr 13 2020 at 16:14):
Reporting the existence of a covid19 infection is handled by several other IGs atm (PROMs from Switzerland, Logica for Infection reporting and some more)
Patrick Werner (Apr 13 2020 at 16:15):
If a use-case needs the exchange of actual sequences, they could go with BAM files or VCF + refSequence.
Patrick Werner (Apr 13 2020 at 16:16):
If there is a need to express this in FHIR you could use MolecularSequence.
Patrick Werner (Apr 13 2020 at 16:18):
If there are implications which have to be reported, you would create these and point them to MolSeq via derivedFrom (because it is sliced open)
Patrick Werner (Apr 13 2020 at 16:18):
If you need to point to a specific variant, you create a Variant Obs, pointing to MolSeq, and point the Implication to the Variant Obs
Patrick Werner (Apr 13 2020 at 16:24):
I think this would be an important guidance in general.
Bret H (Apr 28 2020 at 15:20):
great! so the span of options provided at the start of this thread (along with query guidance provided) sounds acceptable. Shall I post to the more general Zulip thread? Do we need a committee vote on a guidance blurb? Or some other blessing from the committee?
Jamie Jones (Apr 28 2020 at 15:32):
I think we will need to tighten it up with some examples. Also, where would this guidance be going (in the case we get something meaningful and helpful to say)
Bret H (Apr 28 2020 at 15:38):
@Jamie Jones just posting in the more generalized Zuip chat...but you co-chairs have more outlets for publication of information such as our webpages on the HL7 site etc... : ^ ) .... I can cook-up an example reference COIVD19 genome MolSeq profile for reference by a 'basic' lab positive observation (i'd be borrowing from Logica's draft IG).
Bret H (May 05 2020 at 12:47):
LOGIC focused on probe detection. not sure if its too far from the case, but on Friday CDC issued statement on ramping up viral genome collection: https://www.cdc.gov/media/releases/2020/p0501-SARS-CoV-2-transmission-map.html
Swapna Abhyankar (May 05 2020 at 13:04):
We created a LOINC code (currently in prerelease) for SARS-CoV-2 whole genome sequence: SARS coronavirus 2 whole genome [Nucleotide sequence] in Isolate by Sequencing (https://loinc.org/94764-8/)
Bret H (May 05 2020 at 13:16):
FYI: please post on https://chat.fhir.org/#narrow/stream/226195-Covid-19-Response/topic/Viral.20whole.20genome.20sequencing from here on. Thanks.
Kevin Power (May 05 2020 at 13:21):
Not sure its necessary to move it, but since you did, sure :slight_smile:
Last updated: Apr 12 2022 at 19:14 UTC