FHIR Chat · genotype and haplotype · genomics

Stream: genomics

Topic: genotype and haplotype


view this post on Zulip Bob Milius (Mar 28 2018 at 21:25):

In the CG IG, figure 4 - genetic findings, there's a Genotype 844413-4 and a Haplotype 844414-2 observation. Are these observation.codes or observation.component.codes? I'm not sure how to say an observation is a genotype or a haplotype.

view this post on Zulip Kevin Power (Mar 28 2018 at 21:36):

Those would be used as Observation.code. You would use Observation.valueString to represent the value of the genotype/haplotype, and then use the components listed in the 'Computable Genetic Finding' profile (gene, chromosome, dna region) as Observation.components.code values on the same observation.

view this post on Zulip Kevin Power (Mar 28 2018 at 21:39):

In general, the LOINC mentioned in the diagrams in the title would be Observation.code. A better option is to review the profiles themselves for guidance like this.

view this post on Zulip Kevin Power (Mar 28 2018 at 21:41):

And now that you made me look at this, I notice a difference. For Haplotype and Genotype, the diagram says to use valueString, but the profiles say to use valueCodeableConcept. @Lloyd McKenzie ?

view this post on Zulip Kevin Power (Mar 28 2018 at 21:45):

Also @Lloyd McKenzie - I think we noticed this before, but I see that 'obs-base' and 'obs-comp-gen-finding' and 'obs-genotype' (and likely others) show up on the profile pages with the display name of "CG Observations"? Not sure where that is pulled from in the spreadsheets, but might be good to change if we can?

view this post on Zulip Kevin Power (Mar 28 2018 at 21:52):

It gets pretty confusing because on the profiles (like obs-genotype), it shows as the structure being derived from "CG Observations", but then looking at the URL shows its (correctly) pointing at obs-comp-gen-finding .

It also then shows "CG Observations" multiple times as valid types for the .derivedFrom reference to other resources as well.

view this post on Zulip Bob Milius (Mar 28 2018 at 21:54):

I click on the Differential tab. There, if I hover over the different CG Observations, it shows they are pointing to different structure definitions

view this post on Zulip Kevin Power (Mar 28 2018 at 21:59):

Yup, the actual links are correct. Its the display that is confusing because they all say the same thing. Am hoping that is easy to fix, but will likely to have let Lloyd weigh in.

view this post on Zulip Lloyd McKenzie (Mar 28 2018 at 22:13):

valueCodeableConcept. Diagrams will be fixed on my next commit

view this post on Zulip Lloyd McKenzie (Mar 28 2018 at 22:13):

@Kevin Power Can you add a todo on the spreadsheet on "CG Observations"?

view this post on Zulip Kevin Power (Mar 28 2018 at 22:21):

Done

view this post on Zulip Kevin Power (Mar 28 2018 at 22:22):

And, shout outs to @Bret H and @ Bob Milius for loading up our issue list today :slight_smile:

view this post on Zulip Bob Dolin (Oct 30 2021 at 17:48):

@Kevin Power Do you know if the intent of FHIR-14315 and FHIR-14316 were to include location coordinates for haplotypes and genotypes? Reason I ask is that I was trying to represent an intergenic haplotype, and I'm not sure I can do it without a refSeq and locations (https://pubmed.ncbi.nlm.nih.gov/23959643/).

view this post on Zulip Kevin Power (Oct 30 2021 at 19:21):

@Bob Dolin - TBH, I am not sure. For the most part, we have been using Haplotype/Genotype more as a container to ultimately represent a list of Variants, which is where you would find the refSeq and locations. However, perhaps that is not always appropriate.

I can't get to the full article. Do you mind presenting more detail on how you think we should represent haplotypes and genotypes in this use case? Are you thinking about adding some combination of genomic-ref-seq, coordinate-system, chromosome-identifer, exact-start-end, inner-start-end, outer-start-end that are on Variant to Haplotype and Genotype?

And sorry if this is a dumb question, but could you represent the intergenic variant(s) with the Variant profile, then group them with Haplotype and/or Genotype?

Once again, I think you are on the edges or even a bit outside of what we were intending to support :smile:

view this post on Zulip Bob Dolin (Oct 30 2021 at 19:36):

Thanks @Kevin Power . To be honest, I'm still thinking it through as well. (a preprint of article is here: https://digitalcommons.wayne.edu/cgi/viewcontent.cgi?article=1024&context=humbiol_preprints)

In this case, there are a number of haplotypes defined over an intergenic region between EDA2R and AR genes. The haplotype is defined based on ~100 SNPs. It might be impractical to list each SNP as a variant if, for instance, I only want to communicate that patient has the 'EDA2R/AR alpha haplotype'. But I'd also like to include something such as a location that will allow someone to search and find this haplotype if they need to.

I was thinking of perhaps moving the coordinate-system, genomic-ref-seq, and exact-start-end components from Variant up into Genomic Finding might be a good start.

view this post on Zulip Kevin Power (Oct 30 2021 at 19:44):

We would certainly have to look at everything else that is based on Finding.

view this post on Zulip Bob Dolin (Oct 30 2021 at 19:46):

It gets worse - the current genes component is 0..*, and in the draft GA4GH model, a genotype can include multiple refSeqs. Sounds like possibly a topic for the next round...

view this post on Zulip Kevin Power (Oct 30 2021 at 20:32):

This is the fixing I hope comes out of integrating @Bob Freimuth’s IM work into the IG

view this post on Zulip Bret H (Nov 01 2021 at 15:26):

Bob Dolin said:

Thanks Kevin Power . To be honest, I'm still thinking it through as well. (a preprint of article is here: https://digitalcommons.wayne.edu/cgi/viewcontent.cgi?article=1024&context=humbiol_preprints)

In this case, there are a number of haplotypes defined over an intergenic region between EDA2R and AR genes. The haplotype is defined based on ~100 SNPs. It might be impractical to list each SNP as a variant if, for instance, I only want to communicate that patient has the 'EDA2R/AR alpha haplotype'. But I'd also like to include something such as a location that will allow someone to search and find this haplotype if they need to.

I was thinking of perhaps moving the coordinate-system, genomic-ref-seq, and exact-start-end components from Variant up into Genomic Finding might be a good start.

Here you can send the genotype with references back to the location of the variant instances on the originating FHIR server. but the basic variant data is important to have as well as the summation genotype if one wants to have the discrete details.


Last updated: Apr 12 2022 at 19:14 UTC