Stream: genomics
Topic: Beacon API
EvaM (Dec 10 2019 at 12:56):
Can someone explain me which needs the Beacon API fulfill, that the standard FHIR GC API and the GA4GH does not? That is, when does one want to use the Beacon API insted of the two former? Has it only to do with security, or is there something else as well?
And is the Beacon API an open API that can be easily employed by researchers (and clinicians) in all countries?
Thank you, and kind regards, Eva
Bret H (Dec 10 2019 at 17:07):
from https://developer.mozilla.org/en-US/docs/Web/API/Beacon_API "The Beacon interface addresses the needs of analytics and diagnostics code that typically attempts to send data to a web server before unloading the document. Sending the data any sooner may result in a missed opportunity to gather data. However, ensuring that the data is sent during the unloading of a document is something that has traditionally been difficult for developers." @EvaM the FHIR API is much more specific to the structure of the messages, intended for HealthCare. Beacon API sounds more generic. I have not used Beacon. Hopefully someone who has will have more to say
Bret H (Dec 10 2019 at 17:08):
"Example use cases of the Beacon API are logging activity and sending analytics data to the server"
Bret H (Dec 10 2019 at 17:09):
The FHIR API also allows for querying medical information out of the box. I am not sure Beacon API would allow for that as directly without additions to the API.
Patrick Werner (Dec 10 2019 at 22:38):
i think this is about g4gh‘s beacon api: https://github.com/ga4gh-beacon/specification
Patrick Werner (Dec 10 2019 at 22:39):
And it is a project with no commits for a year, never a good sign.
Jamie Jones (Dec 10 2019 at 22:43):
It seems GA4GH Beacon's main point is to be added to a genomic data server to allow researchers to easily query if a SNP/variation they are interested in is present in the server, so they could arrange to purchase/gain access to the data.
Jamie Jones (Dec 10 2019 at 22:45):
My understanding is that it only responds with a 'yes' or 'no', does not expose any information on how many hits or non-hits or samples are stored.
Jamie Jones (Dec 10 2019 at 22:50):
In this case it would be mainly for security of the data, and to assist in filtering through large amounts of available data sets to prioritize research efforts.
I don't see clinicians having much use for Beacon, though a hospital EHR storing FHIR variants or MolecularSequences may be interested in setting it up to partner with research institutions.
Jamie Jones (Dec 10 2019 at 22:54):
I suppose if one were to individually store patient-level genomic data in individual server "beacons", that could be useful to a clinician or a SMART-on-FHIR app, but I don't know of anyone who is doing this currently.
EvaM (Dec 11 2019 at 08:09):
Thank you all for answers!
Does the Beacon API work between countries as well? That is, can one ask for variants (with yes/no answer to avoid GDPR) across all nations and institutes as long as they have the Beacon API?
If yes, this platform sounds extremely useful for a way of datasharing that can evoid all the strong rules of data protection. Or am I misunderstanding something?
EvaM (Dec 11 2019 at 08:35):
And one more question; has the sequrity of the Beacon API evolved since the study which showed that in a beacon with 1,000 individuals, re-identification is possible with just 5,000 queries (see link below)?
https://www.cell.com/ajhg/fulltext/S0002-9297(15)00374-2
Jamie Jones (Dec 11 2019 at 15:40):
Hi Eva,
I'm not aware of anyone on this stream being an expert in the Beacon API, but maybe @Bob Freimuth can identify someone who may have more knowledge on implementation details. Are you looking at hosting a genomic data server? More of a description of your intended use case would be very helpful.
In terms of security concerns, that article is a fun read but from a quick review, I don't feel the hypothetical situation it proposes is realistic in today's data climate. Not only does it assume the attacker already knows the whole genome of the person in question, but it also assumes they know two things about the beacon that I don't believe it has to disclose: the number of individuals in the beacon, and the site frequency spectrum (SFS) of the population in the beacon... is this a situation you are concerned with in terms of data security?
Kevin Power (Dec 11 2019 at 16:02):
@EvaM If you have continued questions about Beacon, it seems they have a Google group setup, so perhaps it will be a better forum for questions like this:
https://groups.google.com/forum/#!forum/ga4gh-beacon-network
Last updated: Apr 12 2022 at 19:14 UTC