• memfree@beehaw.org
    English
    arrow-up
    57
    ·
    1 year ago

    H-h-how? HOW? do they ‘anonymize’ DNA?!?! Remember how in 2007 ‘anonymized’ Netflix data was linked back to actual members? That was just comparing what people rated on Netflix against what they rated on IMDb.

    With DNA, you should be able to figure out who someone is from the fact that you have an exact DNA record! I mean, it’ll share similarities with your parents, and children, and to a lesser degree, more removed relatives. How hard can it be to figure out that this woman is related to that guy with an arrest record? Or more specifically: this is the exact person, because we see other records from any doctor or whatever with the same DNA.

    • Uninvited Guest@lemmy.ca
      arrow-up
      19
      ·
      1 year ago

      This was an obvious outcome when they were going to IPO. When it was announced they were going public, I exported all of my data and had all of my records with them destroyed.

      Then I made a little bit of money on their stock and got out of that too.

    • The Doctor@beehaw.org
      English
      arrow-up
      15
      ·
      1 year ago

      As a general rule, when someone says that data is anonymized, they’re one part lying and one part clueless. It sounds great when they say it, but ultimately it’s bullshit. Maybe if we started calling claims like this lies when they were made, a few more people would pay attention.

      • floofloof@lemmy.ca
        English
        arrow-up
        4
        ·
        1 year ago

        That’s basically what it usually is: anonymized so that to discover people’s identities you’d need to combine the data set with at least one other readily available data set.
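The combining step described here is often called a linkage attack. A minimal sketch of the idea, assuming made-up data: the pseudonymous IDs, names, titles, and the `link` helper are all hypothetical, and a real attack would also have to cope with noisy and partial matches.

```python
# Toy linkage attack. Every record below is invented for illustration.
# "Anonymized" release: pseudonymous IDs mapped to (title, rating) pairs.
anonymized = {
    "user_001": {("Movie A", 5), ("Movie B", 2), ("Movie C", 4)},
    "user_002": {("Movie A", 3), ("Movie D", 5), ("Movie E", 1)},
}

# Second, readily available data set: the same kind of pairs under real names.
public = {
    "alice": {("Movie A", 5), ("Movie C", 4)},
    "bob": {("Movie D", 5), ("Movie E", 1)},
}


def link(anonymized, public, min_overlap=2):
    """Map each public name to the pseudonym sharing the most (title, rating) pairs."""
    matches = {}
    for name, known in public.items():
        best_id, best_score = None, 0
        for pseudo_id, ratings in anonymized.items():
            score = len(known & ratings)  # number of identical (title, rating) pairs
            if score > best_score:
                best_id, best_score = pseudo_id, score
        # Only report a match when enough pairs line up.
        if best_score >= min_overlap:
            matches[name] = best_id
    return matches


result = link(anonymized, public)
print(result)
```

Neither data set contains a name next to a pseudonym, yet joining them on the shared attributes recovers the mapping.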

    • Victor Villas@beehaw.org
      arrow-up
      9
      ·
      edit-2
      1 year ago

      H-h-how? HOW? do they ‘anonymize’ DNA?!?!

      If you’re really curious, it is possible, depending on the sections of the DNA being shared and how aggregated they are. Not saying that this will be the case - it’s quite likely that this sale would be done prioritizing value instead of privacy - but it is possible. The key part is to not treat the whole DNA as a data sample, but specific sequence sections, as isolated as possible.

      And the Netflix example is instructive but not super relevant here. If you already have your SNPs in a public database out there, then yeah 23andMe might not be able to effectively anonymize your samples; but you don’t (I hope).
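One way to picture the "aggregated, isolated sections" idea is to release only per-SNP allele frequencies for a cohort instead of anyone's individual genotype. This is a rough sketch under assumptions, not a description of what 23andMe actually does: the SNP IDs, the people, the `allele_frequencies` helper, and the `min_cohort` threshold are all hypothetical. Aggregates are not a guarantee of anonymity either.

```python
# Hypothetical genotypes: copies (0, 1, or 2) of the minor allele at each SNP.
# Assumes every person has a call at every listed SNP.
genotypes = {
    "person_1": {"rs0001": 0, "rs0002": 1},
    "person_2": {"rs0001": 2, "rs0002": 1},
    "person_3": {"rs0001": 1, "rs0002": 0},
    "person_4": {"rs0001": 1, "rs0002": 2},
}


def allele_frequencies(genotypes, min_cohort=3):
    """Return per-SNP minor-allele frequency, or {} if the cohort is too small."""
    if len(genotypes) < min_cohort:
        return {}  # refuse to aggregate over a handful of people
    totals = {}
    for calls in genotypes.values():
        for snp, copies in calls.items():
            totals[snp] = totals.get(snp, 0) + copies
    # Each person carries two alleles per SNP, hence the factor of 2.
    return {snp: total / (2 * len(genotypes)) for snp, total in totals.items()}


freqs = allele_frequencies(genotypes)
print(freqs)
```

The output holds one number per SNP and no per-person rows, which is the property being described.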

      • sudoshakes@reddthat.com
        arrow-up
        4
        ·
        edit-2
        1 year ago

        In many US states, every prisoner has a DNA sample taken, regardless of the offense.

        That is not voluntary.

        It was ruled constitutional by SCOTUS.

        If you had that done, and you have family dumb enough to use 23andme, then you just got screwed, involuntarily, twice.

      • 4dpuzzle@beehaw.org
        English
        arrow-up
        2
        ·
        edit-2
        1 year ago

        A vast majority of those millions are going to be for the identity rather than just the relevant data. Meanwhile, the genetic profiling companies, drug companies and insurance companies are sociopathic enough to lie through their noses about it.

        I have a strong feeling that the data transfer has already happened through data brokers. They are just easing the public into acceptance.

  • Swim@lemmy.ca
    arrow-up
    47
    ·
    1 year ago

    all they ever wanted was a database to sell. we keep falling for the same game…

    • lightnsfw@reddthat.com
      arrow-up
      13
      ·
      1 year ago

      I tried telling my family this from the jump, but half of them have used these services. Now me and all their descendants are fucked over because they just had to be idiots. It’s extra stupid because we already have our genealogy on both sides going back to like the 1500s.

    • The Doctor@beehaw.org
      English
      arrow-up
      13
      ·
      1 year ago

      When we said that data was the new oil in the 80’s, nobody listened.

      They still haven’t listened, but if you own stock in those companies at least it’s profitable.

  • CaptObvious@literature.cafe
    arrow-up
    29
    ·
    1 year ago

    I made the mistake of having them sequence my DNA before the first Big Pharma deal with GSK, which took a lot of people by surprise. I’ve since made a point of feeding them as much disinformation as possible every time I’m on their site.

    • apis@beehaw.org
      English
      arrow-up
      17
      ·
      1 year ago

      Be quite amusing if we could poison their well by persuading a great many people to send in samples from other life forms.

      Probably easier, cheaper & faster to make their data unusable via other means though.

      • Xavier@lemmy.ca
        arrow-up
        20
        ·
        1 year ago

        It is fairly easy to differentiate DNA samples from different species and exclude them. Since contamination by foreign DNA (bacteria, fungi, viruses, plankton, fauna and flora of all sorts, etc.) has always been an issue, tools, methods, and protocols are specifically made to quickly separate out (amplify) the DNA we are interested in from whatever is not the focus of the current study.

        Moreover, a random anonymous sample without associated information can quickly be analysed and compared against large libraries of genome datasets/maps to ascertain and corroborate what it is from, the closest species, even family trees of related individuals, and most importantly to get an overview of multiple phenotypes of interest.

        Since the human genome map was declared complete in 2003 (covering roughly 92% of the genome), research has only accelerated, improving the map while working out the functions of many different parts of our DNA.

      • The Doctor@beehaw.org
        English
        arrow-up
        4
        ·
        1 year ago

        That was one of the first things they put in place when they started accepting samples from people: Detect and filter out every sample of non-human DNA to keep people from messing with their data set.

    • SeaJ@lemm.ee
      arrow-up
      8
      ·
      1 year ago

      I am guessing this is only for the people who opted in to having their data shared for research.

    • liv@beehaw.org
      English
      arrow-up
      13
      ·
      1 year ago

      I’m waiting for the part where the US insurance companies are discovered using that data en masse to increase premiums and deny coverage.

      That’s going to be my “I told you so”.

      • Sina@beehaw.org
        arrow-up
        3
        ·
        edit-2
        1 year ago

        That will take a long time. Right now, analyzing one person’s DNA to a point where an insurance company could profit from it costs way more than the extra profits from denying some potentially short-lived clients.

        (or an AI based analyzer is already in the works)

        • ConsciousCode@beehaw.org
          arrow-up
          3
          ·
          1 year ago

          Considering prior authorization is predicated on the fact that if they reject enough requests inevitably some people won’t fight them, meaning they don’t have to pay out, I wouldn’t be surprised if they use a slightly better than chance prediction as justification for denying coverage, if they even need an actual excuse to begin with.

        • liv@beehaw.org
          English
          arrow-up
          2
          ·
          edit-2
          1 year ago

          I’m old enough that a lot of things that were going to take a long time have come to pass, so I feel confident this will come.

          AI and genetics are both moving fairly fast, and insurance is about numbers and probabilities.

    • bitsplease@lemmy.ml
      arrow-up
      10
      ·
      1 year ago

      Don’t expect much lol, when I pointed out that this was inevitable the most common response was “who cares?”

      Privacy is dead mainly because your average person doesn’t actually care about it

      • 4dpuzzle@beehaw.org
        English
        arrow-up
        4
        ·
        1 year ago

        Privacy is dead because of the average person. They were informed several times, but they decided it wasn’t important. And they ruined it for everyone else who cared.

    • Devi@beehaw.org
      English
      arrow-up
      3
      ·
      1 year ago

      Were they of the idea that when you tick the “my data can be used anonymously for research” box it meant that their data WOULDN’T be used anonymously for research?

  • CanadaPlus@lemmy.sdf.org
    arrow-up
    7
    ·
    edit-2
    1 year ago

    The only way I’m ever getting sequenced is if the machine is in front of me, is an open-source design or can be destroyed afterwards, and I get the only copy on my own encrypted drive. Or it’s done without my consent. Probably the latter the way things are going.

    • 4dpuzzle@beehaw.org
      English
      arrow-up
      2
      ·
      1 year ago

      Do you realize that you don’t need to volunteer at all? Do you know that there was a rape and murder case that was proven using data from a similar (same?) company? They found a bunch of people with DNA similar to that from the rape kit and went on to find their common relative.

      The story above may sound good. But it won’t be too hard for medical insurance companies to deduce your approximate genetic profile based on the samples submitted by your relatives.

      Even worse, it doesn’t take a lot of genetic material these days to profile you. The PCR technique (the same used for Covid-19 screening) can amplify samples. You may have submitted a blood sample at some point in the last few years. How would you know if a tiny bit of that was siphoned off to create an exact genetic profile of yours?
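The "find people with similar DNA, then work back to a common relative" step can be pictured as ranking database profiles by how closely they match an unknown sample. This is a crude sketch with invented data: the SNP IDs, profile names, and both helper functions are hypothetical, and real relative matching looks at long shared chromosome segments rather than a simple per-SNP agreement rate.

```python
def similarity(a, b):
    """Fraction of commonly typed SNPs where both profiles have the same genotype."""
    shared = a.keys() & b.keys()
    if not shared:
        return 0.0
    return sum(a[snp] == b[snp] for snp in shared) / len(shared)


def closest_matches(unknown, database, top=2):
    """Rank database profiles by similarity to the unknown sample, best first."""
    scored = [(similarity(unknown, profile), name) for name, profile in database.items()]
    return sorted(scored, reverse=True)[:top]


# Made-up profiles: minor-allele copies (0, 1, or 2) at four placeholder SNPs.
unknown = {"rs1": 0, "rs2": 1, "rs3": 2, "rs4": 1}
database = {
    "relative_b": {"rs1": 0, "rs2": 1, "rs3": 2, "rs4": 0},
    "unrelated": {"rs1": 2, "rs2": 0, "rs3": 2, "rs4": 2},
    "relative_a": {"rs1": 0, "rs2": 1, "rs3": 2, "rs4": 1},
}

ranked = closest_matches(unknown, database)
print(ranked)
```

The person behind `unknown` never has to be in the database. A few relatives who uploaded their own profiles are enough to narrow the search to one family.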

      • Devi@beehaw.org
        English
        arrow-up
        1
        ·
        1 year ago

        Different company. There’s a site called GEDmatch where you can upload your file, and one of the features you can choose to use is to allow your file to be used to identify John and Jane Does or solve serious crimes. Nobody is doing this secretly.

    • SenorBolsa@beehaw.org
      arrow-up
      2
      ·
      1 year ago

      Only way I’m doing it is if I assembled the machine from a kit and got to inspect the source code myself.

  • Devi@beehaw.org
    English
    arrow-up
    7
    ·
    1 year ago

    People are getting very confused here. You can allow your anonymised data to be used for research. This is not new whatsoever and it’s done by consent.

    What IS new is that a company (GSK) are about to start using this data. Data that’s publicly used already. This may help them to develop some new treatments.

    • interolivary@beehaw.org
      arrow-up
      3
      ·
      edit-2
      1 year ago

      No no, just repeat after me: “I can say I tOlD YoU So!” You don’t want to be caught using anything resembling logic when it comes to pharmaceutical companies.

  • akrz@programming.dev
    arrow-up
    2
    ·
    1 year ago

    This is only if you opted into research. And I am actually happy this is happening. If only one person is helped by research outcomes or medications developed from this, I am happy. I don’t care if 23andMe gets rich from it or not.