Uploaded image for project: 'NIF'
  1. NIF
  2. NIF-11607

lack of ID-mapping leads to incorrect results

    XMLWordPrintable

Details

    • NIF
    • Issues closed as MONARCH has transitioned from UCSD services

    Description

      A query for the number of genes in NIF q=gene gives 196,503,867 results. The number for q=gene:* 190,397,889 results (latter is only data in columns titled gene).

      20K-ish genes/genome means there would be @ 1000 organisms

      These are instances of data (+ false positives ) for the genes we have in the system and are not a unique set. To get the total number of genes known, one can query the NCBI gene data, but that does not tell you which data is mapped to any of the genes.

      A user should be able to query for gene and get back all the genes and all the data attached those genes, and have the genes be semantically defined gene entities. These results can then be filtered via a variety of ways.

      It would be a good exercise to see how such numbers changes when ID mapping is working and autocomplete is to a specific gene entity.

      Attachments

        Activity

          People

            agupta Amarnath Gupta (use this)
            mhaendel Melissa Haendel
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: