  1. NIF
  2. NIF-11894

DISCO: import human genomic feature coordinates


Details

    • Task
    • Resolution: Fixed
    • Major
    • None
    • None
    • None
    • NIF

    Description

      Get the list of human genes and their genomic coordinates, probably from Ensembl (or UCSC as an alternative). Make sure we also pull the cytogenetic band definitions.
      Make sure to ingest the following features (at minimum):
      genes, transcripts, exons, cytogenetic bands
      Additional awesomeness would be:
      proteins
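      A minimal sketch of how the minimum feature set could be pulled out of a GTF file, assuming the standard nine tab-separated GTF columns and the feature names "gene", "transcript" and "exon" in column 3. The function names and the `WANTED_FEATURES` set are hypothetical, not part of DISCO. Cytogenetic bands are not GTF records and would need a separate source, so they are not covered here.

```python
# Feature types to keep; an assumption based on the minimum list in this ticket.
WANTED_FEATURES = {"gene", "transcript", "exon"}


def parse_gtf_line(line):
    """Parse one GTF data line into a dict; return None for comments or blanks."""
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None
    fields = line.split("\t")
    if len(fields) != 9:
        raise ValueError("expected 9 tab-separated columns, got %d" % len(fields))
    seqid, source, feature, start, end, score, strand, frame, attributes = fields
    return {
        "seqid": seqid,
        "source": source,
        "feature": feature,
        "start": int(start),  # GTF coordinates are 1-based, inclusive
        "end": int(end),
        "score": score,
        "strand": strand,
        "frame": frame,
        "attributes": attributes,  # left unparsed in this sketch
    }


def select_features(lines, wanted=WANTED_FEATURES):
    """Yield parsed records whose feature type is in `wanted`."""
    for line in lines:
        record = parse_gtf_line(line)
        if record is not None and record["feature"] in wanted:
            yield record
```

      Usage would be along the lines of `for rec in select_features(open("sample.gtf")): ...`, with the attached sample file standing in for the full download.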

      Ensembl provides GTF rather than GFF3. Do we want to store it as GTF in the DISCO system, or convert it to GFF3 during ingest?
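      If conversion during ingest is the route taken, a rough sketch of a per-line GTF to GFF3 rewrite might look like the following. The mapping is an assumption for illustration only: genes get `ID=gene_id`, transcripts get `ID=transcript_id` with `Parent=gene_id`, and every other feature gets `Parent=transcript_id`. All original GTF attributes are carried over unchanged, and repeated keys are merged into a comma-separated value. The function names are hypothetical, and columns 1 to 8 are passed through as-is.

```python
import re

# Matches `key "value";` and also bare values such as `exon_number 1;`.
_ATTR_RE = re.compile(r'(\S+)\s+(?:"([^"]*)"|([^";\s]+))\s*;')


def parse_gtf_attributes(text):
    """Return GTF column 9 as an ordered list of (key, value) pairs."""
    return [(key, quoted or bare) for key, quoted, bare in _ATTR_RE.findall(text)]


def gff3_escape(value):
    """Percent-encode the characters that are reserved in GFF3 column 9."""
    out = value.replace("%", "%25")  # escape '%' first so later codes survive
    for ch in ";=&,\t\n\r":
        out = out.replace(ch, "%{:02X}".format(ord(ch)))
    return out


def gtf_line_to_gff3(line):
    """Rewrite one GTF data line as a GFF3 line (illustrative mapping only)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError("expected 9 tab-separated columns, got %d" % len(fields))
    feature = fields[2]
    attrs = parse_gtf_attributes(fields[8])
    lookup = dict(attrs)

    pairs = []
    if feature == "gene" and "gene_id" in lookup:
        pairs.append(("ID", lookup["gene_id"]))
    elif feature == "transcript" and "transcript_id" in lookup:
        pairs.append(("ID", lookup["transcript_id"]))
        if "gene_id" in lookup:
            pairs.append(("Parent", lookup["gene_id"]))
    elif "transcript_id" in lookup:
        pairs.append(("Parent", lookup["transcript_id"]))
    pairs.extend(attrs)  # keep the original GTF attributes as well

    # GFF3 expects one entry per tag; repeated tags become comma-separated values.
    merged = {}
    for key, value in pairs:
        merged.setdefault(key, []).append(gff3_escape(value))
    fields[8] = ";".join("%s=%s" % (k, ",".join(v)) for k, v in merged.items())
    return "\t".join(fields)
```

      Storing the GTF untouched and converting on export would be the other option; the sketch only shows that the conversion itself is mostly a rewrite of column 9. The IDs used when trying it out (for example `gene_id "G1"; transcript_id "T1";`) are placeholders.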

      Attachments

        1. sample.gtf
          72 kB
        2. sample.gff3
          40 kB


            People

              hoffmasc Scott Hoffmann
              nlw Nicole Washington
              Votes: 0
              Watchers: 5
