Skip to content

Missing identifiers allelematrix / callsets / samples #7

@matthijsbrouwer

Description

@matthijsbrouwer

Allelematrix doesn't link to callsets

1484 from the 6737 unique callSetDbIds from the barley allelematrix endpoint seem not to be included in the callsets endpoint, for example:

 'SAMEA104650675',
 'SAMEA104634582',
 'SAMEA104630144',
 'SAMEA104648159',
 'SAMEA104650723',
 'SAMEA104629428',
  ...

5225 from the 5225 unique callSetDbIds from the wheat allelematrix endpoint seem not to be included in the callsets endpoint, for example:

 'SAMEA7824966',
 'SAMEA7825972',
 'SAMEA7828512',
 'SAMEA7826208',
 'SAMEA7821926',
 'SAMEA7827342',
  ...

Callsets / Samples

Callsets includes sampleDbId as an integer:

{
  "additionalInfo":{},
  "callSetDbId":"SAMEA14020287",
  "created":"2020-02-08",
  "externalReferences":[],
  "sampleDbId":3627152,
  "updated":"2020-02-08",
  "variantSetDbIds":[]
}

However, samples includes sampleDbId as string:

{
  "additionalInfo":null,
  "column":null,
  "externalReferences":[{"referenceId":"SAMEA14020287","referenceSource":""}],
  "germplasmDbId":"AB_07378",
  "observationUnitDbId":null,
  "plateDbId":null,
  "plateName":null,
  "programDbId":null,
  "row":null,
  "sampleBarcode":null,
  "sampleDbId":"3627152",
  "sampleDescription":"DB_ID: SAMEA14020287; sample volume: n/a; concentration: n/a; concentration: n/a; RNASeq-only: n/a; Specify number of lanes: 1; buffer: n/a; library protocol: GBS (narrow size range); Run type & workflow: n/a; Run type & workflow: Single read; flow cell type: SP; Unique dual Index: Meyer K,L,M; of index read cycles: 8 cycles; Average size (Tape): 412; size range: 400-600; size fractionation: BluePippin; REMARKS: n/a; REMARKS: n/a; spike-in: n/a; Sequenz: n/a; Read length: 100; Aussaat: n/a; library source: n/a; library strategy: n/a; library selection: n/a; sequencer/platform: Illumina NovaSeq 6000",
  "sampleGroupDbId":null,
  "sampleName":null,
  "samplePUI":"SAMEA14020287",
  "sampleTimestamp":"2020-02-08",
  "sampleType":"DNA",
  "studyDbId":null,
  "takenBy":null,
  "tissueType":"leaf",
  "trialDbId":null,
  "well":null}

Strictly speaking, this doesn't match.

Furthermore, possibly the data included under sampleDescription does fit under other BrAPI endpoints (?)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions