亚洲精品午夜精品,日本二手网站,国产AⅤ爽AV久久久久成人社区,日本一二三区不收费av

使用 WeGene 需要啟用 Cookies, 請啟用后刷新頁面獲得更好的體驗
鄭一丁 鄭一丁 綜合討論組

WEGENE的數據能直接導入23andme嗎?

如何操作
2016-10-08 ? IP屬地中國
按熱門排序    按默認排序

12 個回復

田園牧歌 - 追本溯源 · 微基因
不能~ 看看哪位高手寫個macro來讓我們可以把數據格式轉換成23andme或其他公司的?
田園牧歌 - 追本溯源 · 微基因
網上好像已經有23andme, ancestry, ftdna互轉的macro了 高手們快出現~~
如果誰能描述一下wegene的數據格式,也許我能寫出一個換格式的腳本。下面是我見過的23andMe的v3版數據格式:
?
以#開始的評論行:說明等,共15行。其后面的數據列用Tab隔開。
數據列1: SNP的rsid (如rs4477212、i5004018)
數據列2: SNP的染色體標號 (順序為:1-22、X、Y、MT)
數據列3: SNP的位置 (reference human assembly build 37)
數據列4: 基因型 (A、C、G、T、I、D;染色體1-22,兩個字母;染色體X、Y、MT,一個字母;所有未測到的都用--表示)
?
??? 在dna.land網站上見到這樣一個問題和回答:
“I have a 23andMe formatted file from Genes For Good. Why can’t I upload it to DNA.LAND?”
“At this time, we are only able to accept files directly from Ancestry, FTDNA, and 23andMe. We look forward to welcoming other communities into DNA.LAND at some point in the future!”
看起來,他們不光要求格式,還要對位點。
??? 另外,網站解釋他們的方法時說:
“DNA.Land imputes your genome, which opens the possibility of seeing genetic variations that were not part of the original file. It is similar to getting whole genome sequencing data (albeit we still miss some rare variations) without investing thousands of dollars. ”
好像是用一個人在給定系列的位點的SNP加上公共數據庫的數據去做一個模擬的全基因組序列。這大概依賴于他們已經研究過的給定系列的位點。
求高手出現!
田園牧歌 - 追本溯源 · 微基因
# This data file generated by WeGene at: Thu, 22 Sep 2016 14:17:00
#
# This file contains the genotype called by WeGene with our
# internal quality control pipeline. The low quality sites
# were discarded. If we could not determine the genotypes,
# it will be —-. As such, the call rate and accuracy may not
# be one hundred percent. So this data is suitable only for
# research, educational, and informational use and not for
# medical or other use.

# This text file is a list of your data which are TAB-
# separated. Each line corresponds to a single SNP or short
# InDel (insertion or deletion).
# For each SNP or short InDel, we provide its identifier (an
# rsid or an internal id), its location on the reference human
# genome (human assembly build 37, GRCh37) and the genotype
# call oriented with respect to the plus strand on the human
# reference sequence. For consistency, the genotypes are always
# ?two base pairs, including hemizygous calls.

# Please note, as our ability to call genotypes improves, it is
# possible that your data may be slightly different at different
# ?times.

# rsid?? ?chromosome?? ?position?? ?genotype
rs8179414?? ?1?? ?565400?? ?CC
rs9701055?? ?1?? ?565433?? ?--
rs9645428?? ?1?? ?566810?? ?GG
......
# This data file generated by 23andMe at: Sat Nov 24 12:23:14 2012
#
# Below is a text version of your data. Fields are TAB-separated
# Each line corresponds to a single SNP.? For each SNP, we provide its identifier
# (an rsid or an internal id), its location on the reference human genome, and the
# genotype call oriented with respect to the plus strand on the human reference
# sequence.???? We are using reference human assembly build 37.? Note that it is possible
# that data downloaded at different times may be different due to ongoing improvements
# in our ability to call genotypes. More information about these changes can be found at:
# https://www.23andme.com/you/download/revisions/
#
# More information on reference human assembly build 37:
# http://www.ncbi.nlm.nih.gov/projects/mapview/map_search.cgi?taxid=9606&build=37
#
# rsid?chromosome?position?genotype
?
# This data file generated by 23andMe at: Mon Nov 21 23:41:25 2016
#
# This file contains raw genotype data, including data that is not used in 23andMe reports.
# This data has undergone a general quality review however only a subset of markers have been
# individually validated for accuracy. As such, this data is suitable only for research,
# educational, and informational use and not for medical or other use.
#
# Below is a text version of your data.? Fields are TAB-separated
# Each line corresponds to a single SNP.? For each SNP, we provide its identifier
# (an rsid or an internal id), its location on the reference human genome, and the
# genotype call oriented with respect to the plus strand on the human reference sequence.
# We are using reference human assembly build 37 (also known as Annotation Release 104).
# Note that it is possible that data downloaded at different times may be different due to ongoing
# improvements in our ability to call genotypes. More information about these changes can be found at:
# https://www.23andme.com/you/download/revisions/
#
# More information on reference human assembly build 37 (aka Annotation Release 104):
# http://www.ncbi.nlm.nih.gov/mapview/map_search.cgi?taxid=9606
#
# rsid?chromosome?position?genotype
?
田園牧歌 - 追本溯源 · 微基因
之前老大說過人家不鳥咱們因為人數太少...
??? 這個網頁(http://www.math.mun.ca/~dapike/FF23utils/)的第一項 Search for Runs of Homozygosity (ROHs)?計算常染色體對上的相同片段的長度。因為芯片上的SNP不全,所以結果是近似的而且與芯片的版本有關。只有拿不同的人用同一種芯片得出的結果才容易直接比較。
??? 這個計算給出一種近親程度分析(一個人的近祖是否來自關系緊密的人群)。曾經有不少人在23andMe的論壇里比較過結果。(第一項自己的網頁上鏈接了幾篇參考文獻。)
??? 按這個網頁上說的,好像要用wegene的數據的話可以把文件的第一行改成這樣 - beginning with # and containing the string "23andMe"。如果真能算,可以開一個新帖比較結果。
?

要回復問題請先登錄注冊

  • <track id="ffr4e"></track>

      <dfn id="ffr4e"></dfn>

      主站蜘蛛池模板: 赤水市| 梅州市| 雅安市| 安丘市| 庆安县| 靖州| 伊宁县| 鱼台县| 海宁市| 奇台县| 襄城县| 彝良县| 宝应县| 虹口区| 东莞市| 天全县| 凉城县| 齐齐哈尔市| 清苑县| 奇台县| 泸定县| 凌云县| 正阳县| 繁峙县| 纳雍县| 神农架林区| 临沭县| 敦化市| 乌兰浩特市| 隆尧县| 南华县| 兰西县| 和平县| 城市| 天峻县| 金坛市| 五指山市| 苗栗市| 高台县| 玉田县| 曲阳县|