Hugo Cellar 果の密室: CG vs. Illumina (Sensitivity)

Journal of Leisure 閒記

It is indeed very hard to keep a homepage up to date. I can't count how many homepages I have created and then given up on. As a result, I decided to write a blog, because it is much easier to manage, so I presume I will update it more often. But a rarely updated blog just means no one is going to read it. So I also decided to write about whatever best describes my daily life, just to keep it not-so-outdated :-)

It's been a hard day's night
And I've been working like a dog
It's been a hard day's night
I should be sleeping like a log

Tuesday, March 13, 2012

CG vs. Illumina (Sensitivity)

I came across MJ's post responding to CG's post about our sequencing platform paper in Nature Biotechnology:

http://mendeliandisorder.blogspot.com/2012/03/cliff-reid-on-cg-vs-illumina.html

MJ made a good point that our small set of Sanger sequencing data was only suggestive. Here are my thoughts.

For each platform-specific call set, a 95% confidence level with a ±5% margin of error requires a minimum sample size of ~380. Any estimate extrapolated from a much smaller set is inconclusive. That's why we went on to SureSelect at a larger scale, which gave us a statistically significant result.
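For reference, here is a minimal sketch of where a number like ~380 comes from. It assumes the standard sample-size formula for estimating a proportion, n = z^2 * p * (1 - p) / e^2, with the worst-case p = 0.5 and no finite-population correction. The function name and defaults are mine, not from the paper.

```python
import math

def required_sample_size(margin_of_error, z=1.96, p=0.5):
    """Minimum sample size to estimate a proportion.

    z = 1.96 corresponds to a 95% confidence level.
    p = 0.5 is the worst case (largest variance), used when the
    true proportion is unknown.
    """
    n = (z ** 2) * p * (1 - p) / (margin_of_error ** 2)
    return math.ceil(n)

# 95% confidence, +/-5% margin of error
print(required_sample_size(0.05))
```

This gives 385, in line with the ~380 quoted above. A finite-population correction would lower it slightly.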

As mentioned in the paper, the SureSelect validation may be biased, since it was followed by Illumina sequencing. But if there were a strong bias towards Illumina due to systematic errors, the invalidation rate for Illumina itself probably wouldn't be as high as that for Complete.

Let's take the existing Sanger numbers and redo the calculation, this time allowing for their possible errors. At the same 95% confidence level, the best possible validation rate for Illumina is 30% and the worst for Complete is 83%, which convert into 104K and 83K true positives in their platform-specific call sets, respectively. Even so, Illumina still has higher sensitivity, whereas Complete is more accurate (lower FDR).
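To make the mechanics of that calculation concrete, here is a rough sketch. It assumes a normal-approximation (Wald) interval, which may not be exactly the method used for the numbers above. The counts in the example are purely hypothetical and are not the paper's Sanger data. For samples this small, a Wilson or exact interval would be a safer choice.

```python
import math

def wald_interval(successes, n, z=1.96):
    """Normal-approximation (Wald) interval for a proportion, clipped to [0, 1]."""
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return max(0.0, p - half_width), min(1.0, p + half_width)

def estimated_true_positives(validation_rate, call_set_size):
    """Extrapolate a validation rate to a whole platform-specific call set."""
    return validation_rate * call_set_size

# Hypothetical example: 5 of 20 Sanger-tested calls validated,
# extrapolated to a hypothetical call set of 100,000 variants.
low, high = wald_interval(successes=5, n=20)
print(f"validation rate: {low:.1%} to {high:.1%}")
print(f"true positives:  {estimated_true_positives(low, 100_000):,.0f} "
      f"to {estimated_true_positives(high, 100_000):,.0f}")
```

With so few tested calls the interval spans tens of percentage points, which is why the extrapolated true-positive counts swing so widely.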

If that looks unfair, it's the problem with extrapolating from a set with big error bars. What is certain is that larger-scale Sanger sequencing of the platform-specific calls would give us a better, less controversial sense of the ground truth.

Until then, we have to accept that both platforms have their strengths and weaknesses, and that both performed very well overall.

