{"id":6233,"date":"2021-07-15T10:12:00","date_gmt":"2021-07-15T10:12:00","guid":{"rendered":"https:\/\/41j.com\/blog\/?p=6233"},"modified":"2021-06-15T10:16:41","modified_gmt":"2021-06-15T10:16:41","slug":"twinstrand-biosciences","status":"publish","type":"post","link":"https:\/\/41j.com\/blog\/2021\/07\/twinstrand-biosciences\/","title":{"rendered":"Twinstrand Biosciences"},"content":{"rendered":"\n<p>This post originally appeared on my <a href=\"https:\/\/aseq.substack.com\">substack<\/a> newsletter.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">Business<\/h1>\n\n\n\n<p>Twinstrand is a University of Washington spinout based in Seattle. Crunchbase lists Twinstrand as being founded in 2015, not long after the foundational work was done (in 2012).&nbsp;<a href=\"https:\/\/www.businesswire.com\/news\/home\/20210506005487\/en\/TwinStrand-Biosciences-Announces-50-Million-Series-B-to-Expand-the-Adoption-of-its-Duplex-Sequencing-Technology-in-Applications-Requiring-the-Highest-Sensitivity-and-Accuracy\">They\u2019ve recently raised a Series B of $50M<\/a>,&nbsp;bringing their total to&nbsp;<a href=\"https:\/\/www.crunchbase.com\/organization\/twinstrand-biosciences\/company_financials\">$73.2M<\/a>. I see 62 employees on LinkedIn.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">Approach and Applications<\/h1>\n\n\n\n<p>The basic play is that Illumina sequencing has an error rate that\u2019s too high for some applications. To me, this was kind of surprising. In Illumina sequencing,&nbsp;<a href=\"https:\/\/www.illumina.com\/documents\/products\/technotes\/technote_Q-Scores.pdf\">around 90% of bases are Q30<\/a>. That\u2019s an error rate of 1 in 1000. Do you really need an error rate lower than this? 
Twinstrand propose a number of applications, largely around detecting very low-level mutations.<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Detecting residual acute myeloid leukemia (AML) after treatment.<\/li><li>Mutagenesis assays, for chemical and drug safety testing.<\/li><li>Cellular Immunotherapy Monitoring<\/li><\/ul>\n\n\n\n<p>In general, I\u2019m used to seeing plays (like GRAIL) around cancer screening. But this is aimed more at cancer monitoring. The US national cost of&nbsp;<a href=\"https:\/\/healthpayerintelligence.com\/news\/cost-of-cancer-care-reaches-nearly-150b-nationally\">cancer care is $150B<\/a>, and there are&nbsp;<a href=\"https:\/\/www.cancer.gov\/about-cancer\/understanding\/statistics\">around 1.8M cancer cases<\/a>. So, if we assume that this test will be required for cancer monitoring of every patient, and yields $1000 in profit per patient, that\u2019s $1.8B in profit. Probably enough to support the company, and make investors happy\u2026<\/p>\n\n\n\n<p>But for the Twinstrand play to work, and justify a healthy valuation, at least the following needs to be true:<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>\u201cultra-high accuracy\u201d is needed for cancer monitoring.<\/li><li>The Twinstrand approach is a practical method of generating \u201cultra-high accuracy\u201d reads.<\/li><li>The Twinstrand approach is the only and best way to get \u201cultra-high accuracy\u201d.<\/li><\/ol>\n\n\n\n<p>The first may be true, but it\u2019s obviously not what GRAIL and other players have been working on for early stage cancer screening, where the focus has shifted toward base modification\/methylation.<\/p>\n\n\n\n<p>As to Twinstrand\u2019s practicality? Hopefully we can gain some insight by reviewing the approach.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">Technology<\/h1>\n\n\n\n<p>The technique relies on adding two pieces of information to double stranded DNA. The first is a unique index (a UMI) which uniquely identifies each double stranded fragment. 
The second is a strand-defining element (SDE). This is a marker that allows the two strands forming a double stranded fragment to be distinguished.<\/p>\n\n\n\n<p>Twinstrand use two UMIs, one on each end of the original double stranded fragment. They call these two UMIs \u201c\u03b1\u201d and \u201c\u03b2\u201d in the figure below.&nbsp;<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter is-resized\"><a href=\"https:\/\/cdn.substack.com\/image\/fetch\/f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed18383-a1a3-4ebe-9d09-2e92f59b63e1_1006x1456.png\" target=\"_blank\" rel=\"noreferrer noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.substack.com\/image\/fetch\/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed18383-a1a3-4ebe-9d09-2e92f59b63e1_1006x1456.png\" alt=\"\" width=\"390\" height=\"564\"\/><\/a><\/figure><\/div>\n\n\n\n<p>The Y shaped adapters (labelled Arm 1,2) in the diagram above introduce an asymmetry between the strands. This provides the strand-defining element (SDE) described above.<\/p>\n\n\n\n<p>To make this clearer I decided to break the diagram down further, showing the individual amplification steps involved:<\/p>\n\n\n\n<p>Post amplification, and in 5\u2019 orientation you will get 4 distinct read types as shown above. 
Each of these can be classified as coming from either the forward or reverse strand of the original dsDNA fragment.<\/p>\n\n\n\n<p>From there it\u2019s obvious that you can use this information to filter out errors that occurred during amplification (including bridge amplification):<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter is-resized\"><a href=\"https:\/\/cdn.substack.com\/image\/fetch\/f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F90d7ce58-05b0-47ac-b843-3367b5f5893f_1888x1792.png\" target=\"_blank\" rel=\"noreferrer noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.substack.com\/image\/fetch\/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F90d7ce58-05b0-47ac-b843-3367b5f5893f_1888x1792.png\" alt=\"\" width=\"460\" height=\"436\"\/><\/a><\/figure><\/div>\n\n\n\n<p>For an amplification error to survive this filtering, matching errors would need to occur on both strands, at the same position and with the same base change. So, I\u2019d assume a ballpark estimate is somewhere around Q60\u2026 and their reports include identifying mutation frequency down to a rate of 10^-5.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">Problems<\/h1>\n\n\n\n<p>Wow, great! Q60 reads, who wouldn\u2019t want that!<\/p>\n\n\n\n<p>Well, the major problem is that you\u2019re going to throw away a lot of throughput. At best you will need to sequence each strand 2 to 4 times. This might be fine if you have an amplification step in your protocol anyway. Much like UMIs, the Twinstrand process will just provide additional information, removing error and bias.<\/p>\n\n\n\n<p>But unlike with UMIs, you want to optimize for duplicates. And not just duplicates, but duplicating starting material a fixed number of times. I.e. 
the ideal is probably to see ~4 different sequences for every original fragment of dsDNA (one of each type).<\/p>\n\n\n\n<p>In practice, this is problematic. In&nbsp;<a href=\"https:\/\/twinstrandbio.com\/wp-content\/uploads\/patents-US9752188.pdf\">their patent<\/a>&nbsp;they state \u201c3.1% of the tags had a matching partner present in the library, resulting in 2.9 million nucleotides of sequence data\u201d. As far as I can tell, the input dataset was 390Mb of sequence data. Processed, corrected reads therefore represent about 0.75% of the input dataset. This is a huge hit to your throughput.<\/p>\n\n\n\n<p>The above describes the original IP, from ~2012. Most of their patents appear to be based around this basic process. However, a&nbsp;<a href=\"https:\/\/twinstrandbio.com\/wp-content\/uploads\/patents-US9970054.pdf\">patent from 2018<\/a>&nbsp;looks like it might be worth digging into in more detail. In this patent it looks like they try to more closely model errors that occur during the sequencing process (incorporating fluorescence intensity information into a two pass basecalling process).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This post originally appeared on my substack newsletter. Business Twinstrand is a University of Washington spinout based in Seattle. Crunchbase lists Twinstrand as being founded in 2015 which is not long after the foundational work was done (in 2012),&nbsp;they\u2019ve recently raised a series B of $50M&nbsp;bringing their total to&nbsp;$73.2M. I see 62 employees on LinkedIn. 
[&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[1],"tags":[],"class_list":["post-6233","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p1RRoU-1Cx","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/posts\/6233","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/comments?post=6233"}],"version-history":[{"count":4,"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/posts\/6233\/revisions"}],"predecessor-version":[{"id":6239,"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/posts\/6233\/revisions\/6239"}],"wp:attachment":[{"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/media?parent=6233"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/categories?post=6233"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/41j.com\/blog\/wp-json\/wp\/v2\/tags?post=6233"}],"curies":[{"name"
:"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}