JP7802295B2

JP7802295B2 - Methods for increasing the yield of sequencing libraries

Info

Publication number: JP7802295B2
Application number: JP2022567511A
Authority: JP
Inventors: アンドリューシー．アディ，; ライアンマルクイーン，; フランクスティーマーズ，; ディミトリーケー．ポコロック，; ファンジャン，; エスターマスグレイブ－ブラウン，
Original assignee: Oregon Health and Science University
Current assignee: Oregon Health and Science University
Priority date: 2020-06-09
Filing date: 2021-06-09
Publication date: 2026-01-20
Anticipated expiration: 2041-06-09
Also published as: EP4162080A1; CA3182810A1; IL298821A; AU2021287900A1; US20210380972A1; CN115552035A; JP2023533418A; JP2025178294A; KR20230020977A; BR112022025184A2; MX2022015521A

Description

（関連出願の相互参照）
本出願は、２０２０年６月９日に出願された米国仮特許出願第６３／０３６，７１０号の利益を主張し、その全体が参照により本明細書に組み込まれる。 CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit of U.S. Provisional Patent Application No. 63/036,710, filed June 9, 2020, which is incorporated herein by reference in its entirety.

（政府出資）
本発明は、ＮａｔｉｏｎａｌＩｎｓｔｉｔｕｔｅｓｏｆＨｅａｌｔｈにより授与されたＲ３５ＧＭ１２４７０４の下で政府の支援によってなされた。政府は本発明において一定の権利を有する。 (Government investment)
This invention was made with government support under R35GM124704 awarded by the National Institutes of Health. The government has certain rights in this invention.

（配列表）
本出願は、２０２１年６月８日に作成されたサイズ２キロバイトの「２０２１－０６－０８－ＳｅｑｕｅｎｃｅＬｉｓｔｉｎｇ＿ＳＴ２５．ｔｘｔ」と題されたＡＳＣＩＩテキストファイルとして、ＥＦＳ－Ｗｅｂを介して米国特許商標庁に電子的に提出された配列表を含む。配列表に含まれる情報は、参照により本明細書に組み込まれる。 (Sequence Listing)
This application contains a Sequence Listing that has been submitted electronically to the United States Patent and Trademark Office via EFS-Web as an ASCII text file entitled "2021-06-08-SequenceListing_ST25.txt", 2 kilobytes in size, created on June 8, 2021. The information contained in the Sequence Listing is incorporated herein by reference.

（発明の分野）
本開示の実施形態は、シークエンシングのために核酸を調製することに関する。特に、本明細書で提供される方法、組成物、システム、及びキットの実施形態は、核酸ライブラリーを対称ユニバーサル配列を含む断片から非対称ユニバーサル配列を含む断片に変換し、そこから配列データを取得することに関する。 FIELD OF THE INVENTION
Embodiments of the present disclosure relate to preparing nucleic acids for sequencing. In particular, embodiments of the methods, compositions, systems, and kits provided herein relate to converting a nucleic acid library from fragments containing symmetric universal sequences to fragments containing asymmetric universal sequences, and obtaining sequence data therefrom.

次世代シークエンシング（Next-generation sequencing、ＮＧＳ）技術により、ゲノム研究が革命的に変化する。有効であることが証明されたＮＧＳへの１つのアプローチは、断片が各末端に異なるアダプターを有するように処理されるシークエンシングライブラリーの生成である。次いで、ペアエンドシークエンシングを使用して、両方の鎖から配列情報を得る。ペアエンドアプローチの利点は、ランダムな様式で２つの独立した鋳型のそれぞれからの「ｎ」個の塩基をシークエンシングするよりも、単一の鋳型からの「ｎ」個の塩基をそれぞれ２本シークエンシングする方が、有意により多くの情報を得られる、ということである。しかしながら、各末端に異なるアダプターを付加するための方法は、第１のアダプターをＤＮＡ断片の一端及び第２のアダプターを同じＤＮＡ断片の他方の末端に選択的に標的化することが困難であるため、多くの場合非効率的である。例えば、シークエンシングライブラリーは、高度に効率的なタグメンテーションを使用して生成することができるが、実行可能なシークエンシングライブラリー分子は、順方向又は逆方向の一次配列の形態で異なるアダプターが分子の各末端に組み込まれている場合にのみ生成される。いくつかのタグメンテーション反応中に、２つの配列の各々が組み込まれる確率は等しく、したがって、半分の分子が順方向－順方向又は逆方向－逆方向アダプターの組み合わせを有するという結果になり、それによって理論収率が５０％に低減する。 Next-generation sequencing (NGS) technology is revolutionizing genomic research. One approach to NGS that has proven effective is the generation of sequencing libraries in which fragments are processed to have different adapters at each end. Paired-end sequencing is then used to obtain sequence information from both strands. The advantage of the paired-end approach is that sequencing two "n" bases from a single template yields significantly more information than sequencing "n" bases from each of two independent templates in a random fashion. However, methods for adding different adapters to each end are often inefficient due to the difficulty of selectively targeting one adapter to one end of a DNA fragment and a second adapter to the other end of the same DNA fragment. For example, while sequencing libraries can be generated using highly efficient tagmentation, viable sequencing library molecules are only generated when different adapters are incorporated at each end of the molecule in the form of forward or reverse primary sequences. During some tagmentation reactions, each of the two sequences has an equal probability of being incorporated, resulting in half of the molecules having a forward-forward or reverse-reverse adapter combination, thereby reducing the theoretical yield to 50%.

本明細書に提示されるのは、核酸をシークエンシングライブラリーに効率的に変換する方法及び組成物である。本明細書に提示される方法は、核酸の上鎖、核酸の下鎖、又は核酸の上鎖及び下鎖の両方について、順方向及び逆方向の両方のアダプターでタグ付けされた標的核酸のライブラリーを生成するためにアダプター交換を使用する代替戦略を含む。本方法は、全ゲノムシークエンシング、ゲノム立体配座捕捉、循環ＤＮＡシークエンシング、標的シークエンシング、２つ以上の分析物のコアッセイ、例えば、ＲＮＡとＡＴＡＣ又はＤＮＡとＲＮＡ、及び単一細胞ゲノムを含むがこれらに限定されない、広範囲にわたるシークエンシングライブラリー調製方法に有用である。更に、このフォーマットにより、アダプター内に埋め込まれた１つ以上のインデックス配列の使用が可能になり、単一細胞コンビナトリアルインデックス付け（single-cell combinatorial indexing、ｓｃｉ）の適用が可能になる（例えば、Ｃｕｓａｎｏｖｉｃｈ，ｅｔａｌ．，Ｓｃｉｅｎｃｅ３４８，９１０－９１４（２０１５）、Ｖｉｔａｋｅｔａｌ．，Ｎａｔ．Ｍｅｔｈｏｄｓ１４，３０２－３０８（２０１７）、Ｍｕｌｑｕｅｅｎｅｔａｌ．，Ｎａｔ．Ｂｉｏｔｅｃｈｎｏｌ．３６，４２８－４３１（２０１８））。本明細書で提供される方法は、データ品質の改善、例えば、ｓｃｉ－ＨｉＣと比較した場合、ｓ３－ＡＴＡＣの場合は、シグナル濃縮を犠牲にすることなく細胞ごとに得られる通過読み取りに関して既知の方法に対する顕著な改善、ｓ３－ＷＧＳの場合はカバレッジ均一性、及びｓ３－ＧＣＣの場合は細胞ごとに得られるクロマチン接触の改善をもたらす。ｓ３－ＡＴＡＣ、ｓ３－ＷＧＳ、及びｓ３－ＧＣＣが本明細書に記載されている。 Presented herein are methods and compositions for efficiently converting nucleic acids into sequencing libraries. The methods presented herein include an alternative strategy that uses adapter exchange to generate libraries of target nucleic acids tagged with both forward and reverse adapters for the top strand of the nucleic acid, the bottom strand of the nucleic acid, or both the top and bottom strands of the nucleic acid. The methods are useful for a wide range of sequencing library preparation methods, including, but not limited to, whole genome sequencing, genome conformation capture, circular DNA sequencing, targeted sequencing, co-assays of two or more analytes, e.g., RNA and ATAC or DNA and RNA, and single-cell genomes. Furthermore, this format allows for the use of one or more index sequences embedded within the adapters, enabling the application of single-cell combinatorial indexing (sci) (e.g., Cusanovich, et al., Science 348, 910-914 (2015); Vitak et al., Nat. Methods 14, 302-308 (2017); Mulqueen et al., Nat. Biotechnol. 36, 428-431 (2018)). The methods provided herein provide improved data quality, e.g., when compared to sci-HiC, significant improvements over known methods in terms of passage reads obtained per cell without sacrificing signal enrichment in the case of s3-ATAC, coverage uniformity in the case of s3-WGS, and improved chromatin contacts obtained per cell in the case of s3-GCC. s3-ATAC, s3-WGS, and s3-GCC are described herein.

定義
本明細書で使用される用語は、別段の指定がない限り、関連技術の通常の意味をとるものと理解されるであろう。本明細書で使用されるいくつかの用語及びそれらの意味は、以下に記載される。 Definitions Terms used herein will be understood to have their ordinary meaning in the relevant art unless otherwise specified. Some terms used herein and their meanings are set forth below.

本明細書で使用される場合、用語「生物」及び「対象」とは、交換可能に使用され、微生物（例えば、原核生物又は真核生物）、動物、及び植物を指す。動物の例は、ヒトなどの哺乳類である。 As used herein, the terms "organism" and "subject" are used interchangeably and refer to microorganisms (e.g., prokaryotes or eukaryotes), animals, and plants. An example of an animal is a mammal, such as a human.

本明細書で使用される場合、用語「標的核酸」とは、核酸に関して使用する場合、本明細書に記載の方法又は組成物の文脈における核酸の意味的識別子として意図され、別途明示的に示されるもの以外の核酸の構造又は機能を必ずしも限定するものではない。標的核酸は、本質的に既知又は未知の配列の任意の核酸であってもよい。例えば、ゲノムＤＮＡ断片（例えば、染色体ＤＮＡ）、プラスミドなどの染色体外ＤＮＡ、循環ＤＮＡ、又は循環ＲＮＡ、１つの細胞又は複数の細胞からの核酸、無細胞ＤＮＡ、ＲＮＡ（例えば、ｍＲＮＡ）、又はｃＤＮＡであり得る。シークエンシングは、標的分子の全体又は一部の配列の決定をもたらし得る。標的は、核などの一次核酸試料に由来し得る。一実施形態では、標的は、各標的断片の末端にユニバーサル配列を配置することによって増幅に好適な鋳型に処理することができる。標的はまた、ｃＤＮＡへの逆転写によって一次ＲＮＡ試料から得ることもできる。一実施形態では、標的は、細胞内のＤＮＡ又はＲＮＡのサブセットに関して使用される。標的シークエンシングは、一般にはＰＣＲ増幅（例えば、領域特異的プライマー）又はハイブリダイゼーションベースの捕捉法又は抗体のいずれかによる、対象とする遺伝子の選択及び単離を使用する。標的濃縮は、方法の様々な段階で行うことができる。例えば、標的ＲＮＡ表現は、逆転写工程で標的特異的プライマーを使用するか、より複雑なライブラリーからサブセットをハイブリダイゼーションベースで濃縮することで得られる。例としては、エクソームシークエンシング又はＬ１０００アッセイがある（Ｓｕｂｒａｍａｎｉａｎｅｔａｌ．，２０１７，Ｃｅｌｌ，１７１；１４３７－１４５２）。標的シークエンシングは、当業者に既知の濃縮プロセスのいずれかを含み得る。ユニバーサル配列の一端又は両端を有する標的核酸は、修飾標的核酸と称され得る。標的核酸など核酸への言及は、別途記載のない限り、一本鎖核酸及び二本鎖核酸の両方を含む。例えば、対称及び非対称の標的核酸は、本開示の方法において、二本鎖、一本鎖、又はある点で部分的に二本鎖及び一本鎖であり得る。 As used herein, the term "target nucleic acid," when used with respect to a nucleic acid, is intended as a semantic identifier of the nucleic acid in the context of a method or composition described herein and does not necessarily limit the structure or function of the nucleic acid other than as otherwise expressly indicated. A target nucleic acid can be essentially any nucleic acid of known or unknown sequence. For example, it can be a genomic DNA fragment (e.g., chromosomal DNA), extrachromosomal DNA such as a plasmid, circulating DNA or circulating RNA, nucleic acid from a cell or multiple cells, cell-free DNA, RNA (e.g., mRNA), or cDNA. Sequencing can result in determining the sequence of all or part of a target molecule. Targets can be derived from a primary nucleic acid sample, such as a nucleus. In one embodiment, targets can be processed into templates suitable for amplification by placing universal sequences at the ends of each target fragment. Targets can also be obtained from a primary RNA sample by reverse transcription into cDNA. In one embodiment, targets are used with respect to a subset of DNA or RNA within a cell. Targeted sequencing typically uses selection and isolation of genes of interest, either by PCR amplification (e.g., region-specific primers) or hybridization-based capture methods or antibodies. Target enrichment can occur at various stages of the method. For example, target RNA representation can be achieved using target-specific primers in a reverse transcription step or by hybridization-based enrichment of subsets from a more complex library. Examples include exome sequencing or the L1000 assay (Subramanian et al., 2017, Cell, 171; 1437-1452). Target sequencing can include any enrichment process known to those skilled in the art. Target nucleic acids having a universal sequence at one or both ends may be referred to as modified target nucleic acids. References to nucleic acids, such as target nucleic acids, include both single-stranded and double-stranded nucleic acids unless otherwise specified. For example, symmetric and asymmetric target nucleic acids can be double-stranded, single-stranded, or partially double-stranded and single-stranded in some respects in the methods of the present disclosure.

本明細書で使用する場合、用語「アダプター」及びその派生語、例えば、ユニバーサルアダプターとは、一般に、標的核酸に付加され得る任意の線状オリゴヌクレオチドを指す。アダプターは、一本鎖又は二本鎖ＤＮＡであり得るか、又は二本鎖領域及び一本鎖領域の両方を含み得る。アダプターは、プライマー、例えばユニバーサルプライマーの少なくとも一部と実質的に同一であるか、又は実質的に相補的である配列；下流エラー補正、識別、又はシークエンシングを補助するためのインデックス（本明細書ではバーコード又はタグとも呼ばれる）、並びに／又はＵＭＩを含み得る。いくつかの実施形態では、アダプターは、試料中に存在する任意の標的配列の３’末端又は５’末端に実質的に非相補的である。いくつかの実施形態では、好適なアダプターの長さは、約６～１００ヌクレオチド、約１２～６０ヌクレオチド、又は約１５～５０ヌクレオチドの長さの範囲である。例えば、用語「アダプター（adaptor）」及び「アダプター（adapter）」は、交換可能に使用される。 As used herein, the term "adapter" and its derivatives, such as universal adapter, generally refer to any linear oligonucleotide that can be added to a target nucleic acid. Adapters can be single-stranded or double-stranded DNA, or can contain both double-stranded and single-stranded regions. Adapters can include a sequence that is substantially identical to or substantially complementary to at least a portion of a primer, such as a universal primer; an index (also referred to herein as a barcode or tag) to aid in downstream error correction, identification, or sequencing; and/or a UMI. In some embodiments, adapters are substantially non-complementary to the 3' or 5' end of any target sequence present in a sample. In some embodiments, suitable adapter lengths range from about 6 to 100 nucleotides, about 12 to 60 nucleotides, or about 15 to 50 nucleotides in length. For example, the terms "adaptor" and "adapter" are used interchangeably.

本明細書で使用するとき、用語「ユニバーサル」は、ヌクレオチド配列を記述するために使用する場合、２つ以上の核酸分子に共通する配列の領域を指し、分子はまた、互いに異なる配列の領域を有する。核酸コレクションの異なるメンバー内に存在するユニバーサル配列は、インデックスなどの別のヌクレオチド配列を標的核酸に付加するためのプライマーとして使用され得るヌクレオチド配列をアニーリングする後続工程において、「ランディングパッド」として使用され得る。核酸コレクションの異なるメンバー内に存在するユニバーサル配列は、ユニバーサル捕捉核酸の集団、例えば、ユニバーサル配列の一部に相補的な捕捉オリゴヌクレオチド、例えば、ユニバーサル捕捉配列を使用して、複数の異なる核酸を捕捉することができる。ユニバーサル捕捉配列の非限定的な例としては、Ｐ５及びＰ７プライマーと同一又は相補的な配列が挙げられる。同様に、分子の集合の異なるメンバーに存在するユニバーサル配列は、ユニバーサル配列の一部に相補的なユニバーサルプライマーの集団、例えば、ユニバーサルアンカー配列を使用して、複数の異なる核酸を複製（例えば、シークエンシング）又は増幅することができる。用語「Ａ１４」及び「Ｂ１５」とは、ユニバーサルアンカー配列を指す場合に使用され得る。用語「Ａ１４’」（Ａ１４プライム）及び「Ｂ１５’」（Ｂ１５プライム）は、それぞれＡ１４及びＢ１５の相補体を指す。本明細書に提示される方法において、任意の好適なユニバーサルアンカー配列を使用し得ること、及びＡ１４及びＢ１５の使用は例示的な実施形態のみであることが理解されるであろう。一実施形態では、ユニバーサルアンカー配列は、ユニバーサルプライマー（例えば、リード１又はリード２のためのシークエンシングプライマー）がシークエンシングのためにアニーリングする部位として使用される。したがって、捕捉オリゴヌクレオチド又はユニバーサルプライマーは、ユニバーサル配列に特異的にハイブリダイズすることができる配列を含む。 As used herein, the term "universal," when used to describe a nucleotide sequence, refers to a region of sequence common to two or more nucleic acid molecules, with the molecules also having regions of sequence that differ from one another. Universal sequences present in different members of a nucleic acid collection can be used as "landing pads" in subsequent steps to anneal nucleotide sequences that can be used as primers to add additional nucleotide sequences, such as indexes, to target nucleic acids. Universal sequences present in different members of a nucleic acid collection can capture multiple different nucleic acids using a population of universal capture nucleic acids, e.g., capture oligonucleotides complementary to a portion of the universal sequence, e.g., universal capture sequences. Non-limiting examples of universal capture sequences include sequences identical to or complementary to P5 and P7 primers. Similarly, universal sequences present in different members of a collection of molecules can replicate (e.g., sequence) or amplify multiple different nucleic acids using a population of universal primers complementary to a portion of the universal sequence, e.g., universal anchor sequences. The terms "A14" and "B15" can be used to refer to universal anchor sequences. The terms "A14'" (A14 prime) and "B15'" (B15 prime) refer to the complements of A14 and B15, respectively. It will be understood that any suitable universal anchor sequence may be used in the methods presented herein, and the use of A14 and B15 is only an exemplary embodiment. In one embodiment, a universal anchor sequence is used as the site to which a universal primer (e.g., a sequencing primer for Read 1 or Read 2) anneals for sequencing. Thus, the capture oligonucleotide or universal primer comprises a sequence that can specifically hybridize to a universal sequence.

用語「Ｐ５」及び「Ｐ７」は、ユニバーサル捕捉配列又は捕捉オリゴヌクレオチドを指す場合に使用され得る。用語「Ｐ５’」（Ｐ５プライム）及び「Ｐ７’」（Ｐ７プライム）は、それぞれＰ５及びＰ７の相補体を指す。本明細書に提示される方法において、任意の好適なユニバーサル捕捉配列又は捕捉ヌクレオチドを使用することができ、Ｐ５及びＰ７の使用は例示的な実施形態のみであることが理解されるであろう。フローセル上でのＰ５及びＰ７又はそれらの相補体などの捕捉ヌクレオチドの使用は、国際公開第２００７／０１０２５１号、同第２００６／０６４１９９号、同第２００５／０６５８１４号、同第２０１５／１０６９４１号、同第１９９８／０４４１５１号、及び同第２０００／０１８９５７号の開示によって例示されるように、当技術分野において既知である。例えば、任意の好適な順方向増幅プライマーは、固定化されているか又は溶液中にあるかに関わらず、相補的配列及び配列の増幅のために本明細書に提示される方法において有用であり得る。同様に、任意の好適な逆増幅プライマーは、固定化されているか又は溶液中にあるかに関わらず、相補的配列及び配列の増幅のために本明細書に提示される方法において有用であり得る。当業者であれば、本明細書に提示される核酸の捕捉及び／又は増幅に好適なプライマー配列の設計及び使用方法を理解するであろう。 The terms "P5" and "P7" may be used to refer to universal capture sequences or capture oligonucleotides. The terms "P5'" (P5 prime) and "P7'" (P7 prime) refer to the complements of P5 and P7, respectively. It will be understood that any suitable universal capture sequence or capture nucleotide can be used in the methods presented herein, and the use of P5 and P7 is only an exemplary embodiment. The use of capture nucleotides such as P5 and P7 or their complements on flow cells is known in the art, as exemplified by the disclosures of WO 2007/010251, WO 2006/064199, WO 2005/065814, WO 2015/106941, WO 1998/044151, and WO 2000/018957. For example, any suitable forward amplification primer, whether immobilized or in solution, can be useful in the methods provided herein for amplifying complementary sequences and sequences. Similarly, any suitable reverse amplification primer, whether immobilized or in solution, can be useful in the methods provided herein for amplifying complementary sequences and sequences. Those of skill in the art will understand how to design and use suitable primer sequences for capturing and/or amplifying nucleic acids as provided herein.

本明細書で使用する場合、用語「プライマー」及びその派生語は、一般に、対象とする標的配列にハイブリダイズすることができる任意の核酸を指す。典型的には、プライマーは、ヌクレオチドがポリメラーゼによって重合され得るか、又はポリヌクレオチドがライゲートされ得る基質として機能するが、いくつかの実施形態では、プライマーは、合成された核酸鎖に組み込まれ、別のプライマーがハイブリダイズして、合成された核酸分子に相補的な新たな鎖合成をプライムすることができる部位を提供することができる。プライマーは、ヌクレオチド又はその類似体の任意の組み合わせを含み得る。いくつかの実施形態では、プライマーは、一本鎖オリゴヌクレオチド又はポリヌクレオチドである。用語「ポリヌクレオチド」及び「オリゴヌクレオチド」とは、任意の長さのヌクレオチドのポリマー形態を指すために本明細書において交換可能に使用され、リボヌクレオチド、デオキシリボヌクレオチド、これらの類似体、又はこれらの混合物を含み得る。これらの用語は、同等物として、ヌクレオチド類似体から作製されたＤＮＡ、ＲＮＡ、ｃＤＮＡ、又は抗体－オリゴ共役のいずれかの類似体を含み、一本鎖（センス又はアンチセンスなど）及び二本鎖ポリヌクレオチドに適用可能であることを理解されたい。本明細書で使用するこの用語はまた、例えば逆転写酵素の作用によって、ＲＮＡ鋳型から生成される相補的又はコピーＤＮＡであるｃＤＮＡも包含する。この用語は、分子の一次構造のみを指す。したがって、この用語は、三本鎖、二本鎖、及び一本鎖デオキシリボ核酸（「ＤＮＡ」）、並びに三本鎖、二本鎖、及び一本鎖リボ核酸（「ＲＮＡ」）を含む。 As used herein, the term "primer" and its derivatives generally refer to any nucleic acid capable of hybridizing to a target sequence of interest. Typically, a primer serves as a substrate onto which nucleotides can be polymerized by a polymerase or onto which a polynucleotide can be ligated; however, in some embodiments, a primer can be incorporated into a synthesized nucleic acid strand, providing a site to which another primer can hybridize and prime synthesis of a new strand complementary to the synthesized nucleic acid molecule. A primer can comprise any combination of nucleotides or their analogs. In some embodiments, a primer is a single-stranded oligonucleotide or polynucleotide. The terms "polynucleotide" and "oligonucleotide" are used interchangeably herein to refer to polymeric forms of nucleotides of any length and can include ribonucleotides, deoxyribonucleotides, their analogs, or mixtures thereof. It should be understood that these terms include, as equivalents, analogs of any of DNA, RNA, cDNA, or antibody-oligoconjugates made from nucleotide analogs, and are applicable to single-stranded (e.g., sense or antisense) and double-stranded polynucleotides. As used herein, the term also encompasses cDNA, which is complementary or copy DNA produced from an RNA template, for example, by the action of reverse transcriptase. The term refers only to the primary structure of the molecule. Thus, the term includes triple-, double-, and single-stranded deoxyribonucleic acid ("DNA"), as well as triple-, double-, and single-stranded ribonucleic acid ("RNA").

本明細書で使用する場合、「インデックス」（「インデックス領域」、「インデックスアダプター」、「タグ」、又は「バーコード」とも呼ばれる）とは、核酸材料の試料若しくは供給源、又は標的核酸が存在する区画を識別するために使用することができる固有の核酸タグを指す。インデックスは、溶液中若しくは固体支持体上に存在し得るか、又は固体支持体に付着又は結合され、溶液若しくは区画に放出され得る。核酸試料が複数の供給源に由来する場合、各核酸試料中の核酸は、試料の供給源を特定することができるように、異なる核酸タグでタグ付けすることができる。当技術分野で知られているように、及び米国特許第８，０５３，１９２号、国際公開第０５／０６８６５６号、及び米国特許出願公開第２０１３／０２７４１１７号の開示によって例示されるように、任意の適切なインデックス又はインデックスのセットを使用することができる。いくつかの実施形態では、インデックスは、Ｉｌｌｕｍｉｎａ社（ＳａｎＤｉｅｇｏ，ＣＡ）の６塩基インデックス１（ｉ７）配列、８塩基インデックス１（ｉ７）配列、８塩基インデックス２（ｉ５ｅ）配列、１０塩基インデックス１（ｉ７）配列、又は１０塩基インデックス２（ｉ５）配列を含み得る。 As used herein, "index" (also referred to as "index region," "index adapter," "tag," or "barcode") refers to a unique nucleic acid tag that can be used to identify a sample or source of nucleic acid material, or a compartment in which a target nucleic acid is present. The index can be in solution or on a solid support, or can be attached or bound to a solid support and released into a solution or compartment. When nucleic acid samples are derived from multiple sources, the nucleic acids in each nucleic acid sample can be tagged with a different nucleic acid tag so that the source of the sample can be identified. Any suitable index or set of indexes can be used, as known in the art and as exemplified by the disclosures of U.S. Pat. No. 8,053,192, WO 05/068656, and U.S. Patent Application Publication No. 2013/0274117. In some embodiments, the index may include a 6-base index 1 (i7) sequence, an 8-base index 1 (i7) sequence, an 8-base index 2 (i5e) sequence, a 10-base index 1 (i7) sequence, or a 10-base index 2 (i5) sequence from Illumina (San Diego, CA).

本明細書で使用するとき、用語「固有分子識別子」又は「ＵＭＩ」は、核酸に付けられ得る、ランダム、非ランダム、又は半ランダムのいずれかの分子タグを指す。核酸に組み込まれる場合、増幅後にシークエンシングされる固有分子識別子（unique molecular identifier、ＵＭＩ）を直接カウントすることによって、ＵＭＩを使用して後続の増幅バイアスを補正することができる。ＵＭＩは、同様の核酸、例えば、アダプターに結合することができ、各核酸を固有にする。 As used herein, the term "unique molecular identifier" or "UMI" refers to a molecular tag that can be attached to a nucleic acid, either randomly, non-randomly, or semi-randomly. When incorporated into a nucleic acid, a unique molecular identifier (UMI) can be used to correct for subsequent amplification bias by directly counting the UMI when sequenced after amplification. A UMI can bind to a similar nucleic acid, e.g., an adapter, making each nucleic acid unique.

本明細書で使用するとき、用語「アンプリコン」は、核酸に関して使用する場合、核酸のコピーの生成物を意味し、この生成物は、核酸のヌクレオチド配列の少なくとも一部と同じ又は相補的なヌクレオチド配列を有する。アンプリコンは、例えばポリメラーゼ伸長、ポリメラーゼ連鎖反応（polymerase chain reaction、ＰＣＲ）、ローリングサークル増幅（rolling circle amplification、ＲＣＡ）、ライゲーション伸長、又はライゲーション連鎖反応を含む鋳型として、核酸又はそのアンプリコンを使用する様々な増幅法のいずれかによって生成することができる。アンプリコンは、特定のヌクレオチド配列（例えば、ＰＣＲ生成物）の単一コピー又はヌクレオチド配列（例えば、ＲＣＡのコンカテマー生成物）の複数のコピーを有する核酸分子であり得る。標的核酸の第１のアンプリコンは、典型的には相補的なコピーである。後続のアンプリコンは、第１のアンプリコンの生成後に、標的核酸又は第１のアンプリコンから作成されたコピーである。後続のアンプリコンは、標的核酸と実質的に相補的であるか、又は標的核酸と実質的に同一である配列を有し得る。 As used herein, the term "amplicon," when used with reference to a nucleic acid, refers to the product of copying a nucleic acid, which product has a nucleotide sequence that is the same as or complementary to at least a portion of the nucleotide sequence of the nucleic acid. Amplicons can be generated by any of a variety of amplification methods using a nucleic acid or its amplicon as a template, including, for example, polymerase extension, polymerase chain reaction (PCR), rolling circle amplification (RCA), ligation extension, or ligation chain reaction. An amplicon can be a nucleic acid molecule having a single copy of a particular nucleotide sequence (e.g., a PCR product) or multiple copies of a nucleotide sequence (e.g., a concatemeric product of RCA). A first amplicon of a target nucleic acid is typically a complementary copy. Subsequent amplicons are copies made from the target nucleic acid or first amplicon after generation of the first amplicon. Subsequent amplicons can have a sequence that is substantially complementary to or substantially identical to the target nucleic acid.

本明細書で使用する場合、「増幅する」、「増幅」又は「増幅反応」及びそれらの派生語は、一般に、核酸分子の少なくとも一部が少なくとも１つの追加の核酸分子に複製又はコピーされる任意の作用又はプロセスを指す。追加の核酸分子は、任意選択で、鋳型核酸分子の少なくとも一部と実質的に同一であるか、又は実質的に相補的である配列を含む。鋳型核酸分子は一本鎖又は二本鎖であってよく、追加の核酸分子は、独立して一本鎖又は二本鎖であり得る。増幅は、核酸分子の線形又は指数関数的複製を任意選択的に含む。いくつかの実施形態では、このような増幅は、等温条件を使用して行うことができ、他の実施形態では、このような増幅は、熱サイクリングを含み得る。いくつかの実施形態では、増幅は、単一増幅反応における複数の標的配列の同時増幅を含む多重増幅である。いくつかの実施形態では、「増幅」は、ＤＮＡ及びＲＮＡベースの核酸の少なくとも一部を単独で、又は組み合わせて増幅することを含む。増幅反応は、当業者に既知の増幅プロセスのいずれかを含み得る。いくつかの実施形態では、増幅反応は、ポリメラーゼ連鎖反応（polymerase chain reaction、ＰＣＲ）を含む。 As used herein, "amplifying," "amplification," or "amplification reaction," and derivatives thereof, generally refer to any act or process in which at least a portion of a nucleic acid molecule is duplicated or copied onto at least one additional nucleic acid molecule. The additional nucleic acid molecule optionally comprises a sequence that is substantially identical to or substantially complementary to at least a portion of a template nucleic acid molecule. The template nucleic acid molecule may be single-stranded or double-stranded, and the additional nucleic acid molecules may independently be single-stranded or double-stranded. Amplification optionally involves linear or exponential replication of the nucleic acid molecule. In some embodiments, such amplification can be performed using isothermal conditions; in other embodiments, such amplification can involve thermal cycling. In some embodiments, amplification is multiplex amplification, which involves simultaneous amplification of multiple target sequences in a single amplification reaction. In some embodiments, "amplification" involves amplifying at least a portion of DNA- and RNA-based nucleic acids, alone or in combination. The amplification reaction can involve any amplification process known to those of skill in the art. In some embodiments, the amplification reaction involves polymerase chain reaction (PCR).

本明細書で使用する場合、用語「ポリメラーゼ連鎖反応」（「ＰＣＲ」）とは、クローニング又は精製することなくゲノムＤＮＡの混合物中の対象となるポリヌクレオチドのセグメントの濃度を増加させるための方法を記載するＭｕｌｌｉｓの方法（米国特許第４，６８３，１９５号及び同第４，６８３，２０２号）を指す。対象のポリヌクレオチドを増幅するためのこのプロセスは、所望の対象ポリヌクレオチドを含有するＤＮＡ混合物に、多量の過剰の２つのオリゴヌクレオチドプライマーを導入すること、続いてＤＮＡポリメラーゼの存在下で一連の熱サイクリングを行うことからなる。２つのプライマーは、対象の二本鎖ポリヌクレオチドのそれぞれの鎖に相補的である。最初に混合物がより高温で変性され、次いで、プライマーが、目的の分子のポリヌクレオチド内の相補的配列にアニーリングされる。アニーリング後、プライマーをポリメラーゼで伸長させて、相補鎖の新しい対を形成する。変性、プライマーアニーリング、及びポリメラーゼ伸長は、所望の目的ポリヌクレオチドの高濃度の増幅セグメントを得るために、何度も繰り返され得る（熱サイクリングと呼ばれる）。所望の目的ポリヌクレオチドの増幅セグメントの長さ（アンプリコン）は、互いに対するプライマーの相対位置によって決定され、したがって、この長さは制御可能なパラメータである。このプロセスを繰り返すことにより、この方法はＰＣＲと呼ばれる。対象となるポリヌクレオチドの所望の増幅セグメントは、混合物中の主要な核酸配列（濃度に関して）になるため、これらは「ＰＣＲ増幅された」と言われる。上記の方法の修飾において、標的核酸分子は、複数の異なるプライマー対を使用してＰＣＲ増幅することができ、場合によっては、対象とする標的核酸分子当たり１つ以上のプライマー対を使用してＰＣＲ増幅することができ、それによって多重ＰＣＲ反応を形成することができる。 As used herein, the term "polymerase chain reaction" ("PCR") refers to the Mullis method (U.S. Pat. Nos. 4,683,195 and 4,683,202), which describes a method for increasing the concentration of a polynucleotide segment of interest in a mixture of genomic DNA without cloning or purification. This process for amplifying a polynucleotide of interest involves introducing a large excess of two oligonucleotide primers to a DNA mixture containing the desired polynucleotide of interest, followed by a series of thermal cycling reactions in the presence of a DNA polymerase. The two primers are complementary to each strand of the double-stranded polynucleotide of interest. The mixture is first denatured at a higher temperature, and then the primers are annealed to complementary sequences within the polynucleotide of the molecule of interest. After annealing, the primers are extended with a polymerase to form a new pair of complementary strands. Denaturation, primer annealing, and polymerase extension can be repeated multiple times (called thermal cycling) to obtain a highly concentrated amplified segment of the desired polynucleotide of interest. The length of the amplified segment of the desired target polynucleotide (the amplicon) is determined by the relative positions of the primers with respect to each other; therefore, this length is a controllable parameter. By repeating this process, the method is called PCR. Because the desired amplified segments of the target polynucleotide become the predominant nucleic acid sequences (in terms of concentration) in the mixture, they are said to be "PCR amplified." In a modification of the above method, target nucleic acid molecules can be PCR amplified using multiple different primer pairs, and in some cases, more than one primer pair per target nucleic acid molecule of interest, thereby forming a multiplex PCR reaction.

本明細書で使用するとき、「増幅条件」及びその派生語は、一般に、１つ以上の核酸配列を増幅するのに好適な条件を指す。このような増幅は、線形又は指数関数的であり得る。いくつかの実施形態では、増幅条件は、等温条件を含むことができ、あるいは、熱サイクリング条件、又は等温及び熱サイクリング条件の組み合わせを含み得る。いくつかの実施形態では、１つ以上の核酸配列を増幅するのに好適な条件としては、ポリメラーゼ連鎖反応（ＰＣＲ）条件が挙げられる。典型的には、増幅条件は、ユニバーサル配列、若しくは標的特異的プライマーに隣接する１つ以上の標的配列などの核酸を増幅するか、又は１つ以上のアダプターに隣接する増幅標的配列を増幅するのに十分な反応混合物を指す。一般に、増幅条件は、増幅用の触媒、又は核酸合成、例えばポリメラーゼ、増幅される核酸に対してある程度相補性を有するプライマー、及び核酸にハイブリダイズしたときにプライマーの伸長を促進するためのデオキシリボヌクレオチド三リン酸（deoxyribonucleotide triphosphate、ｄＮＴＰ）などのヌクレオチドを含む。増幅条件は、プライマーの核酸へのハイブリダイゼーション又はアニーリング、プライマーの伸長、及び伸長プライマーが増幅を受ける核酸配列から分離される変性を必要とし得る。典型的には、必ずしもそうとは限らないが、増幅条件は、熱サイクリングを含み得るが、いくつかの実施形態では、増幅条件は、アニーリング、伸長、及び分離の工程が繰り返される複数のサイクルを含む。典型的には、増幅条件としては、Ｍｇ^２＋又はＭｎ^２＋などのカチオンが挙げられ、イオン強度の様々な改質剤も含み得る。 As used herein, "amplification conditions" and its derivatives generally refer to conditions suitable for amplifying one or more nucleic acid sequences. Such amplification can be linear or exponential. In some embodiments, amplification conditions can include isothermal conditions, or can include thermal cycling conditions, or a combination of isothermal and thermal cycling conditions. In some embodiments, conditions suitable for amplifying one or more nucleic acid sequences include polymerase chain reaction (PCR) conditions. Typically, amplification conditions refer to a reaction mixture sufficient to amplify a nucleic acid, such as one or more target sequences flanked by a universal sequence or target-specific primer, or to amplify an amplification target sequence flanked by one or more adapters. Generally, amplification conditions include a catalyst for amplification or nucleic acid synthesis, e.g., a polymerase, primers having a degree of complementarity to the nucleic acid to be amplified, and nucleotides such as deoxyribonucleotide triphosphates (dNTPs) to facilitate primer extension when hybridized to the nucleic acid. Amplification conditions can require hybridization or annealing of primers to the nucleic acid, extension of the primers, and denaturation, in which the extended primers are separated from the nucleic acid sequence undergoing amplification. Typically, although not necessarily, amplification conditions may include thermal cycling, although in some embodiments, amplification conditions include multiple cycles in which the steps of annealing, extension, and separation are repeated. Typically, amplification conditions include cations such as ^Mg or ^Mn , and may also include various modifiers of ionic strength.

本明細書で定義するように、「多重増幅」は、少なくとも１つの標的特異的プライマーを使用した、試料内の２つ以上の標的配列の選択的かつ非ランダム増幅を指す。いくつかの実施形態では、標的配列の一部又は全てが単一の反応容器内で増幅されるように多重増幅が行われる。所与の多重増幅の「プレックス」は、一般に、当該単一多重増幅中に増幅される、異なる標的特異的配列の数を指す。いくつかの実施形態では、プレックスは、約１２プレックス、２４プレックス、４８プレックス、９６プレックス、１９２プレックス、３８４プレックス、７６８プレックス、１５３６プレックス、３０７２プレックス、６１４４プレックス、又はそれ以上であり得る。増幅された標的配列をいくつかの異なる方法論（例えば、ゲル電気泳動とそれに続くデンシトメトリー、バイオアナライザー又は定量的ＰＣＲによる定量化、標識プローブでのハイブリダイゼーション、ビオチン化プライマーの組み込みとそれに続くアビジン－酵素共役の検出、増幅標的配列への^３２Ｐ標識デオキシヌクレオチド三リン酸の組み込み）によって検出することも可能である。 As defined herein, "multiplex amplification" refers to the selective, non-random amplification of two or more target sequences within a sample using at least one target-specific primer. In some embodiments, multiplex amplification is performed such that some or all of the target sequences are amplified in a single reaction vessel. The "plex" of a given multiplex amplification generally refers to the number of different target-specific sequences amplified during that single multiplex amplification. In some embodiments, the plex can be about 12-plex, 24-plex, 48-plex, 96-plex, 192-plex, 384-plex, 768-plex, 1536-plex, 3072-plex, 6144-plex, or more. Amplified target sequences can also be detected by several different methodologies, such as gel electrophoresis followed by densitometry, quantification by bioanalyzer or quantitative PCR, hybridization with labeled probes, incorporation of biotinylated primers followed by avidin-enzyme conjugate detection, and incorporation of ^32P -labeled deoxynucleotide triphosphates into the amplified target sequences.

本明細書で使用するとき、用語「増幅部位」は、１つ以上のアンプリコンが生成され得るアレイ内又はアレイ上の部位を指す。増幅部位は、その部位で生成される少なくとも１つのアンプリコンを含有、保持、又は付着させるように更に構成することができる。 As used herein, the term "amplification site" refers to a site within or on an array where one or more amplicons can be generated. An amplification site can be further configured to contain, retain, or attach at least one amplicon generated at that site.

本明細書で使用するとき、用語「アレイ」は、相対的な位置に従って互いに区別することができる部位の集団を指す。アレイの異なる部位にある異なる分子は、アレイ内の部位の位置に従って互いに区別することができる。アレイの個々の部位は、特定の種類の１つ以上の分子を含み得る。例えば、部位は、特定の配列を有する単一の標的核酸分子を含むことができ、又は部位は、同じ配列（及び／又はその相補的配列）を有するいくつかの核酸分子を含むことができる。アレイの部位は、同じ基質上に位置する異なる特徴とすることができる。例示的な特徴としては、液滴、基質中のウェル、基質中若しくは基質上のビーズ（又は他の粒子）、基質からの突起、基質上の隆起、又は基質内のチャネルが挙げられるが、これらに限定されない。アレイの部位は、それぞれ異なる分子を有する別個の基質とすることができる。別個の基質に付着した異なる分子は、基質が会合する表面上の基質の位置に従って、又は液体若しくはゲル内の基質の位置に従って特定することができる。別個の基質が表面上に配置される例示的なアレイとしては、ウェル内にビーズを有するものが挙げられるが、これらに限定されない。 As used herein, the term "array" refers to a collection of sites that can be distinguished from one another according to their relative positions. Different molecules at different sites of an array can be distinguished from one another according to the site's position within the array. Each site of an array can contain one or more molecules of a particular type. For example, a site can contain a single target nucleic acid molecule having a particular sequence, or a site can contain several nucleic acid molecules having the same sequence (and/or its complementary sequence). The sites of an array can be different features located on the same substrate. Exemplary features include, but are not limited to, droplets of liquid, wells in a substrate, beads (or other particles) in or on a substrate, protrusions from a substrate, bumps on a substrate, or channels within a substrate. The sites of an array can be separate substrates, each with a different molecule. The different molecules attached to the separate substrates can be identified according to the position of the substrate on a surface to which the substrates are associated, or according to the position of the substrate within a liquid or gel. An exemplary array in which separate substrates are located on a surface includes, but is not limited to, beads in wells.

本明細書で使用するとき、用語「区画」は、他の物から何かを分離又は単離する領域又は容積を意味することを意図する。例示的な区画としては、バイアル、チューブ、ウェル、液滴、ボーラス、ビーズ、容器、表面特徴、フローセル、又は流体の流れ、磁性、電流等の物理的な力によって分離された領域若しくは体積が挙げられるが、これらに限定されない。一実施形態では、区画は、９６又は３８４ウェルプレートなどのマルチウェルプレートのウェルである。本明細書で使用するとき、液滴は、１つ以上の核又は細胞を封入するためのビーズであり、ヒドロゲル組成物を含む、ヒドロゲルビーズを含み得る。いくつかの実施形態では、液滴は、ヒドロゲル材料の均質な液滴であるか、又はポリマーヒドロゲルシェルを有する中空液滴である。均質又は中空であるかどうかに関わらず、液滴は、１つ以上の核又は細胞を封入することが可能であり得る。いくつかの実施形態では、液滴は、界面活性剤安定化液滴である。いくつかの実施形態では、単一細胞又は核は、区画ごとに存在する。いくつかの実施形態では、区画ごとに２つ以上の細胞又は核が存在する。いくつかの実施形態では、各区画は、区画特異的インデックスを含む。いくつかの実施形態では、インデックスは、溶液中にあるか、又は各区画内の固相に付着若しくは結合している。 As used herein, the term "compartment" is intended to mean an area or volume that separates or isolates something from another. Exemplary compartments include, but are not limited to, vials, tubes, wells, droplets, boluses, beads, containers, surface features, flow cells, or areas or volumes separated by physical forces such as fluid flow, magnetism, or electric current. In one embodiment, a compartment is a well of a multi-well plate, such as a 96- or 384-well plate. As used herein, a droplet is a bead for encapsulating one or more nuclei or cells and may include hydrogel beads, including hydrogel compositions. In some embodiments, the droplets are homogenous droplets of hydrogel material or hollow droplets with a polymer hydrogel shell. Whether homogenous or hollow, the droplets may be capable of encapsulating one or more nuclei or cells. In some embodiments, the droplets are surfactant-stabilized droplets. In some embodiments, a single cell or nucleus is present per compartment. In some embodiments, two or more cells or nuclei are present per compartment. In some embodiments, each compartment includes a compartment-specific index. In some embodiments, the index is in solution or attached or bound to a solid phase within each compartment.

本明細書で使用する場合、用語「フローセル」とは、１つ以上の流体試薬を全体に流すことができる固体表面を含むチャンバーを指す。本開示の方法において容易に使用することができるフローセル及び関連する流体システム及び検出プラットフォームの例は、例えば、Ｂｅｎｔｌｅｙｅｔａｌ．，Ｎａｔｕｒｅ４５６：５３－５９（２００８）、国際公開第０４／０１８４９７号、米国特許第７，０５７，０２６号、国際公開第９１／０６６７８号、国際公開第０７／１２３７４４号、米国特許第７，３２９，４９２号、同第７，２１１，４１４号、同第７，３１５，０１９号、同第７，４０５，２８１号、及び米国特許出願公開第２００８／０１０８０８２号に記載されている。 As used herein, the term "flow cell" refers to a chamber containing a solid surface through which one or more fluid reagents can flow. Examples of flow cells and associated fluidic systems and detection platforms that can readily be used in the methods of the present disclosure are described, for example, in Bentley et al., Nature 456:53-59 (2008), WO 04/018497, U.S. Pat. No. 7,057,026, WO 91/06678, WO 07/123744, U.S. Pat. Nos. 7,329,492, 7,211,414, 7,315,019, and 7,405,281, and U.S. Patent Application Publication No. 2008/0108082.

本明細書で使用するとき、用語「クローン集団」は、特定のヌクレオチド配列に対して均質である核酸の集団を指す。均質な配列は、典型的には、少なくとも１０ヌクレオチド長であるが、更に長い、例えば、少なくとも５０、１００、２５０、５００又は１０００ヌクレオチド長を含み得る。クローン集団は、単一の標的核酸又は鋳型核酸に由来し得る。典型的には、クローン集団中の全ての核酸は、同じヌクレオチド配列を有する。クロナリティーから逸脱することなく、少数の変異（例えば、増幅アーチファクトによる）が生じ得ることが理解されよう。 As used herein, the term "clonal population" refers to a population of nucleic acids that are homogeneous with respect to a particular nucleotide sequence. Homogeneous sequences are typically at least 10 nucleotides in length but may be longer, e.g., at least 50, 100, 250, 500, or 1000 nucleotides in length. A clonal population can be derived from a single target or template nucleic acid. Typically, all nucleic acids in a clonal population have the same nucleotide sequence. It will be understood that minor mutations (e.g., due to amplification artifacts) can occur without deviating from clonality.

本明細書で使用するとき、用語「それぞれ」は、項目の集合に関して使用する場合、集合内の個々の項目を識別することを意図しているが、文脈が明確に別段の指示をしない限り、必ずしも集合内の全ての項目を指すものではない。 As used herein, the term "each," when used in reference to a set of items, is intended to identify each individual item in the set, but does not necessarily refer to every item in the set, unless the context clearly dictates otherwise.

本明細書及び添付の特許請求の範囲で使用される場合、「又は」という用語は、内容が別途明確に指示されない限り、「及び／又は」を含む意味で一般に用いられる。用語「及び／又は」は、列挙された要素の１つ若しくは全て、又は列挙された要素のうちの任意の２つ以上の組み合わせを意味する。場合によっては、「及び／又は」の使用は、他の例では「又は」の使用が「及び／又は」を意味し得ないことを意味しない。 As used in this specification and the appended claims, the term "or" is generally used in its sense including "and/or" unless the context clearly dictates otherwise. The term "and/or" means one or all of the listed elements, or a combination of any two or more of the listed elements. The use of "and/or" in some instances does not imply that the use of "or" cannot mean "and/or" in other instances.

「好ましい」及び「好ましくは」という語は、特定の状況下で特定の利益をもたらし得る本開示の実施形態を指す。しかしながら、同じ又は他の状況下で、他の実施形態が好ましい場合もある。更に、１つ以上の好ましい実施形態の記載は、その他の実施形態が有用でないことを示唆するものではなく、本開示の範囲から他の実施形態を除外することを意図するものではない。 The words "preferred" and "preferably" refer to embodiments of the present disclosure that may offer certain benefits, under particular circumstances. However, other embodiments may also be preferred, under the same or other circumstances. Furthermore, the recitation of one or more preferred embodiments does not imply that other embodiments are not useful, and is not intended to exclude other embodiments from the scope of the present disclosure.

本明細書で使用される場合、「有する（have）」、「有する（has）」、「有している（having）」、「含む（include）」、「含む（includes）」、「含んでいる（including）」、「含む（comprise）」、「含む（comprises）」、「含んでいる（comprising）」などは、制約のない包括的な意味で用いられ、一般に、「含む（include）が、これらに限定されない」、「含む（includes）が、これらに限定されない」、又は「含んでいる（including）が、これらに限定されない」ことを意味する。 As used herein, the words "have," "has," "having," "include," "includes," "including," "comprise," "comprises," "comprising," and the like are used in an open-ended, inclusive sense and generally mean "include, but not limited to," "includes, but not limited to," or "including, but not limited to."

本明細書で、「有する（have）」、「有する（has）」、「有している（having）」、「含む（include）」、「含む（includes）」、「含んでいる（including）」、「含む（comprise）」、「含む（comprises）」、「含んでいる（comprising）」などの語で本明細書に記載されている場合、さもなければ「からなる（consisting of）」及び／又は「から本質的になる（consisting essentially of）」という用語で説明される類似の実施形態もまた提供されることが理解される。「からなる」という用語は、「からなる」という句に続くものを含むことを意味する。すなわち、「からなる」は、列挙された要素が必要とされるか又は必須であり、他の要素が存在し得ないことを示す。「から本質的になる」という用語は、語句の後に列挙されたいずれの要素も含まれ、それらの要素が列挙された要素の開示において明記された活動又は作用に干渉しないか、又は寄与しない限り、列挙されたもの以外の他の要素が含まれ得ることを示す。 When anything is described herein using words such as "have," "has," "having," "include," "includes," "including," "comprise," "comprises," "comprising," and the like, it is understood that similar embodiments described using the terms "consisting of" and/or "consisting essentially of" are also provided. The term "consisting of" is intended to include what follows the phrase "consisting of." That is, "consisting" indicates that the listed elements are required or essential, and that no other elements may be present. The term "consisting essentially of" indicates that any elements listed after the phrase are included, and that other elements other than those listed may be included, so long as those elements do not interfere with or contribute to the activity or function specified in the disclosure of the listed elements.

別途記載のない限り、「ａ」、「ａｎ」、「ｔｈｅ」、及び「少なくとも１つ」は、交換可能に使用され、１つ又は２つ以上を意味する。 Unless otherwise noted, "a," "an," "the," and "at least one" are used interchangeably and mean one or more than one.

発生する事象に「好適」である条件、又は「好適な」条件は、そのような事象が発生することを妨げない条件である。したがって、これらの条件は、事象を可能にし、強化し、促進する、及び／又はそれの助けとなる。 Conditions that are "favorable" or "favorable" for an event to occur are conditions that do not prevent such an event from occurring. Thus, these conditions enable, enhance, facilitate, and/or are conducive to the event.

本明細書で使用する場合、例えば、組成物若しくは核酸の文脈における「提供する」とは、組成物若しくは核酸を作製すること、組成物若しくは核酸を購入すること、又はさもなければ化合物若しくは核酸を得ることを意味する。 As used herein, "providing," for example, in the context of a composition or nucleic acid, means making the composition or nucleic acid, purchasing the composition or nucleic acid, or otherwise obtaining the compound or nucleic acid.

「一実施形態」、「実施形態」、「特定の実施形態」、又は「いくつかの実施形態」などへの言及は、本実施形態に関連して説明される特定の特徴、構成、組成、又は特性が、本開示の少なくとも１つの実施形態に含まれることを意味する。したがって、本明細書全体を通して様々な場所でのこのような語句の出現は、必ずしも本開示の同じ実施形態を指すものではない。更に、特定の特徴、構成、組成、又は特性は、１つ以上の実施形態において任意の好適な方法で組み合わされてもよい。 References to "one embodiment," "embodiment," "particular embodiment," or "some embodiments" mean that the particular feature, configuration, composition, or characteristic described in connection with the embodiment is included in at least one embodiment of the present disclosure. Thus, the appearances of such phrases in various places throughout this specification do not necessarily refer to the same embodiment of the present disclosure. Furthermore, the particular features, configurations, compositions, or characteristics may be combined in any suitable manner in one or more embodiments.

本開示の様々な態様は、範囲形式で提示され得る。範囲形式での説明は、単に便宜上及び簡潔さのためのものであり、本開示の範囲に対する柔軟性がない限定として解釈されるべきではないことを理解されたい。したがって、範囲の説明は、その範囲内の全ての可能な部分的な範囲並びに個々の数値を具体的に開示しているとみなされるべきである。例えば、１～６などの範囲の説明は、１～３、１～４、１～５、２～４、２～６、３～６など、並びにその範囲内の個々の数、例えば、１、２、２．７．３、４、５、５．３、及び６などの部分的な範囲を具体的に開示しているとみなされるべきである。これは、範囲の幅に関係なく適用される。 Various aspects of the present disclosure may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the disclosure. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as 1 to 6 should be considered to have specifically disclosed subranges such as 1 to 3, 1 to 4, 1 to 5, 2 to 4, 2 to 6, 3 to 6, etc., as well as individual numbers within that range, for example, 1, 2, 2, 7, 3, 4, 5, 5, 3, and 6. This applies regardless of the breadth of the range.

別個の工程を含む本明細書に開示される任意の方法では、工程は、任意の実行可能な順序で行われてもよい。また、適切には、２つ以上の工程の任意の組み合わせを同時に行うことができる。
本開示の例示的な実施形態の以下の詳細な説明は、以下の図面と併せて読むと、最も良く理解され得る。 In any method disclosed herein that includes separate steps, the steps may be performed in any practicable order, and, suitably, any combination of two or more steps may be performed simultaneously.
The following detailed description of the exemplary embodiments of the present disclosure can be best understood when read in conjunction with the following drawings.

本開示によるシークエンシングのためのライブラリーを生成する一実施形態の一般的な例示的方法の一般的なブロック図を示す。FIG. 1 shows a general block diagram of one embodiment of a general exemplary method for generating a library for sequencing according to the present disclosure. Ａ～Ｄは、本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称に変換する実施形態の概略図を示す。簡略化するために、１つの標的核酸のみを示す。1A-D show schematic diagrams of embodiments for converting a target nucleic acid from symmetric to asymmetric according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. Ａ～Ｄは、本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称に変換し、別のアダプターを付加する実施形態の概略図を示す。簡略化するために、１つの標的核酸のみを示す。1A-D show schematic diagrams of embodiments of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称変換し、別のアダプターを付加する実施形態の模式図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称変換し、別のアダプターを付加する実施形態の模式図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称変換し、別のアダプターを付加する実施形態の模式図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称変換し、別のアダプターを付加する実施形態の模式図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称変換し、別のアダプターを付加する実施形態の模式図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、標的核酸を対称から非対称変換し、別のアダプターを付加する実施形態の模式図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting a target nucleic acid from symmetric to asymmetric and adding another adapter according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本開示による、単一細胞コンビナトリアルインデックス付けのための一般的な例示的方法の一般的なブロック図を示す。FIG. 1 shows a general block diagram of a general exemplary method for single-cell combinatorial indexing according to the present disclosure. 本明細書に提示される本開示の様々な態様による、全細胞ゲノムＤＮＡを対称標的核酸に変換し、次いで非対称標的核酸（ｓ３－ＷＧＳ）に変換する実施形態の概略図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting total cellular genomic DNA into a symmetric target nucleic acid and then into an asymmetric target nucleic acid (s3-WGS) according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、アクセス可能なゲノムＤＮＡを対称標的核酸に、次いで非対称標的核酸（ｓ３－ＡＴＡＣ）に変換する実施形態の概略図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of converting accessible genomic DNA into a symmetric target nucleic acid and then an asymmetric target nucleic acid (s3-ATAC) according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. ＤＮＡへのｍＲＮＡ核酸のプロセッシングし、次いで非対称核酸の３つの集団をもたらすためのその後プロセッシングの実施形態の概略図を示す。簡略化するために、１つのｍＲＮＡ核酸のみを示す。1 shows a schematic diagram of an embodiment of the processing of an mRNA nucleic acid to DNA and then subsequent processing to result in three populations of asymmetric nucleic acids. For simplicity, only one mRNA nucleic acid is shown. ＤＮＡへのｍＲＮＡ核酸のプロセッシングし、次いで非対称核酸の３つの集団をもたらすためのその後プロセッシングの実施形態の概略図を示す。簡略化するために、１つのｍＲＮＡ核酸のみを示す。1 shows a schematic diagram of an embodiment of the processing of an mRNA nucleic acid to DNA and subsequent processing to result in three populations of asymmetric nucleic acids. For simplicity, only one mRNA nucleic acid is shown. ＤＮＡへのｍＲＮＡ核酸のプロセッシングし、次いで非対称核酸の３つの集団をもたらすためのその後プロセッシングの実施形態の概略図を示す。簡略化するために、１つのｍＲＮＡ核酸のみを示す。1 shows a schematic diagram of an embodiment of the processing of an mRNA nucleic acid to DNA and then subsequent processing to result in three populations of asymmetric nucleic acids. For simplicity, only one mRNA nucleic acid is shown. ＤＮＡへのｍＲＮＡ核酸のプロセッシングし、次いで非対称核酸の３つの集団をもたらすためのその後プロセッシングの実施形態の概略図を示す。簡略化するために、１つのｍＲＮＡ核酸のみを示す。1 shows a schematic diagram of an embodiment of the processing of an mRNA nucleic acid to DNA and subsequent processing to result in three populations of asymmetric nucleic acids. For simplicity, only one mRNA nucleic acid is shown. 本明細書に提示される本開示の様々な態様による、全細胞ゲノムＤＮＡを対称標的核酸に、次いで非対称標的核酸（ｓ３－ＧＣＣ）に変換する同時アッセイの実施形態の概略図を示す。簡略化するために、１つの標的核酸のみを示す。1 shows a schematic diagram of an embodiment of a simultaneous assay for converting total cellular genomic DNA into a symmetric target nucleic acid and then into an asymmetric target nucleic acid (s3-GCC) according to various aspects of the disclosure presented herein. For simplicity, only one target nucleic acid is shown. プレートベースのコンビナトリアルインデックス付けのためのプロトコルの実施形態の模式図を示す。FIG. 1 shows a schematic of an embodiment of a protocol for plate-based combinatorial indexing. ＤＮＡ損傷のサイズのライブラリー生成に対する影響を示す。The effect of DNA lesion size on library generation is shown. 改変ヌクレオチドのアダプターを付加するための伸長に対する影響を示す。1 shows the effect of modified nucleotides on extension to add adapters. 第２の伸長を増強する改変ヌクレオチドを示す。Modified nucleotides that enhance the second extension are shown. アニーリング温度の効果を示す。The effect of annealing temperature is shown. フラッシュ凍結ヒト皮質及びマウス全脳試料由来の核を使用して複数の９６ウェルプレートのタグメンテーション及びＰＣＲの両段階でのインデックス付けを示すバーンヤード実験の実験レイアウトを示す。1 shows the experimental layout of a barnyard experiment demonstrating indexing at both the tagmentation and PCR stages of multiple 96-well plates using nuclei from flash-frozen human cortex and mouse whole brain samples. １細胞当たりの固有の読み取りによって投影されたライブラリーの複雑さの箱ひげ図を示す。ｓ３ＡＴＡＣは、予測された固有のライブラリー分子に基づいて、フラッシュ凍結マウス皮質上の全ての他の公開された単一細胞ＡＴＡＣ配列ライブラリーより優れている。Boxplots of projected library complexity by unique reads per cell are shown. s3ATAC outperforms all other published single-cell ATAC sequence libraries on flash-frozen mouse cortex based on predicted unique library molecules. 「真のバーンヤード」（左、混合種タグメンテーションウェル）及びＰＣＲバーンヤード（右；ＰＣＲ段階で混合された種）における１細胞当たりのヒト及びマウス読み取りの比較を示し、ライブラリー分子の細胞間交換をほとんど又は全く示さない。真のバーンヤード内の５．１２％のインデックス衝突率は、許容衝突率について１ウェル当たりの最適な１５核を示唆している。Comparison of human and mouse reads per cell in a "true barnyard" (left, mixed-species tagmentation wells) and a PCR barnyard (right; species mixed at the PCR stage) shows little to no cell-to-cell exchange of library molecules. The 5.12% index collision rate in the true barnyard suggests an optimal 15 nuclei per well for acceptable collision rates. ヒト核のＵＭＡＰ投影を示す。1 shows a UMAP projection of the human nucleus. 皮質内の肉眼的細胞型のカノニカルマーカーが異なる細胞集団を明らかにすることを示す。We show that canonical markers of macroscopic cell types within the cortex reveal distinct cell populations. 皮質内の肉眼的細胞型のカノニカルマーカーが異なる細胞集団を明らかにすることを示す。We show that canonical markers of macroscopic cell types within the cortex reveal distinct cell populations. 皮質内の肉眼的細胞型のカノニカルマーカーが異なる細胞集団を明らかにすることを示す。We show that canonical markers of macroscopic cell types within the cortex reveal distinct cell populations. マウス核のＵＭＡＰ投影を示す。UMAP projections of mouse nuclei are shown. マウス脳内の肉眼的細胞型のカノニカルマーカーが異なる細胞集団を明らかにすることを示す。We show that canonical markers of macroscopic cell types in the mouse brain reveal distinct cell populations. マウス脳内の肉眼的細胞型のカノニカルマーカーが異なる細胞集団を明らかにすることを示す。We show that canonical markers of macroscopic cell types in the mouse brain reveal distinct cell populations. マウス脳内の肉眼的細胞型のカノニカルマーカーが異なる細胞集団を明らかにすることを示す。We show that canonical markers of macroscopic cell types in the mouse brain reveal distinct cell populations. 複数の９６ウェルプレートのタグメンテーション及びＰＣＲ段階の両方でのインデックス付けを示すｓ３－ＷＧＳライブラリーを生成するためのＰＤＡＣ低継代の患者由来系統の実験レイアウトを示す。1 shows the experimental layout of PDAC low-passage patient-derived lines to generate s3-WGS libraries showing indexing at both the tagmentation and PCR stages of multiple 96-well plates. １細胞当たりの固有の読み取り、並びにライブラリー飽和への投影によって測定されたライブラリーの複雑さの箱ひげ図を示す。Boxplots of library complexity as measured by unique reads per cell as well as projection onto library saturation are shown. ビンにわたる不偏のゲノムカバレッジについての平均絶対偏差（mean absolute deviation、ＭＡＤ）スコアの箱ひげ図を示す。キーは、図２３から続く。Figure 2 shows a boxplot of mean absolute deviation (MAD) scores for unbiased genome coverage across bins. Key continued from Figure 23. ｓ３－ＧＣＣライブラリーの生成のためのＰＤＡＣ低継代の患者由来系統の実験レイアウトを示し、複数の９６ウェルプレートのタグメンテーション及びＰＣＲ段階の両方でのインデックス付けを示す。1 shows the experimental layout of PDAC low-passage patient-derived lines for the generation of s3-GCC libraries, demonstrating indexing at both the tagmentation and PCR stages of multiple 96-well plates. １細胞当たりの固有の読み取り、並びに５０％及び９５％のライブラリー飽和への投影によって測定されたライブラリーの複雑さの箱ひげ図を示す。上部：総読み取り、中央：遠位（＞１ｋｂｐマッピング）染色体内読み取り、下部：染色体にマッピングされた読み取り。Boxplots of library complexity measured by unique reads per cell and projection to 50% and 95% library saturation are shown. Top: total reads, middle: distal (>1 kbp mapping) intrachromosomal reads, bottom: reads mapped to chromosomes. 遠位領域捕捉を示すマッピングされた読み取り長分布の密度プロットを示す。10 shows a density plot of the mapped read length distribution showing distal region capture. 共有トポロジードメイン上の単一細胞ＧＣＣライブラリーのクラスタリングを示す。細胞株（左）及びＫ平均は、定義されたクラスター（右）を意味する。Clustering of single-cell GCC libraries on shared topology domains: cell lines (left) and K-means defined clusters (right). トランスポゾンの第１の鎖の例示的なヌクレオチド配列、第２のインデックス配列、Ｐ５、ｉ５、Ｐ７、ｉ７、ＭＥ、Ａ１４、及びＢ１５を含むオリゴヌクレオチドを示す（それぞれ配列番号１～９）。Exemplary nucleotide sequences of the first strand of the transposon, the second index sequence, and oligonucleotides containing P5, i5, P7, i7, ME, A14, and B15 are shown (SEQ ID NOs: 1-9, respectively).

概略図は必ずしも縮尺通りではない。図面に使用される同様の数字は、同様の構成要素、工程などを指す。しかしながら、所与の図の構成要素を指すための数字の使用は、同じ数字でラベル付けされた別の図における構成要素を制限することを意図していないことが理解されるであろう。更に、構成要素を指すために異なる番号を使用することは、異なる番号の構成要素が他の番号付けされた構成要素と同じ又は類似であることができないことを示すことを意図するものではない。 Schematic drawings are not necessarily drawn to scale. Like numbers used in the drawings refer to like components, steps, etc. However, it will be understood that the use of a number to refer to a component in a given figure is not intended to limit the component in another figure labeled with the same number. Furthermore, the use of different numbers to refer to a component is not intended to indicate that the differently numbered component may not be the same as or similar to the other numbered component.

本明細書では、核酸のシークエンシング及び／又はアッセイを実施することに関連する方法、組成物、システム、及びキットが提示される。本開示は、シークエンシングライブラリーに存在する標的核酸の数を著しく増加させる方法を提供する。図１は、本方法の１つの例示的な実施形態の一般的な概要を示す。この例示的な実施形態では、方法は、本明細書で対称アダプターを有する標的核酸を指す、各末端に同じアダプターを含むように修飾された標的核酸を提供することを含む（図１、ブロック１０）。標的核酸の供給源は、限定することを意図するものではなく、標的核酸は、ＤＮＡ又はＤＮＡに変換されたＲＮＡに由来し得る。同様に、標的核酸の末端にアダプターを付加するために使用される方法は、限定することを意図するものではないが、例えば、転位、断片化に続いてライゲーション、ライゲーション、又は伸長及びライゲーションを含むことができる。この方法は更に、対称アダプターのうちの１つを修飾し、対称修飾標的核酸を非対称修飾標的核酸（図１、ブロック１２）、各末端に異なるアダプターを含む標的核酸に変換することを含む。アダプターは、インデックス配列、ＵＭＩ、ユニバーサル配列、及び／又はプライマーに由来する配列を含み得る。任意選択的に、非対称標的核酸を増幅することができる（図１、ブロック１４）。非対称標的核酸の増幅は、１つ以上のインデックス配列、ＵＭＩ配列、ユニバーサル配列、又はプライマー由来の配列を含むがこれらに限定されない他の有用な配列を、一方又は両方の末端に付加することを含み得る。 Methods, compositions, systems, and kits related to performing nucleic acid sequencing and/or assays are presented herein. The present disclosure provides a method for significantly increasing the number of target nucleic acids present in a sequencing library. Figure 1 shows a general overview of one exemplary embodiment of the method. In this exemplary embodiment, the method includes providing a target nucleic acid modified to contain identical adapters at each end, referred to herein as a target nucleic acid with symmetric adapters (Figure 1, Block 10). The source of the target nucleic acid is not intended to be limiting, and the target nucleic acid can be derived from DNA or RNA converted to DNA. Similarly, the method used to add adapters to the ends of the target nucleic acid is not intended to be limiting, and can include, for example, transposition, fragmentation followed by ligation, ligation, or extension and ligation. The method further includes modifying one of the symmetric adapters to convert the symmetrically modified target nucleic acid to an asymmetrically modified target nucleic acid (Figure 1, Block 12), a target nucleic acid containing different adapters at each end. The adapters may include an index sequence, a UMI, a universal sequence, and/or a sequence derived from a primer. Optionally, the asymmetric target nucleic acid can be amplified ( FIG. 1 , block 14). Amplification of the asymmetric target nucleic acid can include adding one or more index sequences, UMI sequences, universal sequences, or other useful sequences, including but not limited to, sequences derived from primers, to one or both ends.

本発明者らは、驚くべきかつ予想外に、修飾標的核酸を、対称標的核酸の非対称標的核酸への変換中、非対称修飾標的核酸の収率を理論的最大収率近くに大幅に増加させる条件に曝露し得ることを観察した。これは、標的核酸の任意の供給源で使用することができ、限定された入力一次核酸を使用する方法を含む、高効率ライブラリー生成が有利である方法に特に有用である。任意のシークエンシングライブラリー方法は、全ゲノムシークエンシング、標的シークエンシング、メチル化シークエンシング、ゲノム立体配座捕捉（genomic conformation capture、ＧＣＣ）、例えば、ＨｉＣ、クロマチン立体配座など、単一細胞アッセイ、単一細胞コンビナトリアルインデックス付け、ＲＮＡ－ｓｅｑ方法とＡＴＡＣ－ｓｅｑ方法、同時アッセイ、例えば、ＤＮＡとＲＮＡ、供給源が無細胞ＤＮＡ若しくはＲＮＡである実施形態、リキッドバイオプシーを含むが、これらに限定されない、高効率生成に利益を得ることができる。高効率変換アッセイはまた、分析物の存在を検出する際にも有用であり、例えば、感度を増加させる。検出又はスクリーニングアッセイの例は、ＰＣＲ、ｑＰＣＲ、デジタルＰＣＲ、ＤＮＡ若しくはＲＮＡ若しくは抗体若しくはタンパク質検出アッセイ、又は一般的な分析物検出アッセイがあるが、これらに限定されない。分析物の例としては、ＤＮＡ、ＲＮＡ、及びタンパク質が挙げられるが、これらに限定されない。 The inventors have surprisingly and unexpectedly observed that modified target nucleic acids can be exposed to conditions during conversion of symmetric target nucleic acids to asymmetric target nucleic acids that significantly increase the yield of asymmetrically modified target nucleic acids to near the theoretical maximum yield. This can be used with any source of target nucleic acid and is particularly useful for methods where high-efficiency library generation is advantageous, including methods using limited input primary nucleic acids. Any sequencing library method can benefit from high-efficiency generation, including, but not limited to, whole genome sequencing, targeted sequencing, methylation sequencing, genomic conformation capture (GCC), e.g., HiC, chromatin conformation, single-cell assays, single-cell combinatorial indexing, RNA-seq and ATAC-seq methods, simultaneous assays, e.g., DNA and RNA, embodiments in which the source is cell-free DNA or RNA, and liquid biopsies. High-efficiency conversion assays are also useful in detecting the presence of analytes, e.g., increasing sensitivity. Examples of detection or screening assays include, but are not limited to, PCR, qPCR, digital PCR, DNA or RNA or antibody or protein detection assays, or general analyte detection assays. Examples of analytes include, but are not limited to, DNA, RNA, and proteins.

標的核酸
本明細書で提供される方法、組成物、システム、及びキットに使用される標的核酸は、典型的には、試料中に存在する一次核酸に由来する。一次核酸は、試料からの二本鎖ＤＮＡ（double-stranded DNA、ｄｓＤＮＡ）形態（例えば、ゲノムＤＮＡ断片、増幅生成物など）に由来し得るか、又はＤＮＡ若しくはＲＮＡとして試料からの一本鎖形態に由来し得、ＤｓＤＮＡ形態に変換され得る。例として、本明細書に記載の方法中に、当技術分野で既知の標準的な技術を使用して、ｍＲＮＡ分子を二本鎖ｃＤＮＡにコピーすることができる。一次核酸試料からのポリヌクレオチド分子の正確な配列は、一般に、本開示にとって重要ではなく、既知又は不明であり得る。 Target Nucleic Acids The target nucleic acids used in the methods, compositions, systems, and kits provided herein are typically derived from primary nucleic acids present in a sample. The primary nucleic acids may be derived from the sample in double-stranded DNA (dsDNA) form (e.g., genomic DNA fragments, amplification products, etc.), or may be derived from the sample in single-stranded form as DNA or RNA and converted to dsDNA form. By way of example, during the methods described herein, mRNA molecules can be copied into double-stranded cDNA using standard techniques known in the art. The exact sequence of polynucleotide molecules from a primary nucleic acid sample is generally not critical to the present disclosure and may be known or unknown.

一実施形態では、一次核酸は、ＤＮＡ分子を含む。一次核酸分子は、生物の遺伝子相補体全体、例えば、イントロン及びエクソン配列の両方、並びにプロモーター及びエンハンサー配列などの非コード調節配列を含むゲノムＤＮＡ分子を表し得る。一実施形態では、例えば、特定の染色体、オープンクロマチンに関連するＤＮＡ、クローズドクロマチンに関連するＤＮＡ、又は特定の遺伝子の領域（例えば、標的シークエンシング）などの１つ以上の特定の配列などのゲノムＤＮＡの特定のサブセットを使用することができる。 In one embodiment, the primary nucleic acid comprises a DNA molecule. The primary nucleic acid molecule may represent an organism's entire gene complement, e.g., a genomic DNA molecule including both intron and exon sequences, as well as non-coding regulatory sequences such as promoter and enhancer sequences. In one embodiment, a specific subset of genomic DNA can be used, e.g., one or more specific sequences, such as a particular chromosome, DNA associated with open chromatin, DNA associated with closed chromatin, or a region of a particular gene (e.g., targeted sequencing).

一実施形態では、一次核酸は、ＲＮＡ分子を含む。一次核酸分子は、トランスクリプトーム全体又は試料の１つの細胞若しくは複数の細胞、例えば、ｍＲＮＡ分子を表し得る。一次核酸分子は、試料の１つの細胞若しくは複数の細胞、例えば、マイクロＲＮＡ又は低分子干渉ＲＮＡの非コードＲＮＡを表し得る。一実施形態では、例えば、特定の遺伝子によってコードされる領域などの１つ以上の特定の配列などのＲＮＡ分子の特定のサブセットを使用することができる。 In one embodiment, the primary nucleic acid comprises an RNA molecule. The primary nucleic acid molecule may represent the entire transcriptome or a single cell or multiple cells of the sample, e.g., mRNA molecules. The primary nucleic acid molecule may represent a non-coding RNA of a single cell or multiple cells of the sample, e.g., microRNA or small interfering RNA. In one embodiment, a specific subset of RNA molecules may be used, e.g., one or more specific sequences, e.g., regions encoded by specific genes.

試料は、生検、腫瘍、擦過物、スワブ、血液、粘液、尿、血漿、精液、毛髪、レーザ捕捉顕微解剖、外科的切除、及び他の臨床的に又は実験室で得られた試料から得られた核酸分子を含み得る。いくつかの実施態様では、試料は、疫学、農業、法医学又は病原性の試料であり得る。いくつかの実施形態では、試料は、培養細胞を含み得る。いくつかの実施態様では、試料は、ヒト又は哺乳動物源などの動物から得られた核酸分子を含むことができる。別の実施態様では、試料は、植物、細菌、ウイルス又は真菌などの非哺乳類源から得られた核酸分子を含むことができる。いくつかの実施態様において、核酸分子の供給源は、保存された又は絶滅した試料若しくは種であり得る。 Samples may include nucleic acid molecules obtained from biopsies, tumors, scrapings, swabs, blood, mucus, urine, plasma, semen, hair, laser capture microdissections, surgical resections, and other clinically or laboratory-derived samples. In some embodiments, samples may be epidemiological, agricultural, forensic, or pathogenic samples. In some embodiments, samples may include cultured cells. In some embodiments, samples may include nucleic acid molecules obtained from animals, such as humans or mammalian sources. In other embodiments, samples may include nucleic acid molecules obtained from non-mammalian sources, such as plants, bacteria, viruses, or fungi. In some embodiments, the source of the nucleic acid molecules may be preserved or extinct samples or species.

更に、本明細書に開示される方法、組成物、システム、及びキットは、法医学試料からの分解及び／又は断片化されたゲノムＤＮＡなどの低品質核酸分子を有する核酸試料を増幅するのに有用であり得る。一実施態様では、法医学試料は、犯罪現場から得られた核酸、行方不明者ＤＮＡデータベースから得られた核酸、法医学調査と関連した研究所から得られた核酸を含み得る、又は法執行機関、１つ以上のミリタリーサービス若しくはそのような隊員によって得られた法医学試料を含むことができる。核酸試料は、唾液、血液、若しくは他の体液で含浸され得る、例えば、口腔スワブ、紙、布、若しくは他の基質に由来する核酸を含む、精製された試料又は粗溶解物であり得る。したがって、いくつかの実施態様では、核酸試料は、ゲノムＤＮＡのような、少量のＤＮＡ又は断片化されたＤＮＡの部分を含み得る。いくつかの実施態様では、標的核酸は、血液、痰、血漿、精液、尿及び血清を含む１つ以上の体液に存在し得るが、これに限定されるものではない。いくつかの実施態様では、標的配列は、犠牲者の毛髪、皮膚、組織試料、剖検又は遺体から得ることができる。いくつかの実施態様では、１つ以上の標的配列を含む核酸は、死亡した動物又はヒトから得ることができる。いくつかの実施態様では、標的配列は、微生物、植物又は昆虫学的ＤＮＡなど非ヒトＤＮＡから得られた核酸を含むことができる。いくつかの実施形態では、標的配列は、法医学試料などの人物同定の目的に関する。 Additionally, the methods, compositions, systems, and kits disclosed herein may be useful for amplifying nucleic acid samples having low-quality nucleic acid molecules, such as degraded and/or fragmented genomic DNA from forensic samples. In one embodiment, the forensic sample may include nucleic acid obtained from a crime scene, from a missing persons DNA database, from a laboratory associated with a forensic investigation, or may include forensic samples obtained by law enforcement agencies, one or more military services, or personnel thereof. The nucleic acid sample may be a purified sample or crude lysate containing nucleic acid from, for example, a buccal swab, paper, cloth, or other substrate impregnated with saliva, blood, or other bodily fluid. Thus, in some embodiments, the nucleic acid sample may contain small amounts of DNA or fragmented portions of DNA, such as genomic DNA. In some embodiments, the target nucleic acid may be present in one or more bodily fluids, including, but not limited to, blood, sputum, plasma, semen, urine, and serum. In some embodiments, the target sequence may be obtained from a victim's hair, skin, tissue sample, autopsy, or corpse. In some embodiments, nucleic acids comprising one or more target sequences can be obtained from a deceased animal or human. In some embodiments, the target sequences can include nucleic acids obtained from non-human DNA, such as microbial, plant, or entomological DNA. In some embodiments, the target sequences are for purposes of human identification, such as forensic samples.

生物学的試料の供給源の更なる非限定的な例には、生物全体、並びに患者から得られた試料が含まれ得る。生体試料は、任意の生体液又は組織から得ることができ、液体流体と組織、固体組織、並びに乾燥、凍結、及び固定形態などの保存形態を含む様々な形態であり得る。試料は、任意の生物学的組織、細胞、又は体液のものであり得る。そのような試料には、痰、血液、血清、血漿、血球（例えば、白血球）、腹水、尿、唾液、涙、痰、膣液、唾液、涙液、唾液、膣液、唾液、涙液、唾液、膣液（分泌物）、医療処置中に得られた洗浄液（例えば、生検、内視鏡検査又は外科手術中に得られる骨盤若しくは他の洗浄液）、組織、乳頭吸引物、コア若しくは細針生検試料、細胞含有体液、腹水、及び胸膜液、又はそれらからの細胞、並びに無細胞循環ＤＮＡなどの遊離浮遊核酸が含まれるが、これらに限定されない。生体試料はまた、組織学的目的若しくは微小解剖された細胞若しくはその細胞外部分のために採取された凍結又は固定切片などの組織の切片を含み得る。いくつかの実施形態では、試料は、例えば全血試料などの血液試料であり得る。別の例では、試料は、未処理の乾燥血液スポット（dried blood spot、ＤＢＳ）試料である。更に別の例では、試料は、ホルマリン固定パラフィン包埋（formalin-fixed paraffin-embedded、ＦＦＰＥ）試料である。更に別の例では、試料は唾液試料である。更に別の例では、試料は、乾燥唾液スポット（dried saliva spot、ＤＳＳ）試料である。 Further non-limiting examples of sources of biological samples include whole organisms, as well as samples obtained from patients. Biological samples can be obtained from any biological fluid or tissue and can be in a variety of forms, including liquid fluids and tissues, solid tissues, and preserved forms such as dried, frozen, and fixed forms. Samples can be of any biological tissue, cell, or bodily fluid. Such samples include, but are not limited to, sputum, blood, serum, plasma, blood cells (e.g., white blood cells), ascites, urine, saliva, tears, sputum, vaginal fluid, lavage fluid obtained during a medical procedure (e.g., pelvic or other lavage fluid obtained during biopsy, endoscopy, or surgery), tissue, nipple aspirate, core or fine needle biopsy sample, cell-containing bodily fluids, peritoneal fluid, and pleural fluid, or cells therefrom, as well as free-floating nucleic acids such as cell-free circulating DNA. Biological samples may also include sections of tissue, such as frozen or fixed sections taken for histological purposes or microdissected cells or their extracellular portions. In some embodiments, the sample may be a blood sample, such as a whole blood sample. In another example, the sample is an unprocessed dried blood spot (DBS) sample. In yet another example, the sample is a formalin-fixed paraffin-embedded (FFPE) sample. In yet another example, the sample is a saliva sample. In yet another example, the sample is a dried saliva spot (DSS) sample.

標的核酸が由来し得る例示的な生物学的試料としては、例えば、真核生物、例えば、げっ歯類、マウス、ラット、ウサギ、モルモット、有蹄動物、ウマ、ヒツジ、ブタ、ヤギ、ウシ、ネコ、イヌ、霊長類、ヒト又は非ヒト霊長類などの哺乳動物からのもの、例えば、シロイヌナズナ、トウモロコシ、ソルガム、オート麦、小麦、米、キャノーラ、又は大豆などの植物、Ｃｈｌａｍｙｄｏｍｏｎａｓｒｅｉｎｈａｒｄｔｉｉなどの藻類、Ｃａｅｎｏｒｈａｂｄｉｔｉｓｅｌｅｇａｎｓなどの線虫、キイロショウジョウバエ、蚊、ショウジョウバエ、ミツバチ、又はクモなどの昆虫、ゼブラフィッシュなどの魚、爬虫類、カエル若しくはアフルカツメガエルなどの両生類、Ｄｉｃｔｙｏｓｔｅｌｉｕｍｄｉｓｃｏｉｄｅｕｍ、Ｐｎｅｕｍｏｃｙｓｔｉｓｃａｒｉｎｉｉ、Ｔａｋｉｆｕｇｕｒｕｂｒｉｐｅｓ、酵母、Ｓａｃｃｈａｒａｍｏｙｃｅｓｃｅｒｅｖｉｓｉａｅ、若しくはＳｃｈｉｚｏｓａｃｃｈａｒｏｍｙｃｅｓｐｏｍｂｅなどの真菌、又は熱帯熱マラリア原虫が挙げられる。標的核酸はまた、細菌、Ｅｓｃｈｅｒｉｃｈｉａｃｏｌｉ、ｓｔａｐｈｙｌｏｃｏｃｃｉ若しくはＭｙｃｏｐｌａｓｍａｐｎｅｕｍｏｎｉａｅ若しくはＭｙｃｏｐｌａｓｍａｐｎｅｕｍｏｎｉａｅなどの原核生物、古細菌、Ｃ型肝炎ウイルス若しくはヒト免疫不全ウイルスなどのウイルス、又はビロイドに由来し得る。標的核酸は、本明細書に記載の均質な培養物若しくは生物の集団に由来し得るか、又は代替的に、例えば、群衆若しくは生態系におけるいくつかの異なる生物のコレクションから由来し得る。 Exemplary biological samples from which target nucleic acids can be derived include, for example, those from eukaryotic organisms, e.g., mammals such as rodents, mice, rats, rabbits, guinea pigs, ungulates, horses, sheep, pigs, goats, cows, cats, dogs, primates, humans, or non-human primates; plants such as Arabidopsis thaliana, corn, sorghum, oats, wheat, rice, canola, or soybeans; algae such as Chlamydomonas reinhardtii; nematodes such as Caenorhabditis elegans; insects such as Drosophila melanogaster, mosquitoes, fruit flies, honeybees, or spiders; fish such as zebrafish; reptiles; amphibians such as frogs or Xenopus laevis; Dictyostelium discoideum; Pneumocystis sp. Examples of target nucleic acids include fungi such as Bacillus carinii, Takifugu rubripes, yeast, Saccharomyces cerevisiae, or Schizosaccharomyces pombe, or Plasmodium falciparum. Target nucleic acids can also be derived from bacteria, prokaryotes such as Escherichia coli, staphylococci, or Mycoplasma pneumoniae, archaea, viruses such as hepatitis C virus or human immunodeficiency virus, or viroids. Target nucleic acids can be derived from a homogenous culture or population of organisms as described herein, or alternatively, can be derived from a collection of several different organisms, for example, in a community or ecosystem.

いくつかの実施形態では、試料は、所望の一次核酸を得るために処理される組織を含む。いくつかの実施形態では、細胞を使用して、所望の一次核酸を得る。いくつかの実施形態では、核を使用して、所望の一次核酸を得る。本方法は、細胞を解離させること、及び／又は核を単離することを更に含み得る。組織から細胞及び核を単離するための方法が利用可能である（国際公開第２０１９／２３６５９９号）。 In some embodiments, the sample comprises tissue that is processed to obtain the desired primary nucleic acid. In some embodiments, cells are used to obtain the desired primary nucleic acid. In some embodiments, nuclei are used to obtain the desired primary nucleic acid. The method may further include dissociating the cells and/or isolating the nuclei. Methods for isolating cells and nuclei from tissue are available (WO 2019/236599).

いくつかの実施形態では、組織内、細胞内、又は単離された核内に存在する核酸は、所望の読み出しに応じて処理され得る。例えば、核酸は、プロセッシング中に固定され得、有用な固定方法が利用可能である（国際公開第２０１９／２３６５９９号）。固定は、試料を保存するか、又は試料、細胞、若しくは核からの分析物の連続性を維持するのに有用であり得る。固定方法は、組織、細胞、及び核形態及びアーキテクチャを保存し、安定化し、タンパク質分解酵素を不活性化し、試料、細胞、及び核を強化し、そのためそれらは更なるプロセッシング及び染色に耐えることができ、混入から保護する。固定が有用であり得る方法の例としては、単離された核の全ゲノムシークエンシング及びＨｉ－Ｃなどの染色体立体配座捕捉方法が挙げられるが、これらに限定されない。一般的な固定方法としては、灌流、浸漬、凍結、及び乾燥（Ｓｒｉｎｉｖａｓａｎｅｔａｌ．，ＡｍＪＰａｔｈｏｌ．２００２Ｄｅｃ；１６１（６）：１９６１－１９７１．ｄｏｉ：１０．１０１６／Ｓ０００２－９４４０（１０）６４４７２－０）を含む。 In some embodiments, nucleic acids present in tissues, cells, or isolated nuclei can be processed depending on the desired readout. For example, nucleic acids can be fixed during processing, and useful fixation methods are available (WO 2019/236599). Fixation can be useful for preserving samples or maintaining the continuity of analytes from samples, cells, or nuclei. Fixation methods preserve and stabilize tissue, cell, and nuclear morphology and architecture, inactivate proteolytic enzymes, and strengthen samples, cells, and nuclei so they can withstand further processing and staining and protect against contamination. Examples of methods in which fixation may be useful include, but are not limited to, whole genome sequencing of isolated nuclei and chromosome conformation capture methods such as Hi-C. Common fixation methods include perfusion, immersion, freezing, and desiccation (Srinivasan et al., Am J Pathol. 2002 Dec;161(6):1961-1971.doi:10.1016/S0002-9440(10)64472-0).

全ゲノムシークエンシングなどのいくつかの実施形態では、単離された核を処理して、核を無傷のままにしながら、ヌクレオソームをＤＮＡから解離させ、ヌクレオソームを含まない核を生成するための方法が利用可能である（国際公開第２０１８／０１８００８号）。一実施形態では、界面活性剤ベースのヌクレオソーム法が使用される（実施例２）。染色体立体配座捕捉法などのいくつかの実施形態では、組織内、細胞内、又は単離された核内に存在する核酸は、例えば、制限エンドヌクレアーゼ消化によって断片化され得る。断片化は、本明細書でより詳細に説明される。染色体立体配座捕捉法などのいくつかの実施形態では、組織内、細胞内、又は単離された核内に存在する核酸は、平滑末端ライゲーションなどの近接ベースのライゲーションのための条件に曝露され得る。 In some embodiments, such as whole genome sequencing, methods are available for treating isolated nuclei to dissociate nucleosomes from DNA while leaving the nuclei intact, generating nucleosome-free nuclei (WO 2018/018008). In one embodiment, a detergent-based nucleosome method is used (Example 2). In some embodiments, such as chromosome conformation capture methods, nucleic acids present in tissues, cells, or isolated nuclei can be fragmented, for example, by restriction endonuclease digestion. Fragmentation is described in more detail herein. In some embodiments, such as chromosome conformation capture methods, nucleic acids present in tissues, cells, or isolated nuclei can be exposed to conditions for proximity-based ligation, such as blunt-end ligation.

いくつかの実施形態では、例えば、複数の細胞からのバルク中の一次核酸を使用して、本明細書に記載のシークエンシングライブラリーを生成することができる。他の実施形態では、個々の細胞又は核は、一次核酸の供給源として使用されて、単一細胞及び核から配列情報を得ることができる。多くの異なる単一細胞ライブラリー調製法が、当該技術分野において既知である。（Ｈｗａｎｇｅｔａｌ．Ｅｘｐｅｒｉｍｅｎｔａｌ＆ＭｏｌｅｃｕｌａｒＭｅｄｉｃｉｎｅ，ｖｏｌ．５０，Ａｒｔｉｃｌｅｎｕｍｂｅｒ：９６（２０１８）、Ｄｒｏｐ－ｓｅｑ法、Ｓｅｑ－ｗｅｌｌ法、単一細胞コンビナトリアルインデックス付け（「ｓｃｉ－」）法が挙げられるが、これらに限定されない。単一細胞製品及び関連技術を提供する企業としては、１０ＸＧｅｎｏｍｉｃｓ、Ｔａｋａｒａｂｉｏｓｃｉｅｎｃｅｓ、ＢＤｂｉｏｓｃｉｅｎｃｅｓ、Ｂｉｏｒａｄ、１ｃｅｌｌｂｉｏ、ＩｓｏＰｌｅｘｉｓ、ＣｅｌｌＳｅｅ、ｎａｎｏｓｅｌｅｃｔ、及びＤｏｌｏｍｉｔｅＢｉｏが挙げられるが、これらに限定されない。ＳＣＩ－ｓｅｑは、スプリットプールバーコーディングを用いて多数の単一細胞又は単一核の核酸内容を一意に標識化する、方法論的フレームワークである。一般には、核又は細胞の数は、少なくとも２つであり得る。上限は、本明細書に記載の方法の他の工程で使用される機器の実際の制限（例えば、マルチウェルプレート、インデックスの数）に依存する。使用され得る核又は細胞の数は、限定することを意図するものではなく、数十億に達することがあり得る。例えば、一実施形態では、核又は細胞の数は、１００，０００，０００以下、１０，０００，０００以下、１，０００，０００，０００以下、１００，０００，０００以下、１０，０００，０００以下、１，０００，０００以下、１００，０００以下、１０，０００以下、１，０００以下、５００以下、又は５０以下であり得る。 In some embodiments, for example, primary nucleic acids in bulk from multiple cells can be used to generate the sequencing libraries described herein. In other embodiments, individual cells or nuclei can be used as a source of primary nucleic acids, allowing sequence information to be obtained from single cells and nuclei. Many different single-cell library preparation methods are known in the art. (Hwang et al. Experimental & Molecular Medicine, vol. 50, Article number: 96 (2018)) include, but are not limited to, Drop-seq, Seq-well, and single-cell combinatorial indexing ("sci-") methods. Companies providing single-cell products and related technologies include 10X Genomics, Takara biosciences, BD biosciences, Biorad, 1cellbio, IsoPlexis, CellSee, nanoselect, and Dolomite. Bio. SCI-seq is a methodological framework that uses split-pool barcoding to uniquely label the nucleic acid content of large numbers of single cells or nuclei. Generally, the number of nuclei or cells can be at least two. The upper limit depends on the practical limitations of the equipment used in other steps of the methods described herein (e.g., multi-well plates, number of indexes). The number of nuclei or cells that can be used is not intended to be limiting and can reach the billions. For example, in one embodiment, the number of nuclei or cells can be 100,000,000 or less, 10,000,000 or less, 1,000,000 or less, 10,000,000 or less, 1,000,000 or less, 100,000 or less, 10,000,000 or less, 1,000,000 or less, 500 or less, or 50 or less.

アダプター
本開示の方法は、標的核酸の両端にアダプターを付加することを含み得る。シークエンシングライブラリーを調製する際に使用するための多くのアダプターが既知であり、本質的に任意のアダプターを使用することができる。例えば、アダプターは、一本鎖、二本鎖、又は二本鎖領域及び一本鎖領域を含むことができる。一実施形態では、単鎖及び二本鎖領域の両方を有するアダプターの一本鎖領域は、各末端に相補的な一本鎖領域を有する標的核酸にアダプターを結合するのを助けるために、「付着末端」として使用され得る。一実施形態では、一本鎖及び二本鎖領域の両方を有するアダプターは、フォーク又はミスマッチアダプターとも呼ばれ、その一般的な特徴は既知である（Ｇｏｒｍｌｅｙｅｔａｌ．，米国特許第７，７４１，４６３号；Ｂｉｇｎｅｌｌｅｔａｌ．，米国特許第８，０５３，１９２号）。一実施形態では、アダプターは、トランスポソーム複合体の一部として存在する。トランスポソーム複合体は、本明細書に詳細に記載されている。 Adapters The disclosed methods can include adding adapters to both ends of a target nucleic acid. Many adapters are known for use in preparing sequencing libraries, and essentially any adapter can be used. For example, an adapter can be single-stranded, double-stranded, or contain a double-stranded region and a single-stranded region. In one embodiment, the single-stranded region of an adapter having both a single-stranded and a double-stranded region can be used as a "sticky end" to aid in binding the adapter to a target nucleic acid having a complementary single-stranded region at each end. In one embodiment, an adapter having both a single-stranded and a double-stranded region is also called a forked or mismatch adapter, the general characteristics of which are known (Gormley et al., U.S. Pat. No. 7,741,463; Bignell et al., U.S. Pat. No. 8,053,192). In one embodiment, the adapter is present as part of a transposome complex. Transposome complexes are described in detail herein.

標的核酸の両方の末端に付加するために使用されるアダプターの一方又は両方の末端を修飾して、アダプターと他の核酸との相互作用を改変することができる。一実施形態では、アダプターの一方の３’末端をブロックして、その特定の末端のライゲーション効率の相互作用を低減することができる。一実施形態では、標的核酸の各末端へのアダプター、例えば二本鎖アダプターの付加は、結果として生じる修飾標的核酸の一方の鎖にギャップをもたらす。一実施形態では、ギャップは、少なくとも１つのヌクレオチドである。一実施形態では、ギャップは、標的核酸の３’末端と標的核酸に結合したアダプターの５’末端との間に位置する。 One or both ends of an adapter used to attach to both ends of a target nucleic acid can be modified to alter the interaction of the adapter with other nucleic acids. In one embodiment, one 3' end of the adapter can be blocked to reduce the ligation efficiency of that particular end. In one embodiment, the addition of an adapter, e.g., a double-stranded adapter, to each end of a target nucleic acid results in a gap in one strand of the resulting modified target nucleic acid. In one embodiment, the gap is at least one nucleotide. In one embodiment, the gap is located between the 3' end of the target nucleic acid and the 5' end of the adapter attached to the target nucleic acid.

アダプターは、１つ以上のインデックス配列、１つ以上のＵＭＩ、１つ以上のユニバーサル配列、１つ以上のＤＮＡ損傷、又はそれらの組み合わせを含み得る。本明細書でより詳細に説明するように、アダプター中のインデックス配列の存在は、ｓｃｉベースの用途、試料インデックス付け、又は単一細胞識別を助けることができる。 The adapter may include one or more index sequences, one or more UMIs, one or more universal sequences, one or more DNA lesions, or a combination thereof. As described in more detail herein, the presence of an index sequence in the adapter can aid in Sci-based applications, sample indexing, or single-cell identification.

ＤＮＡ損傷のヌクレオチドは、ＤＮＡ合成中に鋳型としてＤＮＡポリメラーゼによって使用される場合、特定のＤＮＡポリメラーゼが活性を低減させ、ＤＮＡ損傷でＤＮＡ合成を止める、若しくは終了させる構造を有する。このタイプのＤＮＡポリメラーゼは、本明細書では「損傷不耐性ポリメラーゼ」と称される。ＤＮＡ損傷として使用され得るヌクレオチドの例は、当業者に知られており、脱塩基部位、修飾塩基、ミスマッチ、一本鎖切断、又は架橋ヌクレオチドを含むが、これらに限定されない。修飾塩基の例としては、メチル化塩基（例えば、Ｎ３－メチルアデニン、Ｎ７－Ｏ６－メチルグアニン、Ｎ３－メチルシトシン、Ｏ４メチルチミン）、Ｏ６－アルキルグアニン、Ｏ４－アルキルチミン、ヒポキサンチン、キサンチン、及びウラシルが挙げられるが、これらに限定されない。修飾塩基はまた、ＦａｐｙＴＡ、８－オキソ－Ｇ、及びチミングリコールを含むがこれらに限定されない酸化塩基を含み得る。架橋ヌクレオチドの例としては、チミン二量体が挙げられるが、これに限定されない。 DNA lesion nucleotides, when used by DNA polymerases as templates during DNA synthesis, have a structure that causes certain DNA polymerases to reduce their activity and stall or terminate DNA synthesis at the DNA lesion. This type of DNA polymerase is referred to herein as a "damage-intolerant polymerase." Examples of nucleotides that can be used as DNA lesions are known to those of skill in the art and include, but are not limited to, abasic sites, modified bases, mismatches, single-strand breaks, or bridged nucleotides. Examples of modified bases include, but are not limited to, methylated bases (e.g., N3-methyladenine, N7-O6-methylguanine, N3-methylcytosine, O4-methylthymine), O6-alkylguanine, O4-alkylthymine, hypoxanthine, xanthine, and uracil. Modified bases may also include oxidized bases, including, but not limited to, FapyTA, 8-oxo-G, and thymine glycol. Examples of bridged nucleotides include, but are not limited to, thymine dimers.

損傷不耐性ポリメラーゼは、当業者に既知である（Ｈｅｙｎｅｔａｌ．，ＮｕｃｌｅｉｃＡｃｉｄｓＲｅｓ．２０１０Ｓｅｐ；３８（１６）：ｅ１６１、Ｓｉｋｏｒｓｋｙｅｔａｌ．，ＢｉｏｃｈｅｍＢｉｏｐｈｙｓＲｅｓＣｏｍｍｕｎ．２００７Ａｐｒ６；３５５（２）：４３１－４３７、及びＧｒｕｚｅｔａｌ．，ＮｕｃｌｅｉｃＡｃｉｄｓＲｅｓ．２００３－Ｊｕｌ－１５；３１（１４）：４０２４－４０３０）。有用な損傷不耐性ポリメラーゼの例を表１に示す。 Damage-intolerant polymerases are known to those skilled in the art (Heyn et al., Nucleic Acids Res. 2010 Sep; 38(16): e161, Sikorsky et al., Biochem Biophys Res Commun. 2007 Apr 6; 355(2): 431-437, and Gruz et al., Nucleic Acids Res. 2003-Jul-15; 31(14): 4024-4030). Examples of useful damage-intolerant polymerases are listed in Table 1.

本開示の方法は、損傷不耐性ポリメラーゼを使用する工程を含むことができ、鋳型としてＤＮＡ損傷を使用して活性を低減していないＤＮＡポリメラーゼを使用する別の工程も含むことができる。鋳型としてＤＮＡ損傷を使用する場合、活性が低減していないポリメラーゼは、本明細書では「損傷耐性ポリメラーゼ」と称される。損傷耐性ポリメラーゼは、当業者に既知であり、表１に記載されているものを含むが、これらに限定されない。損傷耐性ポリメラーゼの使用は、対称修飾標的核酸の非対称修飾標的核酸への変換中に起こり得、典型的には、得られるアンプリコンにおけるＤＮＡ損傷の喪失をもたらす。変換中の損傷耐性ポリメラーゼの使用は、本明細書に記載されている。 The disclosed methods can include a step of using a damage-intolerant polymerase and can also include another step of using a DNA polymerase that does not have reduced activity using DNA damage as a template. When DNA damage is used as a template, the polymerase that does not have reduced activity is referred to herein as a "damage-tolerant polymerase." Damage-tolerant polymerases are known to those of skill in the art and include, but are not limited to, those listed in Table 1. The use of a damage-tolerant polymerase can occur during the conversion of a symmetrically modified target nucleic acid to an asymmetrically modified target nucleic acid, typically resulting in the loss of DNA damage in the resulting amplicon. The use of a damage-tolerant polymerase during conversion is described herein.

ＤＮＡ損傷は、ＤＮＡポリメラーゼ活性を低減する活性を有する１つ以上のヌクレオチドを含み得る。例えば、ＤＮＡ損傷を構成するヌクレオチドの数は、少なくとも１つ、少なくとも２つ、少なくとも３つ、少なくとも４つ、又は少なくとも５つであり得る。一実施形態では、ＤＮＡ損傷を構成するヌクレオチドの数は、５つ以下、４つ以下、３つ以下、又は２つ以下であり得る。一実施形態では、ＤＮＡ損傷は、２つ、３つ、又は４つのウラシルヌクレオチドである。ＤＮＡ損傷が２つ以上のヌクレオチドを含む場合、ＤＮＡ損傷のヌクレオチドは、典型的には連続している。 The DNA lesion may contain one or more nucleotides that have activity that reduces DNA polymerase activity. For example, the number of nucleotides that make up the DNA lesion may be at least one, at least two, at least three, at least four, or at least five. In one embodiment, the number of nucleotides that make up the DNA lesion may be five or fewer, four or fewer, three or fewer, or two or fewer. In one embodiment, the DNA lesion is two, three, or four uracil nucleotides. When the DNA lesion contains two or more nucleotides, the nucleotides in the DNA lesion are typically contiguous.

ＤＮＡ損傷は、典型的には、標的核酸の各末端に存在するアダプターの１つの鎖に存在する。一実施形態では、アダプターがＤＮＡ損傷を含み、アダプターが標的核酸に結合している１つの鎖にギャップが存在する場合、ＤＮＡ損傷及びギャップは、異なる鎖上に位置する。 The DNA damage is typically present on one strand of the adapters present at each end of the target nucleic acid. In one embodiment, if the adapter contains a DNA damage and a gap is present on one strand where the adapter is attached to the target nucleic acid, the DNA damage and the gap are located on different strands.

アダプターはまた、捕捉剤を含み得る。本明細書で使用する場合、用語「捕捉剤」とは、核酸（例えば、アダプターの鎖）に付着、保持、又は結合することができる材料、化学物質、分子、又はその部分を指す。例示的な捕捉剤としては、受容体－リガンド結合対のメンバーに結合することができる受容体－リガンド結合対（例えば、アビジン、ストレプトアビジン、ビオチン、レクチン、炭水化物、核酸結合タンパク質、エピトープ、抗体など）のメンバー、又は連結部分と共有結合を形成することができる化学試薬が挙げられるが、これらに限定されない。一実施形態では、捕捉剤は、ビオチンである。捕捉剤は、アダプターの鎖に結合することができ、アダプターの末端に結合して、標的核酸へのアダプターの結合を妨げない。例えば、アダプターの５’末端は、捕捉剤を含むことができ、又はアダプターの３’末端が、捕捉剤を含むことができる。一実施形態では、捕捉剤は、トランスポゾンの鎖の５’末端又はトランスポゾンの他の鎖の３’末端に結合している。捕捉剤は、ビーズ又はウェルなどの固体表面にアダプターを付着させるのに有用である。 The adapter may also comprise a capture agent. As used herein, the term "capture agent" refers to a material, chemical, molecule, or portion thereof that can attach, retain, or bind to a nucleic acid (e.g., an adapter strand). Exemplary capture agents include, but are not limited to, members of receptor-ligand binding pairs (e.g., avidin, streptavidin, biotin, lectins, carbohydrates, nucleic acid-binding proteins, epitopes, antibodies, etc.) that can bind to a member of the receptor-ligand binding pair, or chemical reagents that can form a covalent bond with a linking moiety. In one embodiment, the capture agent is biotin. The capture agent can bind to the adapter strand and bind to the end of the adapter so as not to interfere with binding of the adapter to the target nucleic acid. For example, the 5' end of the adapter can comprise a capture agent, or the 3' end of the adapter can comprise a capture agent. In one embodiment, the capture agent is attached to the 5' end of the transposon strand or the 3' end of the other transposon strand. Capture agents are useful for attaching adapters to solid surfaces such as beads or wells.

アダプターはまた、捕捉剤とアダプターとの間に切断可能なリンカーを含み得る。切断可能なリンカーの例としては、ジスルフィド結合が挙げられるが、これらに限定されず、これは、例えば、捕捉剤を放出するためにジチオスレイトールで切断され得る。切断可能なリンカーを有するビオチン標識ヌクレオチドを含む切断可能なリンカーを有する捕捉剤は、市販されている。 The adaptor may also include a cleavable linker between the capture agent and the adaptor. Examples of cleavable linkers include, but are not limited to, disulfide bonds, which can be cleaved, for example, with dithiothreitol, to release the capture agent. Capture agents with cleavable linkers, including biotin-labeled nucleotides with cleavable linkers, are commercially available.

対称アダプターを有する標的核酸の生成
本明細書で提供される方法、組成物、システム、及びキットは、任意選択的に、各末端に同じアダプターを有することによってシークエンシング及び対称に適した長さを有する修飾標的核酸を得るために、一次核酸のプロセッシングを含み得る。一次核酸の試料は、ゲノムＤＮＡなどの高分子量物質又は液体生検から得られるか、又はＲＮＡのＤＮＡへの変換によって得られる核酸分子などの低分子量物質などを含み得る。バルク中に存在する、単離された核に存在する、又は単離された細胞中に存在する核酸を核酸断片に処理するための様々な方法が、知られている。一実施形態では、トランスポソーム複合体が使用され、アダプターの付加をもたらす。別の実施形態では、ＤＮＡは、例えば、酵素的又は機械的方法によって断片化され、次いで、アダプターが断片の末端に付加される。別の実施形態では、ｍＲＮＡなどのＲＮＡ分子は、ｃＤＮＡに変換され、アダプターが末端に付加される。 Generation of Target Nucleic Acids with Symmetric Adapters The methods, compositions, systems, and kits provided herein may optionally include processing of primary nucleic acids to obtain modified target nucleic acids with the same adapter at each end, thereby achieving a length suitable for sequencing and symmetry. Primary nucleic acid samples may include high-molecular-weight materials, such as genomic DNA, or low-molecular-weight materials, such as nucleic acid molecules obtained from liquid biopsies or by converting RNA to DNA. Various methods are known for processing nucleic acids present in bulk, isolated nuclei, or isolated cells into nucleic acid fragments. In one embodiment, a transposome complex is used to add adapters. In another embodiment, DNA is fragmented, for example, by enzymatic or mechanical methods, and then adapters are added to the ends of the fragments. In another embodiment, RNA molecules, such as mRNA, are converted to cDNA, and adapters are added to the ends.

トランスポソーム複合体は、典型的にトランスポザーゼ認識部位を含むトランスポゾン配列に結合したトランスポザーゼであり、「タグメンテーション」と呼ばれることもあるプロセスで、トランスポザーゼ認識部位をＤＮＡ分子内の標的核酸に挿入することができる。タグメンテーションを単一工程の断片化及びライゲーションに組み合わせて、ユニバーサルアダプターを付加する（Ｇｕｎｄｅｒｓｏｎｅｔａｌ．，国際公開第２０１６／１３０７０４号）。当業者は、非対称標的核酸の生成が転位により容易かつ効率的に達成され、シークエンシングの準備ができているため、各末端に異なるアダプターを含む核酸断片を生成するためにタグメンテーションが典型的に使用されることを認識するであろう。非対称標的核酸を生成するためのタグメンテーション方法は有用であるが、非効率的であり、典型的に理論収率を５０％に低減する。対照的に、本開示の方法で使用される場合、タグメンテーションは、各末端に同じヌクレオチド配列を含む核酸断片を生成し、理論収率をほぼ１００％まで増加させることができる。 The transposome complex typically contains a transposase bound to a transposon sequence containing a transposase recognition site, which can then be inserted into a target nucleic acid within a DNA molecule in a process sometimes referred to as "tagmentation." Tagmentation combines a single-step fragmentation and ligation process to add universal adapters (Gunderson et al., WO 2016/130704). Those skilled in the art will recognize that tagmentation is typically used to generate nucleic acid fragments containing different adapters at each end, because the generation of asymmetric target nucleic acids is easily and efficiently achieved by transposition and is ready for sequencing. While useful, tagmentation methods for generating asymmetric target nucleic acids are inefficient, typically reducing theoretical yields to 50%. In contrast, when used in the methods disclosed herein, tagmentation generates nucleic acid fragments containing the same nucleotide sequence at each end, increasing theoretical yields to nearly 100%.

いくつかの実施形態では、トランスポゾンの１つの鎖は、挿入イベント中に標的核酸の５’末端に転移され、例えば、共有結合され得る。このような鎖は、「移送鎖」と称される。トランスポゾン配列は、１つ以上のインデックス配列、１つ以上のＵＭＩ、１つ以上のユニバーサル配列、１つ以上のＤＮＡ損傷、又はそれらの組み合わせを含み得るアダプターを含むことができる。一実施形態では、ユニバーサル配列は、トランスポザーゼ認識部位である。トランスポザーゼ認識部位の例としては、モザイク要素（mosaic element、ＭＥ）が挙げられるが、これに限定されない。一実施形態では、アダプター、例えば、１つ以上のインデックス配列、１つ以上のＵＭＩ、１つ以上のユニバーサル配列、１つ以上のＤＮＡ損傷、又はそれらの組み合わせが、転移鎖上に存在する。いくつかの実施形態では、トランスポゾンの１つの鎖は、挿入イベント中に標的核酸の３’末端に転移されず、例えば共有結合され得ない。このような鎖は、「転移鎖」と称される。非転移鎖の存在は、標的核酸のヌクレオチドの重複の転位反応中の生成をもたらし、アダプター配列の５’と標的核酸の３’末端との間にギャップを引き起こし得る。ギャップのサイズは変化し得、典型的には使用されるトランスポゾンシステムに依存する。例えば、Ｔｎ５ベースシステムによって導入されるギャップは、典型的には９塩基である。 In some embodiments, one strand of the transposon may be transferred, e.g., covalently linked, to the 5' end of the target nucleic acid during an insertion event. Such a strand is referred to as the "transfer strand." The transposon sequence may include an adapter, which may include one or more index sequences, one or more UMIs, one or more universal sequences, one or more DNA lesions, or a combination thereof. In one embodiment, the universal sequence is a transposase recognition site. Examples of transposase recognition sites include, but are not limited to, mosaic elements (MEs). In one embodiment, an adapter, e.g., one or more index sequences, one or more UMIs, one or more universal sequences, one or more DNA lesions, or a combination thereof, is present on the transferred strand. In some embodiments, one strand of the transposon is not transferred, e.g., covalently linked, to the 3' end of the target nucleic acid during an insertion event. Such a strand is referred to as the "transfer strand." The presence of the non-transferred strand may result in the generation of overlapping nucleotides of the target nucleic acid during the transposition reaction, causing a gap between the 5' end of the adapter sequence and the 3' end of the target nucleic acid. The size of the gap can vary and typically depends on the transposon system used. For example, gaps introduced by Tn5-based systems are typically 9 bases.

いくつかの実施形態は、高活性Ｔｎ５トランスポザーゼ及びＴｎ５型トランスポザーゼ認識部位（ＧｏｒｙｓｈｉｎａｎｄＲｅｚｎｉｋｏｆｆ，Ｊ．Ｂｉｏｌ．Ｃｈｅｍ．，２７３：７３６７（１９９８））、又はＲ１及びＲ２末端配列を含むＭｕＡトランスポザーゼ及びＭｕトランスポザーゼ認識部位（Ｍｉｚｕｕｃｈｉ，Ｋ．，Ｃｅｌｌ，３５：７８５，１９８３、Ｓａｖｉｌａｈｔｉ，Ｈ，ｅｔａｌ．，ＥＭＢＯＪ．，１４：４８９３，１９９５）の使用を含み得る。Ｔｎ５モザイク端（ＭｏｓａｉｃＥｎｄ、ＭＥ）配列、トランスポザーゼ認識部位もまた、当業者によって最適化されたものとしてが使用することができる。 Some embodiments may include the use of hyperactive Tn5 transposase and Tn5-type transposase recognition sites (Goryshin and Reznikoff, J. Biol. Chem., 273:7367 (1998)), or MuA transposase and Mu transposase recognition sites containing R1 and R2 end sequences (Mizuuchi, K., Cell, 35:785, 1983; Savilahti, H. et al., EMBO J., 14:4893, 1995). Tn5 Mosaic End (ME) sequences and transposase recognition sites can also be used, as optimized by those skilled in the art.

本明細書で提供される方法、組成物、システム及びキットの特定の実施形態とともに使用することができる転位システムの更なる例としては、ＳｔａｐｈｙｌｏｃｏｃｃｕｓａｕｒｅｕｓＴｎ５５２（Ｃｏｌｅｇｉｏｅｔａｌ．，Ｊ．Ｂａｃｔｅｒｉｏｌ．，１８３：２３８４－８，２００１、ＫｉｒｂｙＣｅｔａｌ．，Ｍｏｌ．Ｍｉｃｒｏｂｉｏｌ．，４３：１７３－８６，２００２）、Ｔｙ１（Ｄｅｖｉｎｅ＆Ｂｏｅｋｅ，ＮｕｃｌｅｉｃＡｃｉｄｓＲｅｓ．，２２：３７６５－７２，１９９４、及び国際公開第９５／２３８７５号）、トランスポゾンＴｎ７（Ｃｒａｉｇ，ＮＬ，Ｓｃｉｅｎｃｅ．２７１：１５１２，１９９６、Ｃｒａｉｇ，ＮＬ，ＣｕｒｒＴｏｐＭｉｃｒｏｂｉｏｌＩｍｍｕｎｏｌ．中のレビュー、２０４：２７－４８，１９９６）、Ｔｎ／Ｏ及びＩＳ１０（ＫｌｅｃｋｎｅｒＮ，ｅｔａｌ．，ＣｕｒｒＴｏｐＭｉｃｒｏｂｉｏｌＩｍｍｕｎｏｌ．，２０４：４９－８２，１９９６）、Ｍａｒｉｎｅｒトランスポザーゼ（ＬａｍｐｅＤＪ，ｅｔａｌ．，ＥＭＢＯＪ．，１５：５４７０－９，１９９６）、Ｔｃ１（ＰｌａｓｔｅｒｋＲＨ，Ｃｕｒｒ．ＴｏｐｉｃｓＭｉｃｒｏｂｉｏｌ．Ｉｍｍｕｎｏｌ．，２０４：１２５－４３，１９９６）、Ｐ要素（Ｇｌｏｏｒ，ＧＢ，ＭｅｔｈｏｄｓＭｏｌ．Ｂｉｏｌ．，２６０：９７－１１４，２００４）、Ｔｎ３（Ｉｃｈｉｋａｗａ＆Ｏｈｔｓｕｂｏ，ＪＢｉｏｌ．Ｃｈｅｍ．２６５：１８８２９－３２，１９９０）、細菌挿入配列（Ｏｈｔｓｕｂｏ＆Ｓｅｋｉｎｅ，Ｃｕｒｒ．Ｔｏｐ．Ｍｉｃｒｏｂｉｏｌ．Ｉｍｍｕｎｏｌ．２０４：１－２６，１９９６）、レトロウイルス（Ｂｒｏｗｎ，ｅｔａｌ，．ＰｒｏｃＮａｔｌＡｃａｄＳｃｉＵＳＡ，８６：２５２５－９，１９８９）、及び酵母のレトロトランスポゾン（Ｂｏｅｋｅ＆Ｃｏｒｃｅｓ，ＡｎｎｕＲｅｖＭｉｃｒｏｂｉｏｌ．４３：４０３－３４，１９８９）。その他の例としては、ＩＳ５、Ｔｎ１０、Ｔｎ９０３、ＩＳ９１１、及びトランスポザーゼファミリー酵素の操作型（Ｚｈａｎｇｅｔａｌ．，（２００９）ＰＬｏＳＧｅｎｅｔ．５：ｅ１０００６８９．Ｅｐｕｂ２００９Ｏｃｔ１６、ＷｉｌｓｏｎＣ．ｅｔａｌ（２００７）Ｊ．Ｍｉｃｒｏｂｉｏｌ．Ｍｅｔｈｏｄｓ７１：３３２－５）がある。 Further examples of transposition systems that can be used with certain embodiments of the methods, compositions, systems, and kits provided herein include Staphylococcus aureus Tn552 (Colegio et al., J. Bacteriol., 183:2384-8, 2001; Kirby C et al., Mol. Microbiol., 43:173-86, 2002), Ty1 (Devine & Boeke, Nucleic Acids Res., 22:3765-72, 1994, and WO 95/23875), transposon Tn7 (Craig, N L, Science. 271:1512, 1996; Craig, N L, Curr. Review in Top Microbiol Immunol., 204:27-48, 1996), Tn/O and IS10 (Kleckner N, et al., Curr Top Microbiol Immunol., 204:49-82, 1996), Mariner transposase (Lampe D J, et al., EMBO J., 15:5470-9, 1996), Tc1 (Plasterk RH, Curr Topics Microbiol Immunol., 204:125-43, 1996), P element (Gloor G B, Methods Mol. Biol., 260:97-114, 2004), Tn3 (Ichikawa & Ohtsubo, J. Biol. Chem. 265:18829-32, 1990), bacterial insertion sequences (Ohtsubo & Sekine, Curr. Top. Microbiol. Immunol. 204:1-26, 1996), retroviruses (Brown, et al., Proc. Natl. Acad. Sci. USA, 86:2525-9, 1989), and yeast retrotransposons (Boeke & Corces, Annu Rev. Microbiol. 43:403-34, 1989). Other examples include IS5, Tn10, Tn903, IS911, and engineered forms of transposase family enzymes (Zhang et al., (2009) PLoS Genet. 5:e1000689. Epub 2009 Oct 16; Wilson C. et al. (2007) J. Microbiol. Methods 71:332-5).

本明細書で提供される方法及び組成物とともに使用され得るインテグラーゼの他の例には、レトロウイルスインテグラーゼ及びそのようなレトロウイルスインテグラーゼのインテグラーゼ認識配列、例えば、ＨＩＶ－１、ＨＩＶ－２、ＳＩＶ、ＰＦＶ－１、ＲＳＶからのインテグラーゼが含まれる。 Other examples of integrases that can be used with the methods and compositions provided herein include retroviral integrases and integrase recognition sequences for such retroviral integrases, e.g., integrases from HIV-1, HIV-2, SIV, PFV-1, and RSV.

本明細書に記載の方法及び組成物で有用なトランスポゾン配列は、米国特許出願公開第２０１２／０２０８７０５号、米国特許出願公開第２０１２／０２０８７２４号及び国際公開第２０１２／０６１８３２号に提供されている。 Transposon sequences useful in the methods and compositions described herein are provided in U.S. Patent Application Publication No. 2012/0208705, U.S. Patent Application Publication No. 2012/0208724, and WO 2012/061832.

様々なトランスポソーム複合体構成が当該技術分野で既知である。一実施形態では、トランスポソーム複合体は、２つのサブユニット、及び２つの非連続的なトランスポゾン配列を有する二量体トランスポザーゼを含む。このようなトランスポソームの例は、当技術分野において既知である（例えば、米国特許出願公開第２０１０／０１２００９８号参照）。いくつかの実施形態では、トランスポソーム複合体は、２つのトランスポザーゼサブユニットを結合して「ループ状複合体」又は「ループ状トランスポソーム」を形成するトランスポゾン配列核酸を含む。一実施例では、トランスポソームは、二量体トランスポザーゼ及びトランスポゾン配列を含む。ループ状複合体は、標的ＤＮＡを断片化することなく、元の標的ＤＮＡの順序情報を維持しながら、トランスポゾンが標的ＤＮＡに挿入されることを確実にすることができる。理解されるように、ループ状構造は、標的核酸の物理的接続性を維持しながら、標的核酸に所望のアダプター配列を挿入し得る。いくつかの実施形態では、ループ状トランスポソーム複合体のトランスポゾン配列は、トランスポゾン配列を断片化して２つのトランスポゾン配列を含むトランスポソーム複合体を作成することができるように、断片化部位を含むことができる。このようなトランスポソーム複合体は、トランスポゾンが挿入される、近傍の標的ＤＮＡ断片が、アッセイの後の段階で明確に組み立てられ得るバーコードの組み合わせを確実に受け取るのに有用である。 Various transposome complex configurations are known in the art. In one embodiment, a transposome complex comprises a dimeric transposase having two subunits and two discontinuous transposon sequences. Examples of such transposomes are known in the art (see, e.g., U.S. Patent Application Publication No. 2010/0120098). In some embodiments, a transposome complex comprises a transposon sequence nucleic acid that links two transposase subunits to form a "looped complex" or "looped transposome." In one example, a transposome comprises a dimeric transposase and a transposon sequence. The looped complex can ensure that the transposon is inserted into the target DNA without fragmenting the target DNA and while maintaining the sequence information of the original target DNA. As will be appreciated, the looped structure can insert a desired adapter sequence into the target nucleic acid while maintaining the physical connectivity of the target nucleic acid. In some embodiments, the transposon sequence of a looped transposome complex can include a fragmentation site so that the transposon sequence can be fragmented to create a transposome complex containing two transposon sequences. Such transposome complexes are useful for ensuring that adjacent target DNA fragments into which the transposon is inserted receive barcode combinations that can be unambiguously assembled at later stages of the assay.

断片化部位は、トランスポソーム複合体を使用することによって標的核酸に導入され得る。一実施形態では、核酸の断片化後、トランスポザーゼは、核酸断片に結合したままであり、同じゲノムＤＮＡ分子に由来する核酸断片が物理的に連結されたままになる（Ａｄｅｙｅｔａｌ．，２０１４，ＧｅｎｏｍｅＲｅｓ．，２４：２０４１－２０４９）。開裂は、生化学的、化学的、又は他の手段によって行われてよい。いくつかの実施形態では、断片化部位は、様々な手段によって断片化され得るヌクレオチド又はヌクレオチド配列を含み得る。断片化部位の例としては、制限エンドヌクレアーゼ部位、ＲＮＡｓｅにより切断可能な少なくとも１つのリボヌクレオチド、特定の化学剤の存在下で切断可能なヌクレオチド類似体、過ヨウ素酸塩による処理で切断可能なジオール結合、化学還元剤で切断可能なジスルフィド基、光化学的切断に供され得る切断可能部分、及びペプチダーゼ酵素又は他の好適な手段によって切断可能なペプチドが挙げられるが、これらに限定されない（例えば、米国特許出願公開第２０１２／０２０８７０５号、米国特許出願公開第２０１２／０２０８７２４号、及び国際公開第２０１２／０６１８３２号を参照）。 Fragmentation sites can be introduced into target nucleic acids by using transposome complexes. In one embodiment, after fragmentation of the nucleic acid, the transposase remains bound to the nucleic acid fragments, such that nucleic acid fragments derived from the same genomic DNA molecule remain physically linked (Adey et al., 2014, Genome Res., 24:2041-2049). Cleavage can be achieved by biochemical, chemical, or other means. In some embodiments, fragmentation sites can include nucleotides or nucleotide sequences that can be fragmented by various means. Examples of fragmentation sites include, but are not limited to, restriction endonuclease sites, at least one ribonucleotide cleavable by an RNAse, a nucleotide analogue cleavable in the presence of a particular chemical agent, a diol bond cleavable by treatment with periodate, a disulfide group cleavable by a chemical reducing agent, a cleavable moiety that can be subjected to photochemical cleavage, and a peptide cleavable by a peptidase enzyme or other suitable means (see, e.g., U.S. Patent Application Publication No. 2012/0208705, U.S. Patent Application Publication No. 2012/0208724, and WO 2012/061832).

一次核酸がＤＮＡである実施形態では、転位の結果は、修飾標的核酸のライブラリーであり、各断片は、各末端に対称アダプターを含む。対照的に、一次核酸がＲＮＡである実施形態では、転位の結果は、最大３つの異なるタイプの修飾標的核酸である。第１の集団は、修飾標的核酸のライブラリーを含み、各断片は、各末端に対称アダプターを含む。第２及び第３の集団は各々、一方の末端でトランスポゾンによって導入されたアダプターを含み、他端、すなわち、ＲＮＡの３’又は５’末端のいずれかに対応する末端は、鋳型スイッチプライマー、ランダムプライマー、又はポリＴなどの代替方法によって付加される。 In embodiments where the primary nucleic acid is DNA, the result of transposition is a library of modified target nucleic acids, each fragment containing a symmetric adapter at each end. In contrast, in embodiments where the primary nucleic acid is RNA, the result of transposition is up to three different types of modified target nucleic acids. The first population contains a library of modified target nucleic acids, each fragment containing a symmetric adapter at each end. The second and third populations each contain an adapter introduced by the transposon at one end, and the other end, i.e., the end corresponding to either the 3' or 5' end of the RNA, is added by an alternative method such as a template switch primer, a random primer, or polyT.

転位の代わりに、断片化によって標的核酸を得ることができる。試料からの一次核酸の断片化は、酵素法、化学的方法、又は機械的方法によって順不同の様式で達成され得、次いで、アダプターが断片の末端に付加される。酵素的断片化の例としては、ＣＲＩＳＰＲ及びＴａｌｅｎ様酵素、並びにＤＮＡ断片がハイブリダイズし、伸長又は増幅を開始することができる一本鎖領域を作製することができるＤＮＡ（例えば、ヘリカーゼ）をほどく酵素が挙げられる。例えば、ヘリカーゼベースの増幅を使用することができる（Ｖｉｎｃｅｎｔｅｔａｌ．，２００４，ＥＭＢＯＲｅｐ．，５（８）：７９５－８００）。一実施形態では、伸長又は増幅は、ランダムプライマーを用いて開始される。機械的断片化の例としては、噴霧化又は超音波処理が挙げられる。 Instead of transposition, target nucleic acids can be obtained by fragmentation. Fragmentation of primary nucleic acids from a sample can be achieved in any order by enzymatic, chemical, or mechanical methods, followed by the addition of adapters to the ends of the fragments. Examples of enzymatic fragmentation include CRISPR and Talen-like enzymes, as well as enzymes that unwind DNA (e.g., helicases) to create single-stranded regions to which DNA fragments can hybridize and initiate extension or amplification. For example, helicase-based amplification can be used (Vincent et al., 2004, EMBO Rep., 5(8):795-800). In one embodiment, extension or amplification is initiated using random primers. Examples of mechanical fragmentation include nebulization or sonication.

機械的手段による一次核酸の断片化は、平滑末端、３’オーバーハング末端、及び５’オーバーハング末端の異種混合物を有する断片をもたらす。したがって、例えば、平滑部位にアダプターを付加するのに最適な端部を生成するために、当該技術分野において既知の方法を使用して、断片末端を修復することが望ましい。特定の実施形態では、核酸集団の断片末端は、平滑末端である。より具体的には、断片末端は、平滑末端であり、リン酸化されている。リン酸部分は、酵素処理によって、例えば、ポリヌクレオチドキナーゼを使用して導入することができる。 Fragmentation of primary nucleic acids by mechanical means results in fragments having a heterogeneous mixture of blunt ends, 3' overhanging ends, and 5' overhanging ends. Therefore, it may be desirable to repair the fragment ends using methods known in the art, e.g., to generate ends that are optimal for adding adapters to the blunt sites. In certain embodiments, the fragment ends of the nucleic acid population are blunt. More specifically, the fragment ends are blunt and phosphorylated. Phosphate moieties can be introduced by enzymatic treatment, e.g., using polynucleotide kinase.

一実施形態では、断片化した核酸は、オーバーハングヌクレオチドを用いて調製される。例えば、単一のオーバーハングヌクレオチドは、例えばヌクレオチド「Ａ」をＤＮＡ分子の３’末端に付加するなど単一のデオキシヌクレオチドを付加する、鋳型非依存の末端トランスフェラーゼ活性を有する、Ｔａｑポリメラーゼ又はＫｌｅｎｏｗエキソマイナスポリメラーゼなど特定タイプのＤＮＡポリメラーゼの活性によって付加することができる。このような酵素を使用して、二本鎖核酸断片の各鎖の平滑末端の３’末端に単一ヌクレオチド「Ａ」を付加することができる。したがって、Ｔａｑ又はＫｌｅｎｏｗエキソマイナスポリメラーゼとの反応によって、二本鎖標的断片の末端修復された各鎖の３’末端に「Ａ」を付加することができ、一方、アダプターは、ユニバーサルアダプターの二本鎖核酸の各領域の３’末端に存在する適合性のある「Ｔ」オーバーハングを有するＴ構築物であり得る。一実施例では、末端デオキシヌクレオチジルトランスフェラーゼ（terminal deoxynucleotidyl transferase、ＴｄＴ）を使用して、複数の「Ｔ」ヌクレオチド」（ＳｗｉｆｔＢｉｏｓｃｉｅｎｃｅｓ，ＡｎｎＡｒｂｏｒ，ＭＩ）を付加することができる。このタイプの末端修飾はまた、各末端に同じアダプターを有する標的核酸を形成するバイアスが存在するように、ベクター及び標的の両方の自己ライゲーションを防止する。 In one embodiment, the fragmented nucleic acid is prepared with an overhanging nucleotide. For example, a single overhanging nucleotide can be added by the activity of certain types of DNA polymerases, such as Taq polymerase or Klenow exo minus polymerase, which have non-templated terminal transferase activity that adds a single deoxynucleotide, e.g., the nucleotide "A," to the 3' end of a DNA molecule. Using such enzymes, a single "A" nucleotide can be added to the 3' end of the blunt end of each strand of a double-stranded nucleic acid fragment. Thus, an "A" can be added to the 3' end of each end-repaired strand of a double-stranded target fragment by reaction with Taq or Klenow exo minus polymerase, while the adapter can be a T construct with a compatible "T" overhang present at the 3' end of each region of the double-stranded nucleic acid in a universal adapter. In one example, terminal deoxynucleotidyl transferase (TdT) can be used to add multiple "T" nucleotides (Swift Biosciences, Ann Arbor, MI). This type of end modification also prevents self-ligation of both the vector and the target, such that there is a bias toward forming target nucleic acids with the same adapter at each end.

アダプターは、例えば、断片の末端又はアニーリングされたプライマーの伸長への二本鎖アダプターのライゲーションを含む様々な方法によって、断片化されたＤＮＡ又は非対称ＤＮＡ標的核酸の末端に付加され得る。断片の末端への二本鎖アダプターのライゲーションは、断片の末端に存在するオーバーハングを使用することによって、平滑末端化又は助けられ得る。アダプターは、ライゲーション又は重合を含む一本鎖又は二本鎖アダプターを使用して付加することもできる（例えば、ＴｄＴ標識）。一実施形態では、アダプターは、結果として生じる修飾標的核酸の１つの鎖にギャップをもたらすように構成されている。一実施形態では、ギャップは、少なくとも１つのヌクレオチドである。一実施形態では、ギャップは、標的核酸の３’末端と標的核酸に付着したアダプターの５’末端との間に位置する。 Adapters can be added to the ends of fragmented DNA or asymmetric DNA target nucleic acids by a variety of methods, including, for example, ligation of double-stranded adapters to the ends of fragments or extension of annealed primers. Ligation of double-stranded adapters to the ends of fragments can be blunt-ended or aided by using overhangs present at the ends of the fragments. Adapters can also be added using single- or double-stranded adapters (e.g., TdT labeling) involving ligation or polymerization. In one embodiment, the adapter is configured to introduce a gap in one strand of the resulting modified target nucleic acid. In one embodiment, the gap is at least one nucleotide. In one embodiment, the gap is located between the 3' end of the target nucleic acid and the 5' end of the adapter attached to the target nucleic acid.

一次核酸がＲＮＡである実施形態では、対称アダプターを有する標的核酸を生成することは、典型的には、一方又は両方の末端でのアダプターの任意の導入を伴うＤＮＡへのＲＮＡの変換を含む。様々な方法を使用して、ｍＲＮＡの３’側にアダプターを付加することができる。例えば、アダプターは、ｃＤＮＡを生成するために使用されている日常的な方法で付加され得る。３’末端にポリＴ配列を有するプライマー及びポリＴ配列の上流のアダプターをｍＲＮＡ分子にアニーリングされ、逆転写酵素を使用して伸長させることができる。これにより、ＤＮＡへのｍＲＮＡの１工程変換、任意選択的に、３’末端へのユニバーサル配列の１工程変換をもたらす。一実施形態では、プライマーはまた、１つ以上のインデックス配列、１つ以上のＵＭＩ、１つ以上のユニバーサル配列、又はそれらの組み合わせを含み得る。一実施形態では、ランダムプライマーを使用する。 In embodiments in which the primary nucleic acid is RNA, generating a target nucleic acid with symmetric adapters typically involves converting the RNA to DNA, with the optional introduction of adapters at one or both ends. Various methods can be used to add adapters to the 3' end of the mRNA. For example, adapters can be added using routine methods used to generate cDNA. A primer with a poly-T sequence at its 3' end and an adapter upstream of the poly-T sequence can be annealed to the mRNA molecule and extended using reverse transcriptase. This results in a one-step conversion of the mRNA to DNA, and, optionally, a one-step conversion of a universal sequence at the 3' end. In one embodiment, the primer can also include one or more index sequences, one or more UMIs, one or more universal sequences, or a combination thereof. In one embodiment, random primers are used.

非コードＲＮＡはまた、ＤＮＡに変換することができ、任意選択的に、様々な方法を使用してユニバーサル配列を含むように修飾され得る。例えば、ランダム配列及び鋳型スイッチプライマーを含む第１プライマーを使用してアダプターを付加することができ、いずれかのプライマーもアダプターを含むことができる。合成鎖の３’末端への非鋳型ヌクレオチドの付加をもたらすために末端トランスフェラーゼ活性を有する逆転写酵素を使用することができ、鋳型スイッチプライマーは、逆転写酵素により付加される非鋳型ヌクレオチドとアニーリングするヌクレオチドを含む。有用な逆転写酵素の例は、モロニ－マウス白血病ウイルス逆転写酵素である。特定の実施形態では、ＴａｋａｒａＢｉｏＵＳＡ，Ｉｎｃ．から入手可能な試薬ＳＭＡＲＴｅｒ（商標）（Ｃａｔ．Ｎｏ．６３４９２６）を使用して、インデックスを非コードＲＮＡに付加し、必要に応じてｍＲＮＡを付加するために使用される。任意選択的に、鋳型スイッチプライマーを、ポリＴ配列を有するプライマーと併せてｍＲＮＡで用い、ＲＮＡから生成されたＤＮＡ標的核酸の両端にユニバーサル配列を付加することができる。一実施形態では、同じアダプターが両方の末端に付加される。 Non-coding RNA can also be converted to DNA and, optionally, modified to contain a universal sequence using various methods. For example, an adapter can be added using a first primer containing a random sequence and a template-switch primer, and either primer can contain an adapter. A reverse transcriptase with terminal transferase activity can be used to effect the addition of a non-templated nucleotide to the 3' end of the synthesized strand, and the template-switch primer contains a nucleotide that anneals to the non-templated nucleotide added by the reverse transcriptase. An example of a useful reverse transcriptase is Moloney murine leukemia virus reverse transcriptase. In certain embodiments, the SMARTer™ (Cat. No. 634926) reagent available from Takara Bio USA, Inc. is used to add an index to the non-coding RNA and, if necessary, to the mRNA. Optionally, a template-switch primer can be used with mRNA in conjunction with a primer containing a poly-T sequence to add a universal sequence to both ends of the DNA target nucleic acid generated from the RNA. In one embodiment, the same adapter is added to both ends.

標的核酸の集団は、本明細書に記載の方法又は組成物の特定の用途に望ましい又は適切な平均鎖長を有し得る。例えば、本明細書に記載の方法の１つ以上の工程で使用されるか、又は特定の組成物、システム、若しくはキットに存在するメンバーの平均鎖長は、約１００，０００ヌクレオチド、５０，０００ヌクレオチド、１０，０００ヌクレオチド、５，０００ヌクレオチド、１，０００ヌクレオチド、５００ヌクレオチド、１００ヌクレオチド、又は５０ヌクレオチド未満であり得る。代替的に、又は追加的に、平均鎖長は、約１０ヌクレオチド、５０ヌクレオチド、１００ヌクレオチド、５００ヌクレオチド、１，０００ヌクレオチド、５，０００ヌクレオチド、１０，０００ヌクレオチド、５０，０００ヌクレオチド、又は１００，０００ヌクレオチド超であり得る。標的核酸の集団の平均鎖長は、上記の最大値と最小値との間の範囲内であり得る。増幅部位で生成された（又は別の手段で本明細書で作製又は使用された）アンプリコンは、上記に例示されているものから選択される上限と下限との間の範囲の平均鎖長を有し得ることが理解されよう。 A population of target nucleic acids can have an average chain length desirable or appropriate for a particular application of a method or composition described herein. For example, the average chain length of members used in one or more steps of a method described herein or present in a particular composition, system, or kit can be less than about 100,000 nucleotides, 50,000 nucleotides, 10,000 nucleotides, 5,000 nucleotides, 1,000 nucleotides, 500 nucleotides, 100 nucleotides, or 50 nucleotides. Alternatively, or additionally, the average chain length can be greater than about 10 nucleotides, 50 nucleotides, 100 nucleotides, 500 nucleotides, 1,000 nucleotides, 5,000 nucleotides, 10,000 nucleotides, 50,000 nucleotides, or 100,000 nucleotides. The average chain length of a population of target nucleic acids can be within a range between the maximum and minimum values listed above. It will be understood that amplicons produced at an amplification site (or otherwise produced or used herein) can have an average chain length ranging between an upper and lower limit selected from those exemplified above.

いくつかの実施形態では、標的核酸は、例えば、排除増幅を容易にするために、増幅部位の面積に対して大きさを決定される。例えば、アレイの部位の各々の面積は、排除増幅を達成するために、標的核酸の排除体積の直径よりも大きくすることができる。例えば、アレイの表面の特徴を使用する実施形態をとると、各特徴の面積は、増幅部位に輸送される標的核酸の排除体積の直径よりも大きくなり得る。標的核酸の排除体積及びその直径は、例えば、標的核酸の長さから決定することができる。核酸の排除体積及び排除体積の直径を決定するための方法は、例えば、米国特許第７，７８５，７９０号、Ｒｙｂｅｎｋｏｖｅｔａｌ．Ｐｒｏｃ．Ｎａｔｌ．Ａｃａｄ．Ｓｃｉ．Ｕ．Ｓ．Ａ．９０：５３０７－５３１１（１９９３）、Ｚｉｍｍｅｒｍａｎｅｔａｌ．，Ｊ．Ｍｏｌ．Ｂｉｏｌ．２２２：５９９－６２０（１９９１）、又はＳｏｂｅｌｅｔａｌ．，Ｂｉｏｐｏｌｙｍｅｒｓ３１：１５５９－１５６４（１９９１）に記載されている。 In some embodiments, the target nucleic acid is sized relative to the area of the amplification site, for example, to facilitate exclusion amplification. For example, the area of each of the array sites can be larger than the diameter of the exclusion volume of the target nucleic acid to achieve exclusion amplification. For example, in embodiments using surface features of an array, the area of each feature can be larger than the diameter of the exclusion volume of the target nucleic acid transported to the amplification site. The exclusion volume and its diameter of the target nucleic acid can be determined, for example, from the length of the target nucleic acid. Methods for determining the exclusion volume and diameter of the exclusion volume of a nucleic acid are described, for example, in U.S. Pat. No. 7,785,790; Rybenkov et al. Proc. Natl. Acad. Sci. U.S.A. 90:5307-5311 (1993); Zimmerman et al., J. Mol. Biol. 222:599-620 (1991); or Sobel et al. , Biopolymers 31:1559-1564 (1991).

標的核酸のタグメンテーション又は断片化及びプロセッシングによって一次核酸断片を生成することにより、分子の純度を高めるためのクリーンアッププロセスが続くことができる。電気泳動、サイズ排除クロマトグラフィーなどの任意の好適なクリーンアッププロセスが使用されてよい。いくつかの実施形態では、固相可逆性固定常磁性ビーズを用いて、例えば、組み込まれていないプライマーから所望のＤＮＡ分子を分離し、サイズに基づいて核酸を選択することができる。固相可逆性固定常磁性ビーズは、ベックマン・コールター社（ＡｇｅｎｃｏｕｒｔＡＭＰｕｒｅＸＰ）、サーモフィッシャー社（ＭａｇＪｅｔ）、オメガ・バイオテック社（Ｍａｇ－Ｂｉｎｄ）、プロメガ・ビーズ社（Ｐｒｏｍｅｇａ）、及びカパ・バイオシステムズ社（ＫａｐａＰｕｒｅＢｅａｄｓ）から市販されている。 Tagmentation or fragmentation and processing of the target nucleic acid to generate primary nucleic acid fragments can be followed by a cleanup process to increase molecular purity. Any suitable cleanup process, such as electrophoresis or size exclusion chromatography, can be used. In some embodiments, solid-phase reversibly immobilized paramagnetic beads can be used to separate desired DNA molecules from unincorporated primers and select nucleic acids based on size, for example. Solid-phase reversibly immobilized paramagnetic beads are commercially available from Beckman Coulter (Agencourt AMPure XP), Thermo Fisher (MagJet), Omega Biotech (Mag-Bind), Promega Beads (Promega), and Kapa Biosystems (Kapa Pure Beads).

標的核酸の対称から非対称からの変換
本明細書で提供される方法、組成物、システム、及びキットは、対称標的核酸を非対称アダプターで標的核酸に変換することを含む。本明細書で論じられるように、いくつかの実施形態では、標的核酸の各末端へのアダプターの付加は、得られた修飾標的核酸の各鎖にギャップをもたらす。一実施形態では、ギャップは、標的核酸の３’末端と標的核酸の各末端に結合したアダプターの５’末端との間に位置する。一実施形態では、ギャップは、ヌクレオチドで充填され、標的核酸の３’末端をプライマーとして使用してライゲートされ得る。例えば、トランスポソーム複合体を使用するいくつかの実施形態では、Ｔｎ５ベースのトランスポゾン挿入によって作成された９ｂｐの標的配列重複が伸長される。一実施形態では、伸長は、鎖置換ポリメラーゼを使用して、上流配列の置換をもたらす。一実施形態では、転位によって作成された標的配列重複は、伸長されない。一実施形態では、ライゲーションが使用される。伸長がギャップを充填するために使用される場合、損傷不耐性ポリメラーゼ又は損傷耐性ポリメラーゼを使用することができる。 Conversion of a Target Nucleic Acid from Symmetric to Asymmetric The methods, compositions, systems, and kits provided herein include converting a symmetric target nucleic acid to a target nucleic acid with an asymmetric adapter. As discussed herein, in some embodiments, the addition of an adapter to each end of the target nucleic acid results in a gap in each strand of the resulting modified target nucleic acid. In one embodiment, the gap is located between the 3' end of the target nucleic acid and the 5' end of the adapter attached to each end of the target nucleic acid. In one embodiment, the gap can be filled with nucleotides and ligated using the 3' end of the target nucleic acid as a primer. For example, in some embodiments using a transposome complex, a 9-bp target sequence overlap created by Tn5-based transposon insertion is extended. In one embodiment, extension results in displacement of the upstream sequence using a strand-displacing polymerase. In one embodiment, the target sequence overlap created by transposition is not extended. In one embodiment, ligation is used. When extension is used to fill the gap, a damage-intolerant or damage-tolerant polymerase can be used.

一実施形態では、アダプターがＤＮＡ損傷を含み、アダプターが標的核酸に結合している１つの鎖にギャップが存在する場合、ＤＮＡ損傷及びギャップは、異なる鎖上に位置する。伸長によってギャップを充填するために使用されるポリメラーゼは、鋳型鎖にＤＮＡ損傷を使用し、ポリメラーゼが損傷不耐性である場合、伸長は終了する。したがって、この構成が存在する場合、損傷不耐性ポリメラーゼの使用により、ギャップの下流のアダプターのアダプター配列の一部分のみの保持がもたらされる。これにより、標的核酸の１つのアダプターの修飾及び非対称標的核酸の生成がもたらされる。当業者は、非対称標的核酸が、ペアエンドシークエンシング反応を含むシークエンシング反応において使用され得ることを認識するであろう。しかしながら、本開示の方法は、本明細書に記載される更なる利点を提供する。 In one embodiment, when an adapter contains a DNA lesion and a gap exists in one strand where the adapter is bound to the target nucleic acid, the DNA lesion and the gap are located on different strands. The polymerase used to fill the gap by extension uses the DNA lesion in the template strand, and if the polymerase is damage intolerant, extension terminates. Thus, when this configuration exists, use of a damage-intolerant polymerase results in retention of only a portion of the adapter sequence of the adapter downstream of the gap. This results in modification of one adapter of the target nucleic acid and generation of an asymmetric target nucleic acid. Those skilled in the art will recognize that asymmetric target nucleic acids can be used in sequencing reactions, including paired-end sequencing reactions. However, the methods of the present disclosure provide additional advantages, as described herein.

対称標的核酸を生成し、次いで１つのアダプターを修飾して非対称標的核酸をもたらす一実施形態で起こり得る構造の例を図２に示す。例示的な標的核酸２０を、図２Ａに示す対称アダプター２２とともに示す。この例示的な実施形態では、対称アダプターは、ＤＮＡ損傷（Ｕで示す）を含む。一方の鎖の３’末端がブロックされ（＊で示す）、他方の鎖の３’末端がオーバーハングを含む。アダプターは、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含み得る。標的核酸２０の各末端へのアダプターの付着後、修飾標的核酸２３は、元の標的核酸２０の３’末端にギャップ２４を含む。損傷不耐性ポリメラーゼを用いる修飾標的核酸２３の伸長は、ギャップ２４の３’末端から始まり、ＤＮＡ損傷Ｕで停止し、得られた修飾標的核酸２５を図２Ｃに示す。修飾標的核酸２５の変性により、核酸が、一方の末端にＤＮＡ損傷を有する対称アダプター２２の鎖と、他端にギャップとＤＮＡ損傷との間に位置する対称アダプター配列の一部２７と、を含む、非対称標的核酸２６がもたらされる。 An example of a possible structure for one embodiment in which a symmetric target nucleic acid is generated and then one adapter is modified to result in an asymmetric target nucleic acid is shown in Figure 2. An exemplary target nucleic acid 20 is shown with a symmetric adapter 22, as shown in Figure 2A. In this exemplary embodiment, the symmetric adapter contains a DNA lesion (denoted by U). The 3' end of one strand is blocked (denoted by *), and the 3' end of the other strand contains an overhang. The adapter may contain one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof. After attachment of an adapter to each end of target nucleic acid 20, modified target nucleic acid 23 contains a gap 24 at the 3' end of the original target nucleic acid 20. Extension of modified target nucleic acid 23 using a damage-intolerant polymerase begins at the 3' end of gap 24 and terminates at the DNA lesion U, resulting in modified target nucleic acid 25, as shown in Figure 2C. Denaturation of the modified target nucleic acid 25 results in an asymmetric target nucleic acid 26, in which the nucleic acid comprises a strand of the symmetric adapter 22 with a DNA lesion at one end and a portion of the symmetric adapter sequence 27 at the other end located between the gap and the DNA lesion.

対称標的核酸の非対称アダプターを有する標的核酸への修飾後、非対称標核酸を更に修飾することができる。例えば、配列は、末端のうちの１つを特異的に標的とすることによって、例えば、標的核酸に付加された第１のアダプターにヌクレオチドを付加するか、又は非対称標的核酸をもたらすように修飾されたアダプターにヌクレオチドを付加することによって付加され得る。一実施形態では、修飾は、非対称標的核酸（例えば、図２Ｄに図示するように、アダプター２２から２７への修飾）をもたらすように修飾されたアダプターに第２のアダプターを付加するための伸長反応においてプライマーを使用することを含み得る。 After modification of the symmetric target nucleic acid to a target nucleic acid having an asymmetric adapter, the asymmetric target nucleic acid can be further modified. For example, a sequence can be added by specifically targeting one of the termini, e.g., by adding nucleotides to a first adapter added to the target nucleic acid, or by adding nucleotides to an adapter modified to yield an asymmetric target nucleic acid. In one embodiment, the modification can include using a primer in an extension reaction to add a second adapter to an adapter modified to yield an asymmetric target nucleic acid (e.g., modification of adapters 22 to 27, as illustrated in Figure 2D).

修飾に使用されるプライマーは、少なくとも２つのドメインを含み得る。第１のドメインは、プライマーの３’末端に存在し、非対称標的核酸をもたらすように修飾されたアダプターの一部にアニーリングする配列を含む。第１のドメインは、本明細書ではアニーリングドメインとも呼ばれる。当業者は、第１のドメインが特定のアニーリングのために十分な長さを有する場合、プライマーが本方法において有用であることを認識するであろう。当業者はまた、プライマーがアニーリングするヌクレオチドが、非対称アダプターの３’ヌクレオチドを含み、それによって３’ヌクレオチドをプライマーの第２のドメインを鋳型として使用する伸長のための好適な開始部位にする場合、プライマーが本方法において有用であることを認識するであろう。非対称標的核酸の３’末端は、ライゲーションを使用して修飾することもできる。 The primer used for modification may contain at least two domains. The first domain is present at the 3' end of the primer and contains a sequence that anneals to a portion of the modified adapter to yield an asymmetric target nucleic acid. The first domain is also referred to herein as the annealing domain. One skilled in the art will recognize that a primer is useful in the present method if the first domain has a sufficient length for specific annealing. One skilled in the art will also recognize that a primer is useful in the present method if the nucleotide to which the primer anneals includes the 3' nucleotide of the asymmetric adapter, thereby making the 3' nucleotide a suitable initiation site for extension using the second domain of the primer as a template. The 3' end of the asymmetric target nucleic acid can also be modified using ligation.

一実施形態では、アニーリングドメインの１つ以上のヌクレオチドは、改変ヌクレオチドである。改変ヌクレオチドは、対応する天然ＤＮＡヌクレオチドよりも高い融解温度で変性するヌクレオチド、例えば、対応する天然ＤＮＡヌクレオチドよりも高い強度を有する相補的な天然ヌクレオチドＡ、Ｔ、Ｇ、又はＣとのヌクレオチド水素結合である。改変ヌクレオチドの例としては、ロックド核酸（locked nucleic acid、ＬＮＡ）、架橋核酸（bridged nucleic acid、ＢＮＡ）、擬似相補的塩基、ペプチド核酸（peptide nucleic acid、ＰＮＡ）、２，６－ジアミノプリン、５’メチルｄＣ、ＳｕｐｅｒＴ、ＲＮＡヌクレオチド、又は本質的に溶融温度を増加させる当該技術分野で既知の任意のヌクレオチド若しくは塩基が挙げられるが、これらに限定されない。プライマーの第１のドメイン内の改変ヌクレオチドの数は、少なくとも１つ、少なくとも２つ、少なくとも３つ、少なくとも４つぃ、又は少なくとも５つであり得る。いくつかの実施形態では、天然ヌクレオチドと改変ヌクレオチドの組み合わせが使用される。一実施形態では、改変ヌクレオチドは、ポリメラーゼ開始部位から少なくとも５、少なくとも１０、又は少なくとも１５ヌクレオチド離れている。一実施形態では、伸長に有用なプライマーの濃度は、日常的な滴定によって決定することができる。 In one embodiment, one or more nucleotides of the annealing domain are modified nucleotides. Modified nucleotides are nucleotides that denature at a higher melting temperature than the corresponding natural DNA nucleotides, e.g., nucleotide hydrogen bonds with the complementary natural nucleotides A, T, G, or C with greater strength than the corresponding natural DNA nucleotides. Examples of modified nucleotides include, but are not limited to, locked nucleic acids (LNAs), bridged nucleic acids (BNAs), pseudo-complementary bases, peptide nucleic acids (PNAs), 2,6-diaminopurine, 5'-methyl dC, SuperT, RNA nucleotides, or any nucleotide or base known in the art that essentially increases the melting temperature. The number of modified nucleotides in the first domain of the primer can be at least one, at least two, at least three, at least four, or at least five. In some embodiments, a combination of natural and modified nucleotides is used. In one embodiment, the modified nucleotides are at least 5, at least 10, or at least 15 nucleotides away from the polymerase start site. In one embodiment, the concentration of primer useful for extension can be determined by routine titration.

一実施形態では、プライマー又はアダプターの３’末端をブロックして、ＤＮＡポリメラーゼによるプライマーの３’末端上のヌクレオチドの組み込みを防止する。プライマーの３’末端をブロックするための方法の例としては、３’－ＯＨ基の除去、又はプライマーの３’末端にジデオキシヌクレオチド（ｄｄＮＴＰ）などのヌクレオチド、逆塩基、それらの相補体を含まない追加の塩基、又はミスマッチ塩基などのヌクレオチドの存在によるが挙げられるが、これらに限定されない。 In one embodiment, the 3' end of a primer or adapter is blocked to prevent incorporation of a nucleotide on the 3' end of the primer by a DNA polymerase. Examples of methods for blocking the 3' end of a primer include, but are not limited to, removal of the 3'-OH group or the presence of a nucleotide, such as a dideoxynucleotide (ddNTP), a reverse base, an additional base not containing its complement, or a mismatch base, at the 3' end of the primer.

プライマーの第２のドメインは、アダプターを含むヌクレオチド配列を有する。アダプターは、１つ以上のインデックス配列、１つ以上のＵＭＩ、１つ以上のユニバーサル配列、又はそれらの組み合わせを含み得る。典型的には、アダプターに存在する任意のインデックス配列、ＵＭＩ、及びユニバーサル配列は、非対称標的核酸中に既に存在する任意のインデックス配列、ＵＭＩ、及びユニバーサル配列と比較して固有である。いくつかの実施形態では、存在する場合、ユニバーサル配列は、プライマーの５’末端に位置し得、インデックス又はＵＭＩなどのいかなる任意の配列も、第１のドメインとユニバーサル配列との間に存在することができる。 The second domain of the primer has a nucleotide sequence that includes an adapter. The adapter may include one or more index sequences, one or more UMIs, one or more universal sequences, or a combination thereof. Typically, any index sequences, UMIs, and universal sequences present in the adapter are unique compared to any index sequences, UMIs, and universal sequences already present in the asymmetric target nucleic acid. In some embodiments, the universal sequence, if present, may be located at the 5' end of the primer, and any optional sequence, such as an index or UMI, may be present between the first domain and the universal sequence.

プライマーを使用して、一方の末端に対称アダプターを、他端に非対称アダプターを有する一本鎖非対称標的核酸の３’末端を伸長又はリガンドする。 A primer is used to extend or ligate the 3' end of a single-stranded asymmetric target nucleic acid that has a symmetric adapter on one end and an asymmetric adapter on the other end.

いくつかの実施形態では、伸長の有効性は、アニーリング温度に依存し、当業者は、ｑＰＣＲなどの温度滴定及び増幅を使用して有用なアニーリング温度を容易に特定することができる。一実施形態では、損傷不耐性ＤＮＡポリメラーゼが伸長に使用される。伸長の結果は、一方の末端に対称アダプターを保持する非対称標的核酸であり、他端の非対称アダプターは、別のアダプターを含むように修飾されている。 In some embodiments, the effectiveness of extension depends on the annealing temperature, and one of skill in the art can readily identify useful annealing temperatures using temperature titration and amplification, such as qPCR. In one embodiment, a damage-intolerant DNA polymerase is used for extension. The result of extension is an asymmetric target nucleic acid that retains a symmetric adapter at one end, and the asymmetric adapter at the other end has been modified to contain another adapter.

天然ヌクレオチドＡ、Ｔ、Ｇ、及びＣは、伸長に使用することができる。いくつかの実施形態では、非天然ヌクレオチドが使用される。例えば、メチル化シトシンを使用することができる。メチル化シトシンは、アダプタープライマーが典型的にシトシンのウラシル変換中に変換されないため、メチル化シークエンシング用途（国際公開第２０１７／１０６４８１号）において有利である。 Natural nucleotides A, T, G, and C can be used for extension. In some embodiments, unnatural nucleotides are used. For example, methylated cytosine can be used. Methylated cytosine is advantageous in methylation sequencing applications (WO 2017/106481) because adapter primers typically are not converted during cytosine to uracil conversion.

一実施形態では、伸長反応が繰り返される。本発明者らは、少なくとも１つの改変ヌクレオチドを有する２ドメインプライマーを用いた複数の伸長サイクルの使用により、非対称修飾標的核酸の収量が理論上の最大収率近くまで驚くほどかつ予想外に増加することを見出した。一実施形態では、伸長回数は、少なくとも１、少なくとも３、少なくとも５、少なくとも７、少なくとも９、又は少なくとも１０であり得る。一実施形態では、伸長回数は、１５以下、１３以下、又は１１以下であり得る。一実施形態では、伸長回数は、１０である。 In one embodiment, the extension reaction is repeated. The inventors have discovered that the use of multiple extension cycles with a two-domain primer having at least one modified nucleotide surprisingly and unexpectedly increases the yield of asymmetrically modified target nucleic acids to near the theoretical maximum yield. In one embodiment, the number of extensions can be at least 1, at least 3, at least 5, at least 7, at least 9, or at least 10. In one embodiment, the number of extensions can be 15 or less, 13 or less, or 11 or less. In one embodiment, the number of extensions is 10.

タグメンテーションによって対称標的核酸を生成し、次いで、非対称標的核酸をもたらすために１つのアダプターを修飾する一実施形態で起こり得る構造の別の例を図３に示す。例示的な修飾標的核酸３３を、標的核酸３０及び対称アダプター３２とともに図３Ａに示す。アダプターは、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含み得る。この例示的な実施形態では、対称アダプター３２は、ＤＮＡ損傷（Ｕで示す）、ギャップ３４、及びトランスポザーゼ認識ドメイン３５などのユニバーサル配列を含む。損傷不耐性ポリメラーゼによる修飾標的核酸３３の伸長は、ギャップ３４の３’末端から始まり、ＤＮＡ損傷Ｕで停止し、変性後得られた非対称標的核酸３６を図３Ｂに示す。非対称標的核酸３６は、一方の末端にＤＮＡ損傷とともに対称アダプター３２の鎖を含む。他端に、非対称標的核酸３６は、非対称アダプター３７、例えば、ギャップとＤＮＡ損傷との間に位置した対称アダプター配列の一部を含む。図３Ｃは、別のアダプターを含むために非対称標的核酸３６を更に修飾する例示的な実施形態も示す。２ドメインプライマー３８は、非対称アダプター３７にアニーリングする１つのドメイン３９と、異なるアダプター４０を含む第２のドメインと、を含む。この例示的な実施形態では、プライマー３８の３’末端で開始された伸長を低減するために、ブロック（＊）が含まれる。図３Ｃにおいて点線で示される、任意選択的に損傷不耐性ポリメラーゼによる伸長は、非対称標的核酸３６の３’末端から始まり、異なるアダプター４０を追加し、図３Ｄに示すように非対称標的核酸４１をもたらす。 Another example of a possible structure in one embodiment in which a symmetric target nucleic acid is generated by tagmentation and then one adapter is modified to yield an asymmetric target nucleic acid is shown in Figure 3. An exemplary modified target nucleic acid 33 is shown in Figure 3A along with a target nucleic acid 30 and a symmetric adapter 32. The adapter can include one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof. In this exemplary embodiment, the symmetric adapter 32 includes a DNA lesion (denoted U), a gap 34, and a universal sequence, such as a transposase recognition domain 35. Extension of the modified target nucleic acid 33 with a damage-intolerant polymerase begins at the 3' end of the gap 34 and terminates at the DNA lesion U. After denaturation, the resulting asymmetric target nucleic acid 36 is shown in Figure 3B. The asymmetric target nucleic acid 36 includes a strand of the symmetric adapter 32 along with the DNA lesion at one end. At the other end, the asymmetric target nucleic acid 36 includes an asymmetric adapter 37, e.g., a portion of the symmetric adapter sequence located between the gap and the DNA lesion. Figure 3C also shows an exemplary embodiment in which the asymmetric target nucleic acid 36 is further modified to include another adapter. The two-domain primer 38 includes one domain 39 that anneals to the asymmetric adapter 37 and a second domain that includes a different adapter 40. In this exemplary embodiment, a block (*) is included to reduce extension initiated at the 3' end of the primer 38. Extension, optionally with a damage-intolerant polymerase, shown by the dotted line in Figure 3C, begins at the 3' end of the asymmetric target nucleic acid 36 and adds the different adapter 40, resulting in the asymmetric target nucleic acid 41, as shown in Figure 3D.

タグメンテーションによって対称標的核酸を生成し、次いで、非対称標的核酸をもたらすために１つのアダプターを修飾する一実施形態で起こり得る構造の別の例を図４に示す。２つのトランスポザーゼ及びトランスポゾンの例示的なトランスポソーム複合体４１は、アダプターを４２を含む（図４Ａ）。各アダプターは、プライマー（Ｐ５）、インデックス（ｉ５）、ユニバーサルアンカー配列（Ａ１４）、ＤＮＡ損傷ウラシル（Ｕ）、トランスポザーゼ認識配列（ＭＥ）、及びトランスポザーゼ認識配列（ＭＥ’）の相補体を含む。アダプターはまた、一方の鎖の５’末端に結合した任意選択の捕捉剤（Ｂ）及び任意選択の切断可能なリンカー（ＣＬ）、並びに他の鎖の３’末端に結合した任意選択のブロッキングジデオキシヌクレオチド（ｄｄＣ）を含む。いくつかの実施形態では、捕捉剤－切断可能なリンカー及びブロッキング基の配置が切り替えられる。図４Ｂは、トランスポザーゼと依然として複合体を形成している、タグ付き及び断片化された核酸を示す。簡単にするために、二量体の描写を図４Ａに示すが、図４Ｂには示していない。図４Ｃは、トランスポザーゼの除去後、及びＤＮＡ損傷不耐性ポリメラーゼを用いたギャップ充填後の構造を示す。図４Ｄは、２ドメインプライマー４３を用いそれにハイブリダイズされた、図４Ｃの上鎖を示す。２ドメインプライマー４３は、相補的ＭＥ’にアニーリングする１つのドメインＭＥと、異なるアダプター配列Ｂ１５、ｉ７、及びＰ７を含む第２のドメインと、を含む。図４Ｅは、２ドメインプライマー配列に基づく上鎖の伸長の結果を示す。図４Ｆは、プライマー除去後のタグ付きライブラリー断片を示す。図４Ｄにおいて点線で示される伸長は、ＭＥ’の３’末端から始まり、異なるアダプター４３を付加し、図４Ｆに示す非対称標的核酸をもたらす。 Another example of a possible structure in one embodiment in which a symmetric target nucleic acid is generated by tagmentation and then one adapter is modified to yield an asymmetric target nucleic acid is shown in Figure 4. An exemplary transposome complex 41 of two transposases and a transposon includes adapters 42 (Figure 4A). Each adapter includes a primer (P5), an index (i5), a universal anchor sequence (A14), the DNA damage uracil (U), a transposase recognition sequence (ME), and the complement of the transposase recognition sequence (ME'). The adapter also includes an optional capture agent (B) and an optional cleavable linker (CL) attached to the 5' end of one strand and an optional blocking dideoxynucleotide (ddC) attached to the 3' end of the other strand. In some embodiments, the configuration of the capture agent-cleavable linker and blocking group is switched. Figure 4B shows the tagged and fragmented nucleic acid still complexed with the transposase. For simplicity, a depiction of the dimer is shown in Figure 4A but not in Figure 4B. Figure 4C shows the structure after transposase removal and gap filling using a DNA damage-intolerant polymerase. Figure 4D shows the top strand of Figure 4C hybridized to it using a two-domain primer 43. The two-domain primer 43 contains one domain ME that anneals to a complementary ME' and a second domain containing different adapter sequences B15, i7, and P7. Figure 4E shows the results of extension of the top strand based on the two-domain primer sequence. Figure 4F shows the tagged library fragments after primer removal. The extension, indicated by the dotted line in Figure 4D, begins at the 3' end of ME' and adds a different adapter 43, resulting in the asymmetric target nucleic acid shown in Figure 4F.

非対称標的核酸のライブラリーは、ＤＮＡ損傷を除去するための条件に曝露され、任意選択的に、非対称標的核酸の一方又は両方の末端に１つ以上のアダプターを付加し、結果として、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ更なる以上のＵＭＩ配列、又はそれらの組み合わせにより一方又は両方の末端を更に修飾し得る。一実施形態では、ＤＮＡ損傷を除去するための条件は、損傷耐性ＤＮＡポリメラーゼによる伸長を含む。好適な損傷耐性ＤＮＡポリメラーゼの例を表１に示す。損傷耐性ＤＮＡポリメラーゼは、ＤＮＡ損傷を介して読み取る任意のタイプの伸長反応で使用することができ、得られた合成された鎖は、ＤＮＡ損傷をもはや含まない。一実施形態では、ＤＮＡ損傷を除去するための条件は、修復システムを含む。ＤＮＡ修復システムには、ＤＮＡ損傷を固定又は修復するための酵素及びメカニズムが含まれ、除去修復システム及びＤＮＡ修復システムが含まれるが、これらに限定されない。ＤＮＡ修復システムは、当該技術分野において既知である（Ｃｈａｕｄｈｕｒｉｅｔａｌ．，ＮａｔｕｒｅＲｅｖｉｅｗｓＭｏｌｅｃｕｌａｒＣｅｌｌＢｉｏｌｏｇｙ，２０１７，１８：６１０－６２１）。ＤＮＡ修復システムの使用後、非対称標的核酸のライブラリーは、伸長反応を含む条件に曝露される。 The library of asymmetric target nucleic acids is exposed to conditions to remove DNA damage, optionally adding one or more adapters to one or both ends of the asymmetric target nucleic acids, which may result in one or both ends being further modified with one or more universal sequences, one or more index sequences, one or more additional UMI sequences, or combinations thereof. In one embodiment, the conditions to remove DNA damage include extension with a damage-tolerant DNA polymerase. Examples of suitable damage-tolerant DNA polymerases are listed in Table 1. Damage-tolerant DNA polymerases can be used in any type of extension reaction that reads through DNA damage, and the resulting synthesized strands no longer contain DNA damage. In one embodiment, the conditions to remove DNA damage include a repair system. DNA repair systems include enzymes and mechanisms for fixing or repairing DNA damage, including, but not limited to, excision repair systems and DNA repair systems. DNA repair systems are known in the art (Chaudhuri et al., Nature Reviews Molecular Cell Biology, 2017, 18:610-621). After using the DNA repair system, the library of asymmetric target nucleic acids is exposed to conditions that include an extension reaction.

一実施形態では、伸長は、非対称標的核酸の数を実質的に増加させる方法による。一実施形態では、方法は、ポリメラーゼ連鎖反応（ＰＣＲ）及びローリングサークル増幅（ＲＣＡ）を含むがこれらに限定されない増幅であり得る。 In one embodiment, the extension is by a method that substantially increases the number of asymmetric target nucleic acids. In one embodiment, the method can be amplification, including, but not limited to, polymerase chain reaction (PCR) and rolling circle amplification (RCA).

一実施形態では、本方法は、ビーズ又はウェルの表面などの表面に結合したトランスポソーム複合体の使用を含む。典型的には、そのような実施形態では、トランスポゾンの鎖のうちの１つは、ビオチンなどの捕捉剤を含む。捕捉剤の使用は、非対称標的核酸を生成するための工程を有利に低減する方法を可能にする。例えば、捕捉剤及び任意の切断可能なリンカーは、一方の鎖（例えば、プライマーＰ５、インデックスｉ５、ユニバーサルアンカー配列Ａ１４、ＤＮＡ損傷ウラシルＵ、及びトランスポザーゼ認識配列ＭＥを含む図４におけるアダプター４２の鎖）の５’末端に結合することができる。表面結合トランスポソーム複合体を使用したタグメンテーション後、ＤＮＡ損傷不耐性ポリメラーゼ、ｄＮＴＰ、及び図４Ｄにおける２ドメインプライマー４３などの２ドメインプライマーを付加し得る。変性条件、例えば熱に曝露されると、トランスポザーゼ認識配列ＭＥ’の相補体が除去される。例えば、図４Ｂに示すＭＥ’は、もはやハイブリダイズしない。ポリメラーゼは、ＭＥを鋳型として使用して標的核酸の３’コピーを伸長し、ＭＥ’が標的核酸の３’末端に結合し、ＤＮＡ損傷で停止する。別の変性工程に続いて、２ドメインプライマー４３は、標的核酸の３’末端に結合したＭＥ’にアニーリングする。伸長は、鋳型として２ドメインプライマーを使用してＭＥ’で開始され、非対称標的核酸をもたらす。次いで、非対称標的核酸を固体表面から除去することができる。 In one embodiment, the method involves the use of a transposome complex bound to a surface, such as a bead or well surface. Typically, in such embodiments, one of the strands of the transposon contains a capture agent, such as biotin. The use of a capture agent advantageously reduces the number of steps required to generate an asymmetric target nucleic acid. For example, a capture agent and an optional cleavable linker can be attached to the 5' end of one strand (e.g., the strand of adapter 42 in Figure 4, which contains primer P5, index i5, universal anchor sequence A14, the DNA damage uracil U, and the transposase recognition sequence ME). After tagmentation using the surface-bound transposome complex, a DNA damage-intolerant polymerase, dNTPs, and a two-domain primer, such as two-domain primer 43 in Figure 4D, can be added. Upon exposure to denaturing conditions, such as heat, the complement of the transposase recognition sequence ME' is removed. For example, ME' shown in Figure 4B no longer hybridizes. The polymerase extends the 3' copy of the target nucleic acid using the ME as a template, with the ME' binding to the 3' end of the target nucleic acid and terminating at the DNA damage. Following another denaturation step, the two-domain primer 43 anneals to the ME' bound to the 3' end of the target nucleic acid. Extension is initiated at the ME' using the two-domain primer as a template, resulting in an asymmetric target nucleic acid. The asymmetric target nucleic acid can then be removed from the solid surface.

ビーズ又はウェルの表面などの表面に結合したトランスポソーム複合体を使用する別の実施形態では、捕捉剤及び任意の切断可能なリンカーは、他の鎖（例えば、トランスポザーゼ認識配列の相補体ＭＥ’を含む図４におけるアダプター４２の鎖）の３’末端に結合することができる。表面結合トランスポソーム複合体を使用したタグメンテーション後、ＤＮＡ損傷不耐性ポリメラーゼ、ｄＮＴＰ、及び図４Ｄにおける２ドメインプライマー４３などの２ドメインプライマーを付加し得る。変性条件、例えば熱に曝露されると、トランスポゾンの他の鎖及び結合した標的核酸が溶液に放出される。２ドメインプライマー４３は、標的核酸の３’末端に結合したＭＥ’にアニーリングされ得る。伸長は、鋳型として２ドメインプライマーを使用してＭＥ’で開始され、非対称標的核酸をもたらす。次いで、非対称標的核酸を固体表面から除去することができる。 In another embodiment using a transposome complex bound to a surface, such as the surface of a bead or well, the capture agent and optional cleavable linker can be attached to the 3' end of the other strand (e.g., the strand of adapter 42 in Figure 4, which contains the complement of the transposase recognition sequence ME'). After tagmentation using the surface-bound transposome complex, a DNA damage-intolerant polymerase, dNTPs, and a two-domain primer, such as two-domain primer 43 in Figure 4D, can be added. Upon exposure to denaturing conditions, such as heat, the other strand of the transposon and the bound target nucleic acid are released into solution. The two-domain primer 43 can anneal to the ME' attached to the 3' end of the target nucleic acid. Extension is initiated at the ME' using the two-domain primer as a template, resulting in an asymmetric target nucleic acid. The asymmetric target nucleic acid can then be removed from the solid surface.

インデックス配列
いくつかの実施形態では、シークエンシングの工程中に標的核酸の供給源を特定することが有用であり得る。これが有用である場合の例は、当業者に容易に明らかであり、異なる供給源（例えば、異なる対象、試料、組織、又は細胞型）からの複数のライブラリーの同時分析を含むが、これらに限定されない。標的核酸の供給源の特定は、例えば、標的核酸のサブセットを複数の区画に分配し、各区画において標的核酸を一意に標識し（典型的には、固有のインデックス配列を含むアダプターを付加するように修飾することによって）、次いでサブセットをプールすることによる区画化の使用を介して達成され得る。例えば、単一細胞コンビナトリアルインデックス付け（「ｓｃｉ－」）方法は、典型的には、スプリット－プール標識を使用する。したがって、いくつかの実施形態では、標的核酸の各々に結合したインデックスは、特定の区画に存在し、及びこのインデックスの存在は、この方法のこの段階で核又は細胞の集団が存在している区画を示しているか、又は識別するために使用される。区画化とも呼ばれる、区画へのインデックス及び核酸の分布の使用は、本明細書に記載されている。 Index Sequences In some embodiments, it may be useful to identify the source of target nucleic acids during the sequencing process. Examples of when this would be useful will be readily apparent to one of skill in the art and include, but are not limited to, the simultaneous analysis of multiple libraries from different sources (e.g., different subjects, samples, tissues, or cell types). Identifying the source of target nucleic acids can be achieved, for example, through the use of compartmentalization by distributing subsets of target nucleic acids into multiple compartments, uniquely labeling the target nucleic acids in each compartment (typically by modifying them with adapters containing unique index sequences), and then pooling the subsets. For example, single-cell combinatorial indexing ("sci-") methods typically use split-pool labeling. Thus, in some embodiments, an index bound to each of the target nucleic acids is present in a particular compartment, and the presence of this index is used to indicate or identify the compartment in which a population of nuclei or cells resides at this stage of the method. The use of indexes and distribution of nucleic acids into compartments, also referred to as compartmentalization, is described herein.

本明細書で使用するインデックス配列は、長さが任意の好適な数、例えば、１、２、３、４、５、６、７、８、９、１０、１１、１２、１３、１４、１５、１６、１７、１８、１９、２０以上のヌクレオチドの好適な配列であり得る。４つのヌクレオチドタグは、２５６個の試料を多重化する可能性をもたらし、６つの塩基タグは、４０９６個の試料の処理を可能にする。いくつかの実施形態では、インデックスは、特定の区画内の核酸を標識するために使用される。 As used herein, an index sequence can be any suitable sequence of any suitable number of nucleotides in length, for example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more. A four-nucleotide tag provides the possibility of multiplexing 256 samples, while a six-base tag allows for the processing of 4096 samples. In some embodiments, the index is used to label nucleic acids within a specific compartment.

本明細書に記載されるように、インデックスを付加するための非対称標的核酸の修飾は、対称標的核酸の生成中に達成され得る。例えば、インデックスは、対称アダプターに含まれ得る。追加のインデックスを、その後の工程で非対称標的核酸のいずれかの末端に選択的に付加することができる。 As described herein, modification of an asymmetric target nucleic acid to add an index can be achieved during the generation of a symmetric target nucleic acid. For example, an index can be included in a symmetric adapter. Additional indexes can be selectively added to either end of the asymmetric target nucleic acid in a subsequent step.

インデックスを付加することによる非対称標的核酸を修飾するための方法には、プライマーによる直接包含、伸長、転位、又はライゲーションが含まれるが、これらに限定されない。伸長の例としては、プライマーのハイブリダイゼーション、逆転写酵素を使用する伸長、及び増幅が挙げられるが、これらに限定されない。非対称標的核酸の一端又は両末端に付加されるヌクレオチド配列はまた、１つ以上のユニバーサル配列及び／又はＵＭＩを含み得る。ユニバーサル配列は、例えば、別のインデックス、ユニバーサル配列、及び／又はＵＭＩなどの別のヌクレオチド配列を非対称標的核酸に付加するためのプライマーとして使用され得るヌクレオチド配列をアニーリングするための後続の工程において「ランディングパッド」として使用され得る。したがって、インデックス配列の組み込みは、伸長（ハイブリダイゼーション、逆転写酵素、及び／又は増幅を含む）、ライゲーション、又は転位の本質的に任意の組み合わせを使用して、１、２、又はそれ以上の工程を含むプロセスを使用することができる。 Methods for modifying an asymmetric target nucleic acid by adding an index include, but are not limited to, direct inclusion by a primer, extension, transposition, or ligation. Examples of extension include, but are not limited to, primer hybridization, extension using reverse transcriptase, and amplification. The nucleotide sequence added to one or both ends of the asymmetric target nucleic acid may also include one or more universal sequences and/or UMIs. A universal sequence may be used as a "landing pad" in a subsequent step to anneal a nucleotide sequence that may be used as a primer for adding another nucleotide sequence, such as another index, universal sequence, and/or UMI, to the asymmetric target nucleic acid. Thus, incorporation of an index sequence can be a one-, two-, or more-step process using essentially any combination of extension (including hybridization, reverse transcriptase, and/or amplification), ligation, or transposition.

いくつかの実施形態では、インデックスの組み込みは、１、２、３、又はそれ以上のラウンドのスプリット及びプールのインデックス付けで生じ、インデックス付き単一細胞ライブラリーなどの単一、デュアル、トリプル、又は複数の（例えば、４以上の）インデックス付きライブラリーをもたらす。 In some embodiments, incorporation of indexes occurs over one, two, three, or more rounds of split and pool indexing, resulting in single, dual, triple, or multiple (e.g., four or more) indexed libraries, such as indexed single-cell libraries.

本方法は、単離された核又は細胞などの標的核酸の集団（本明細書ではプールとも呼ばれる）をサブセットにスプリットする、複数の分配工程を含み得る。以下は単離された核又は細胞に関して論じられているが、当業者は、「スプリット及びプール」工程が標的核酸の任意の集団に適用され得ることを理解するであろう。典型的には、単離された核又は細胞のサブセット、例えば、複数の区画に存在するサブセットを、区画特異的インデックスでインデックス付けし、次いでプールする。標的核酸のこの区画化は、インデックスが追加される任意の段階で起こり得る。例えば、標的核酸は、対称アダプター及び／又は別のアダプターが付加されるときに区画に存在し得る。したがって、本方法は、典型的には、プールされた単離された核又は細胞を得て、それらを分配し、区画特異的インデックスを付加するという、少なくとも１つの「スプリット及びプール」工程を含み、「スプリット及びプール」工程の数は、核酸断片に付加される異なるインデックスの数に依存し得る。インデックス付け前の核又は細胞の各初期サブセットは、他のサブセットとは異なり、一意であり得る。インデックス付け後、十分な数のインデックスが標的核酸に付加されるまで、必要に応じて、サブセットをインデックス付け後プールし、サブセットにスプリットし、インデックス付けし、再度プールすることができる。このプロセスは、各単一の細胞又は核に固有のインデックス又はインデックスの組み合わせを割り当てる。インデックス付けの完了後、例えば、１つ、２つ、３つ、又はそれ以上のインデックスの付加後、単離された核又は細胞を溶解することができる。いくつかの実施形態では、インデックスの付加及び溶解は同時に生じ得る。 The method may include multiple partitioning steps to split a population of target nucleic acids, such as isolated nuclei or cells (also referred to herein as a pool), into subsets. While the following is discussed with respect to isolated nuclei or cells, those skilled in the art will understand that the "split and pool" step can be applied to any population of target nucleic acids. Typically, subsets of isolated nuclei or cells, e.g., subsets present in multiple compartments, are indexed with compartment-specific indexes and then pooled. This compartmentalization of target nucleic acids can occur at any stage at which indexes are added. For example, target nucleic acids may be present in compartments when symmetric adapters and/or additional adapters are added. Thus, the method typically includes at least one "split and pool" step of obtaining pooled isolated nuclei or cells, partitioning them, and adding compartment-specific indexes. The number of "split and pool" steps may depend on the number of different indexes added to the nucleic acid fragments. Each initial subset of nuclei or cells prior to indexing may be unique and distinct from other subsets. After indexing, subsets can be pooled, split into subsets, indexed, and pooled again as needed until a sufficient number of indexes have been added to the target nucleic acid. This process assigns a unique index or combination of indexes to each single cell or nucleus. After indexing is complete, e.g., after addition of one, two, three, or more indexes, the isolated nuclei or cells can be lysed. In some embodiments, index addition and lysis can occur simultaneously.

サブセット、したがって各区画内に存在する核又は細胞の数は、少なくとも１であり得る。一実施形態では、サブセット内に存在する核又は細胞の数は、１００，０００，０００以下、１０，０００，０００以下、１，０００，０００以下、１００，０００以下、１０，０００以下、４，０００以下、３，０００以下、２，０００以下、１，０００以下、５００以下、又は５０以下である。一実施形態では、サブセット内に存在する核又は細胞の数は、１～１，０００、１，０００～１０，０００、１０，０００～１００，０００、１００，０００～１，０００，０００、１，０００，０００～１０，０００，０００、又は１０，０００，０００～１００，０００，０００であり得る。一実施形態では、各サブセット内に存在する核又は細胞の数はほぼ等しい。サブセット内、したがって各区画内に存在する核の数は、インデックスの衝突を減らしたいという要望に部分的に基づいており、衝突とは、本方法のこの工程において同じ区画内で終わる同じインデックスの組み合わせを有する２つの核又は細胞の存在である。核又は細胞をサブセットに分配するための方法は、当業者に既知であり、日常的であり、蛍光活性化細胞選別（fluorescence-activated cell sorting、ＦＡＣＳ）単純希釈を含む。 The number of nuclei or cells present in a subset, and therefore in each compartment, may be at least 1. In one embodiment, the number of nuclei or cells present in a subset is 100,000,000 or less, 10,000,000 or less, 1,000,000 or less, 100,000 or less, 10,000 or less, 4,000 or less, 3,000 or less, 2,000 or less, 1,000 or less, 500 or less, or 50 or less. In one embodiment, the number of nuclei or cells present in a subset may be 1 to 1,000, 1,000 to 10,000, 10,000 to 100,000, 100,000 to 1,000,000, 1,000,000 to 10,000,000, or 10,000,000 to 100,000,000. In one embodiment, the number of nuclei or cells present in each subset is approximately equal. The number of nuclei present in a subset, and therefore in each compartment, is based in part on the desire to reduce index collisions, where a collision is the presence of two nuclei or cells with the same index combination that end up in the same compartment at this step of the method. Methods for distributing nuclei or cells into subsets are known and routine to those skilled in the art and include fluorescence-activated cell sorting (FACS) and simple dilution.

分配工程（及び後続のインデックスの付加）における区画の数は、使用するフォーマットに依存し得る。例えば、区画の数は、２～９６区画（９６ウェルプレートを使用する場合）、２～３８４区画（３８４ウェルプレートを使用する場合）、又は２～１５３６区画（１５３６ウェルプレートを使用する場合）であり得る。一実施形態では、区画の数は、５０００以上（ＴａｋａｒａＢｉｏｓｃｉｅｎｃｅｓ、ｉｃｅｌｌ８システム）である。一実施形態では、複数のプレートを使用することができる。一実施形態では、各区画は液滴であり得る。使用される区画の種類が、２つ以上の核又は細胞を含有する液滴又はウェルである場合、少なくとも１０，０００、少なくとも１００，０００、少なくとも１，０００，０００、又は少なくとも１０，０００，０００の液滴など、任意の数の液滴又はウェルを使用することができる。単離された核又は細胞のサブセットは、典型的には、プール前に区画内でインデックス付けされる。 The number of compartments in the partitioning step (and subsequent indexing) can depend on the format used. For example, the number of compartments can be 2 to 96 compartments (when using a 96-well plate), 2 to 384 compartments (when using a 384-well plate), or 2 to 1536 compartments (when using a 1536-well plate). In one embodiment, the number of compartments is 5000 or more (Takara Biosciences, icell8 system). In one embodiment, multiple plates can be used. In one embodiment, each compartment can be a droplet. When the type of compartment used is a droplet or well containing two or more nuclei or cells, any number of droplets or wells can be used, such as at least 10,000, at least 100,000, at least 1,000,000, or at least 10,000,000 droplets. Isolated nuclei or cell subsets are typically indexed within compartments prior to pooling.

図５は、本開示による、単一細胞コンビナトリアルインデックス付けのための一般的な例示的方法の一般的なブロック図を示す。この方法は、単離された核又は細胞を提供すること（図５、ブロック５０）及び単離された核又は細胞を複数の区画に分配すること（図５、ブロック５１）を含む。ブロック４０はＤＮＡを指し、当業者は、ＤＮＡが、例えば、ゲノムＤＮＡ又はＲＮＡ由来のＤＮＡであり得ることを認識するであろう。この方法の実施形態では、単離された核又は細胞は、対称アダプターを付加することによって区画特異的インデックスでインデックス付けされ（図５、ブロック５２）、次いでプールされる（図５、ブロック５３）。したがって、本方法は、典型的には、プールされた単離された核又は細胞を得て、それらを分配し、区画特異的インデックスを付加するという、少なくとも１つの「スプリット及びプール」工程を含み、「スプリット及びプール」工程の数は、標的核酸に付加される、異なるインデックスの数に依存し得る。第２のインデックスに非対称アダプターを付加する場合、プールされた単離された核又は細胞は、第２の複数の区画に分配され（図５、ブロック５３）、非対称アダプターを付加することによって、区画特異的インデックスでインデックス付けされる（図５、ブロック５４）。任意選択的に、非対称標的核酸は、次いで増幅され得る（図５、ブロック５５）。非対称標的核酸の増幅は、インデックス配列、ＵＭＩ配列、及び／又はユニバーサル配列を含むがこれらに限定されない、一方又は両方の末端への他の有用な配列の付加を含むことができ、更なるスプリット及びプールインデックス付けと組み合わせることができる。 FIG. 5 shows a general block diagram of a general exemplary method for single-cell combinatorial indexing according to the present disclosure. The method includes providing isolated nuclei or cells ( FIG. 5 , block 50) and distributing the isolated nuclei or cells into multiple compartments ( FIG. 5 , block 51). Block 40 refers to DNA, and those skilled in the art will recognize that the DNA can be, for example, genomic DNA or DNA derived from RNA. In an embodiment of this method, the isolated nuclei or cells are indexed with compartment-specific indexes by adding symmetric adapters ( FIG. 5 , block 52) and then pooled ( FIG. 5 , block 53). Thus, the method typically includes at least one "split and pool" step of obtaining pooled isolated nuclei or cells, distributing them, and adding compartment-specific indexes; the number of "split and pool" steps may depend on the number of different indexes added to the target nucleic acid. If asymmetric adapters are added to the second index, the pooled isolated nuclei or cells are distributed into a second plurality of compartments ( FIG. 5 , block 53) and indexed with compartment-specific indexes by adding asymmetric adapters ( FIG. 5 , block 54). Optionally, the asymmetric target nucleic acids can then be amplified ( FIG. 5 , block 55). Amplification of the asymmetric target nucleic acids can include the addition of other useful sequences to one or both ends, including, but not limited to, index sequences, UMI sequences, and/or universal sequences, and can be combined with further split-and-pool indexing.

得られたインデックス付き標的核酸は、シークエンシングされ得る核酸のライブラリーを集合的に提供する。本明細書においてシークエンシングライブラリーとも呼ばれるライブラリーという用語は、３’末端及び５’末端に既知のユニバーサル配列を含む修飾核酸のコレクションを指す。 The resulting indexed target nucleic acids collectively provide a library of nucleic acids that can be sequenced. The term library, also referred to herein as a sequencing library, refers to a collection of modified nucleic acids containing known universal sequences at the 3' and 5' ends.

用途
本開示により提供される方法は、全ゲノム、トランスクリプトーム、メチル化、アクセス可能（例えば、ＡＴＡＣ）、及び立体構造状態（例えば、ＨｉＣ）などシークエンシングライブラリーの調製を含む、本質的に任意の用途に容易に組み込むことができる。これは、これらに限定されないが、ｓｃｉ－ＷＧＳ－ｓｅｑ、ｓｃｉ－ＭＥＴ－ｓｅｑ、ｓｃｉ－ＡＴＡＣ－ｓｅｑ、及びｓｃｉ－ＲＮＡ－ｓｅｑなどの単一細胞コンビナトリアルインデックス付け（ｓｃｉ）方法などの、高ライブラリー変換を必要とする、本質的に任意の用途において、特に有用であり得る。シークエンシングライブラリー生成を、各側に異なるユニバーサル配列を有する（例えば、非対称）標的核酸の生成に集中させる代わりに、本開示によって提供される方法をシークエンシングライブラリー生成に統合することは、各側に同じユニバーサル配列を有する（例えば、対称）より効率的な標的核酸の生成を含む。対称断片の生成時に、対称断片を非対称断片に変換するための本明細書に記載の方法を適用することができる。全ゲノム又は標的ライブラリーの構築に使用することができる、多数のシークエンシングライブラリー法が当業者に知られている（例えば、ｇｅｎｏｍｉｃｓ．ｕｍｎ．ｅｄｕ／ｄｏｗｎｌｏａｄｓ／ｓｅｑｕｅｎｃｉｎｇ－ｍｅｔｈｏｄｓ－ｒｅｖｉｅｗ．ｐｄｆで入手可能な「ＳｅｑｕｅｎｃｉｎｇＭｅｔｈｏｄｓＲｅｖｉｅｗ」を参照）。 Applications The methods provided by the present disclosure can be readily incorporated into essentially any application, including the preparation of whole genome, transcriptome, methylation, accessible (e.g., ATAC), and conformational state (e.g., HiC) sequencing libraries. This can be particularly useful in essentially any application requiring high library conversion, including single-cell combinatorial indexing (sci) methods such as, but not limited to, sci-WGS-seq, sci-MET-seq, sci-ATAC-seq, and sci-RNA-seq. Instead of focusing sequencing library generation on the generation of target nucleic acids with different universal sequences on each side (e.g., asymmetric), integrating the methods provided by the present disclosure into sequencing library generation involves the more efficient generation of target nucleic acids with the same universal sequence on each side (e.g., symmetric). Upon generation of symmetric fragments, the methods described herein for converting symmetric fragments to asymmetric fragments can be applied. Numerous sequencing library methods are known to those of skill in the art that can be used to construct whole genome or targeted libraries (see, for example, "Sequencing Methods Review," available at genomics.umn.edu/downloads/sequencing-methods-review.pdf).

いくつかの実施形態では、この適用は、全ゲノム又は標的化シークエンシングである。一般に、組織、個々の細胞、又は個々の核は、本明細書に記載されるように処理されて、対称標的核酸をもたらす。（実施例２を参照）。いくつかの実施形態では、個々の細胞又は個々の核を処理して、ゲノムＤＮＡからヌクレオソームを結合解除することができる（国際公開第２０１８／０１８００８号）。次いで、対称修飾標的核酸を本明細書に記載されるように処理して、非対称修飾標的核酸を生成することができる。例えば、図６に示すように、核酸は、核の完全性を維持するために固定され、ゲノムＤＮＡからヌクレオソームを除去してゲノム全体をアクセス可能にする条件に曝露し、次いで、対称標的核酸を生成するために例えばタグメンテーションによって挿入されたアダプターの１つの集団を有する。続いて、対称標的核酸を、本明細書に記載されるように非対称標的核酸に変換することができる。 In some embodiments, the application is whole genome or targeted sequencing. Generally, tissues, individual cells, or individual nuclei are processed as described herein to produce symmetric target nucleic acids. (See Example 2.) In some embodiments, individual cells or individual nuclei can be processed to unbind nucleosomes from genomic DNA (WO 2018/018008). The symmetrically modified target nucleic acids can then be processed as described herein to produce asymmetrically modified target nucleic acids. For example, as shown in FIG. 6, nucleic acids can be fixed to maintain nuclear integrity, exposed to conditions that remove nucleosomes from genomic DNA to make the entire genome accessible, and then have one population of adapters inserted, e.g., by tagmentation, to produce symmetric target nucleic acids. The symmetric target nucleic acids can then be converted to asymmetric target nucleic acids as described herein.

いくつかの実施形態では、用途は、アクセス可能なＤＮＡの識別のためのＡＴＡＣ－ｓｅｑ（シークエンシングを使用したトランスポザーゼアクセス可能クロマチンのアッセイ）などのアクセス可能なＤＮＡをプローブするためのものである。一般に、無傷のヌクレオソームを有する組織、個々の細胞、又は個々の核は、本明細書に記載されるように処理されて、対称標的核酸をもたらす（実施例２を参照）。次いで、対称修飾標的核酸を本明細書に記載されるように処理して、非対称修飾標的核酸を生成することができる。例えば、図７に示すように、結合ヌクレオソームを含むゲノムＤＮＡをタグメンテーションして、対称標的核酸を生成することができる。続いて、対称標的核酸を、本明細書に記載されるように非対称標的核酸に変換することができる。 In some embodiments, the application is for probing accessible DNA, such as ATAC-seq (Assay for Transposase-Accessible Chromatin Using Sequencing) for identification of accessible DNA. Generally, tissues, individual cells, or individual nuclei with intact nucleosomes are processed as described herein to yield symmetric target nucleic acids (see Example 2). The symmetrically modified target nucleic acids can then be processed as described herein to generate asymmetrically modified target nucleic acids. For example, as shown in FIG. 7, genomic DNA containing bound nucleosomes can be tagmented to generate symmetric target nucleic acids. The symmetric target nucleic acids can then be converted to asymmetric target nucleic acids as described herein.

いくつかの実施形態では、用途は、ｍＲＮＡなどのＲＮＡをシークエンシングするためである。ＲＮＡは、ＤＮＡに変換され、出発物質としてＤＮＡを使用する用途とは対照的に、ＤＮＡへのプロセッシング中にＲＮＡ分子の一方又は両方の末端にアダプターを付加することができる。これは、ＲＮＡの５’及び／又は３’プロファイリング又は全長ＲＮＡプロファイリングの選択肢を提供する。例えば、図８に示すように、例示的な一実施形態では、ｍＲＮＡ分子を、ユニバーサル配列及び鋳型スイッチプライマーを含むポリ－Ｔプライマーの存在下で逆転写酵素に供して、各端にアダプター（ＣＳ１で示す）を含む二本鎖ＤＮＡをもたらす（図８Ａ）。得られた二本鎖ＤＮＡのトランスポソーム複合体への曝露し（図８Ｂ）、対称アダプターを非対称アダプターに変換した後（図８Ｃ）、３つの異なる集団が結果として生じ得る（図８Ｄ）。１つの集団（３’末端）は、トランスポゾン配列が二本鎖ＤＮＡに挿入され、得られる標的核酸の他方の末端が、ｍＲＮＡの元の３’末端に対応する配列を含む場合に生じる可能性がある。第２の集団（ＲＮＡボディとして示す）は、トランスポゾン配列が二本鎖ＤＮＡ内の２つの位置に挿入されると、結果として生じ得る。第３の集団（５’末端として示す）は、トランスポゾン配列が二本鎖ＤＮＡに挿入され、結果として生じる標的核酸の他端が、ｍＲＮＡの元の５’末端に対応する配列を含む場合に生じ得る。 In some embodiments, the application is for sequencing RNA, such as mRNA. In contrast to applications where the RNA is converted to DNA and DNA is used as the starting material, adapters can be added to one or both ends of the RNA molecules during processing to DNA. This provides the option of 5' and/or 3' profiling of the RNA or full-length RNA profiling. For example, as shown in Figure 8, in one exemplary embodiment, mRNA molecules are subjected to reverse transcriptase in the presence of a poly-T primer containing a universal sequence and a template switch primer, resulting in double-stranded DNA containing adapters (designated CS1) at each end (Figure 8A). After exposing the resulting double-stranded DNA to a transposome complex (Figure 8B) and converting the symmetric adapters to asymmetric adapters (Figure 8C), three distinct populations can result (Figure 8D). One population (the 3' end) can arise when a transposon sequence is inserted into the double-stranded DNA and the other end of the resulting target nucleic acid contains a sequence corresponding to the original 3' end of the mRNA. The second population (denoted as RNA bodies) can result when the transposon sequence is inserted into two locations within the double-stranded DNA. The third population (denoted as 5' ends) can result when the transposon sequence is inserted into the double-stranded DNA and the other end of the resulting target nucleic acid contains a sequence corresponding to the original 5' end of the mRNA.

いくつかの実施形態では、用途は、メチル化シークエンシングである。メチル化又はヒドロキシメチル化状態の分析を可能にする広範囲な方法が文献、Ｂａｒｒｏｓ－Ｓｉｌｖａｅｔａｌ．，Ｇｅｎｅｓ（Ｂａｓｅｌ）．２０１８Ｓｅｐ；９（９）：４２９）に記載されているか、当業者に既知である。変換の化学的（例えば、亜硫酸水素ナトリウム又はホウ酸塩化学）又は酵素的方法は、ＢＳ－ｓｅｑ、ＴＡＢ－ｓｅｑ、ＲＲＢＳ－ｓｅｑ、ＭｅＤｉｐ－ｓｅｑ、メチルキャップ－ｓｅｑ、ＭＢＤ－ｓｅｑ、Ｎａｎｏｐｏｒｅ－ｓｅｑ、ｏｘＢＳ－ｓｅｑ、ＳｅｑＣａｐＥｐｉＣｐＧｉａｎｔ、ＢＳＡＳ、ＷＧＢＳ、及びｓｃｉ－ＭＥＴ（国際公開第２０１８／２２６７０８号）を含むがこれらに限定されない様々なメチル化シークエンシング法で使用することができる。 In some embodiments, the application is methylation sequencing. A wide range of methods that allow for analysis of methylation or hydroxymethylation status are described in the literature (Barros-Silva et al., Genes (Basel). 2018 Sep;9(9):429) or are known to those of skill in the art. Chemical (e.g., sodium bisulfite or borate chemistry) or enzymatic methods of conversion can be used with a variety of methylation sequencing methods, including, but not limited to, BS-seq, TAB-seq, RRBS-seq, MeDip-seq, methylcap-seq, MBD-seq, Nanopore-seq, oxBS-seq, SeqCap Epi CpGiant, BSAS, WGBS, and sci-MET (WO 2018/226708).

一実施形態では、用途は、タンパク質分析である。タンパク質は、細胞内若しくは表面結合されること、単離されること、又は生体試料中に存在することができる。様々な方法が当業者に利用可能である。タンパク質検出に多くの場合使用される一般的な方法は、抗体又はｆａｂ断片をオリゴヌクレオチドタグで標識することであり、抗体を目的のタンパク質と親和性結合させること、オリゴヌクレオチドタグを読み出し又は検出のために使用することである。オリゴヌクレオチドタグは、インデックス配列、ＵＭＩ、ユニバーサル配列、又はそれらの組み合わせを含み得る。 In one embodiment, the application is protein analysis. Proteins can be intracellular or surface-bound, isolated, or present in a biological sample. A variety of methods are available to those skilled in the art. A common method often used for protein detection is to label an antibody or Fab fragment with an oligonucleotide tag, allow the antibody to affinity bind to the protein of interest, and use the oligonucleotide tag for readout or detection. The oligonucleotide tag can include an index sequence, a UMI, a universal sequence, or a combination thereof.

いくつかの実施形態では、用途は、同時アッセイであり、２つ以上の異なる検体又は情報を同時に評価する。検体の例としては、ＤＮＡ、ＲＮＡ、及びタンパク質が挙げられるが、これらに限定されない。核酸は、異なる状態、例えば、エピジェネティック状態（ＡＴＡＣ、ｍｅＣ、５－ヒドロキシＭｅなど）、又は立体配座状態（例えば、ＨｉＣ、３Ｃ、クロマチン状態など）であり得る。例としては、ＤＮＡとＲＮＡ、ＤＮＡと／又はＲＮＡ及びエピジェネティック状態（ＡＴＡＣ、ｍｅＣ、５－ヒドロキシＭｅなど）、ＤＮＡと立体配座状態（例えば、ＨｉＣ、３Ｃ、クロマチン状態など）を分析するアッセイが挙げられる。 In some embodiments, the application is a simultaneous assay, evaluating two or more different analytes or information simultaneously. Examples of analytes include, but are not limited to, DNA, RNA, and protein. Nucleic acids can be in different states, such as epigenetic states (ATAC, meC, 5-hydroxyMe, etc.), or conformational states (e.g., HiC, 3C, chromatin state, etc.). Examples include assays that analyze DNA and RNA, DNA and/or RNA and epigenetic states (ATAC, meC, 5-hydroxyMe, etc.), and DNA and conformational states (e.g., HiC, 3C, chromatin state, etc.).

同時アッセイの例は、本明細書でＧＣＣ－ｓｅｑと称されるゲノム＋クロマチン立体配座シークエンシングのためのゲノムＤＮＡの調製である。ＧＣＣ－ｓｅｑは、全ゲノムシークエンシング及びクロマチン立体配座分析を組み合わせ、単一細胞若しくは単一核並びにスプリット及びプールインデックス付けと組み合わせたときに、日常的なＨｉ－Ｃタイプの方法よりも高い速度でクロマチン相互作用を捕捉する（実施例２を参照）。図９に例示されるようにゲノムＤＮＡは、例えば、固定、制限酵素による消化、近接ライゲーション、及びヌクレオソーム枯渇によって処理され、次いでアダプターが付加されて対称標的核酸をもたらす。任意選択的に、分子捕捉を使用することができる。対称修飾標的核酸は、本明細書に記載されるように処理することができる。 An example of a simultaneous assay is the preparation of genomic DNA for genome plus chromatin conformation sequencing, referred to herein as GCC-seq. GCC-seq combines whole-genome sequencing and chromatin conformation analysis, capturing chromatin interactions at a higher rate than routine Hi-C-type methods when combined with single-cell or single-nucleus and split-and-pool indexing (see Example 2). As illustrated in Figure 9, genomic DNA is processed, for example, by fixation, restriction enzyme digestion, proximity ligation, and nucleosome depletion, followed by adapter addition to yield symmetric target nucleic acids. Optionally, molecular capture can be used. Symmetrically modified target nucleic acids can be processed as described herein.

シークエンシングのための固定された試料の調製
インデックス付き標的核酸のライブラリーは、シークエンシングのために調製することができる。インデックス付き標的核酸を断片を基質に付加するための方法は、当技術分野において既知である。一実施形態では、インデックス付き断片は、インデックス付き断片に対する特異性を有する複数の捕捉オリゴヌクレオチドを使用して濃縮され、捕捉オリゴヌクレオチドは、フローセル又はビーズなどの固形基質の表面に固定化され得る。例えば、捕捉オリゴヌクレオチドは、ユニバーサル結合対の第１のメンバーを含むことができ、結合対の第２のメンバーは、固形基質の表面に固定化される。同様に、固定化された標的核酸を増幅するための方法としては、ブリッジ増幅及び結合平衡除外が挙げられるが、これらに限定されない。シークエンシングの前に固定化及び増幅する方法は、例えば、Ｂｉｇｎｅｌｌら（米国特許第８，０５３，１９２号）、Ｇｕｎｄｅｒｓｏｎら（国際公開第２０１６／１３０７０４号）、Ｓｈｅｎら（米国特許第８，８９５，２４９号）、及びＰｉｐｅｎｂｕｒｇら（米国特許第９，３０９，５０２号）に記載されている。 Preparation of Immobilized Samples for Sequencing A library of indexed target nucleic acids can be prepared for sequencing. Methods for attaching indexed target nucleic acid fragments to a substrate are known in the art. In one embodiment, the indexed fragments are enriched using multiple capture oligonucleotides specific for the indexed fragments, and the capture oligonucleotides can be immobilized on the surface of a solid substrate, such as a flow cell or beads. For example, the capture oligonucleotides can include a first member of a universal binding pair, and a second member of the binding pair is immobilized on the surface of the solid substrate. Similarly, methods for amplifying immobilized target nucleic acids include, but are not limited to, bridge amplification and binding equilibrium exclusion. Methods for immobilization and amplification prior to sequencing are described, for example, in Bignell et al. (U.S. Patent No. 8,053,192), Gunderson et al. (WO 2016/130704), Shen et al. (U.S. Patent No. 8,895,249), and Pipenberg et al. (U.S. Patent No. 9,309,502).

プールされた試料は、シークエンシングのために調製中に固定化され得る。シークエンシングは、単一分子のアレイとして実施することも、シークエンシングの前に増幅することもできる。増幅は、１つ以上の固定化プライマーを使用して実施することができる。固定化されたプライマーは、例えば、平面上、又はビーズのプール上のローンであり得る。ビーズのプールは、エマルジョンの各「区画」に単一のビーズを有するエマルジョン中に単離され得る。「区画」当たり１つの鋳型のみの濃度では、単一の鋳型のみが各ビーズ上で増幅される。 Pooled samples can be immobilized in preparation for sequencing. Sequencing can be performed as a single-molecule array or can be amplified prior to sequencing. Amplification can be performed using one or more immobilized primers. The immobilized primers can be, for example, on a flat surface or in a lawn on a pool of beads. The pool of beads can be isolated in an emulsion with a single bead in each "compartment" of the emulsion. At a concentration of only one template per "compartment," only a single template is amplified on each bead.

本明細書で使用するとき、用語「固相増幅」は、形成時に増幅生成物の全て又は一部が固体支持体上に固定されるように、固体支持体上又は固体支持体と関連して実施される任意の核酸増幅反応を指す。具体的には、この用語は、順方向及び逆方向増幅プライマーの一方又は両方が固体支持体上に固定されていることを除いて、標準溶液相増幅に類似した反応である固相ポリメラーゼ連鎖反応（固相ＰＣＲ）及び固相等温増幅を包含する。固相ＰＣＲは、一方のプライマーがビーズに固定され、もう一方が遊離溶液にあるエマルジョン、一方のプライマーが表面に固定され、もう一方が遊離溶液にある固相ゲルマトリックスでのコロニー形成などのシステムを包含する。 As used herein, the term "solid-phase amplification" refers to any nucleic acid amplification reaction performed on or in association with a solid support such that all or a portion of the amplification product is immobilized on the solid support during formation. Specifically, the term encompasses solid-phase polymerase chain reaction (solid-phase PCR) and solid-phase isothermal amplification, which are reactions similar to standard solution-phase amplification except that one or both of the forward and reverse amplification primers are immobilized on the solid support. Solid-phase PCR encompasses systems such as emulsions in which one primer is immobilized on a bead and the other is in free solution, and colonization in solid-phase gel matrices in which one primer is immobilized on a surface and the other is in free solution.

いくつかの実施形態では、固体支持体はパターン化された表面を含む。「パターン化された表面」は、固体支持体の露出層内又はその上の異なる領域の配置を指す。例えば、１つ以上の領域は、１つ以上の増幅プライマーが存在する特徴であり得る。この特徴は、増幅プライマーが存在しない間質領域によって分離され得る。いくつかの実施形態では、パターンは、行及び列にある特徴のｘ－ｙフォーマットであり得る。いくつかの実施形態では、パターンは、特徴及び／又は間質領域の反復配列であり得る。いくつかの実施態様では、パターンは、特徴及び／又は間質領域のランダム配列であり得る。本明細書に記載の方法及び組成物で使用することができる例示的なパターン化表面は、米国特許第８，７７８，８４８号、同第８，７７８，８４９号、及び同第９，０７９，１４８号、並びに米国特許出願公開第２０１４／０２４３２２４号に記載されている。 In some embodiments, the solid support comprises a patterned surface. "Patterned surface" refers to the arrangement of distinct regions within or on an exposed layer of a solid support. For example, one or more regions can be features in which one or more amplification primers are present. The features can be separated by interstitial regions in which no amplification primers are present. In some embodiments, the pattern can be an x-y format of features in rows and columns. In some embodiments, the pattern can be a repeating sequence of features and/or interstitial regions. In some embodiments, the pattern can be a random sequence of features and/or interstitial regions. Exemplary patterned surfaces that can be used in the methods and compositions described herein are described in U.S. Patent Nos. 8,778,848, 8,778,849, and 9,079,148, and U.S. Patent Application Publication No. 2014/0243224.

いくつかの実施形態では、固体支持体は、表面にウェル又は窪みのアレイを含む。これは、フォトリソグラフィ、スタンピング技術、成形技術、及びマイクロエッチング技術を含むがこれらに限定されない様々な技術を使用して、当該技術分野において一般的に知られているように製造することができる。当該技術分野において理解されるように、使用される技術は、アレイ基板の組成及び形状に依存する。 In some embodiments, the solid support comprises an array of wells or depressions on its surface, which can be fabricated as commonly known in the art using a variety of techniques, including, but not limited to, photolithography, stamping techniques, molding techniques, and microetching techniques. As understood in the art, the technique used will depend on the composition and shape of the array substrate.

パターン付き表面内の特徴は、ガラス、シリコン、プラスチック、又はポリ（Ｎ－（５－アジドアセトアミルペンチル）アクリルアミド－ｃｏ－アクリルアミド）（ＰＡＺＡＭ、例えば、それぞれ、参照によりその全体が本明細書に組み込まれる米国特許出願公開第２０１３／１８４７９６号、国際公開第２０１６／０６６５８６号、及び同第２０１５／００２８１３号参照）などのパターン化された共有結合ゲルを備えた他の適切な固体支持体上のウェルのアレイ（例えば、マイクロウェル又はナノウェル）のウェルである可能性がある。このプロセスは、シークエンシングのために使用されるゲルパッドを作成し、これは、多数のサイクルでシークエンシング動作にわたって安定であり得る。ポリマーをウェルに共有結合することは、様々な用途の間に、構造化基材の寿命全体にわたってゲルを構造化特徴部に維持するのに有用である。しかしながら、多くの実施形態では、ゲルは、ウェルに共有結合される必要はない。例えば、いくつかの条件では、構造化基質のどの部分にも共有結合されていないシランフリーのアクリルアミド（silane free acrylamide、ＳＦＡ、例えば、米国特許第８，５６３，４７７号を参照）をゲル材料として使用することができる。 The features within the patterned surface can be wells of an array of wells (e.g., microwells or nanowells) on glass, silicon, plastic, or other suitable solid support with a patterned covalently attached gel, such as poly(N-(5-azidoacetamylpentyl)acrylamide-co-acrylamide) (PAZAM; see, e.g., U.S. Patent Application Publication Nos. 2013/184796, WO 2016/066586, and WO 2015/002813, each of which is incorporated by reference in its entirety). This process creates a gel pad used for sequencing, which can be stable over many cycles of sequencing operations. Covalently attaching the polymer to the wells is useful for maintaining the gel in the structured features throughout the life of the structured substrate during various applications. However, in many embodiments, the gel need not be covalently attached to the wells. For example, in some circumstances, silane-free acrylamide (SFA, see, e.g., U.S. Patent No. 8,563,477), which is not covalently bonded to any part of the structured matrix, can be used as the gel material.

特定の別の実施形態では、構造化基材は、ウェル（例えば、マイクロウェル又はナノセル）を用いて固体支持材料をパターニングし、パターン化された支持体をゲル材料（例えば、ＰＡＺＡＭ、ＳＦＡ、又はその化学修飾された変異体）でコーティングすることによって作製することができ、ＳＦＡ（アジド－ＳＦＡ）のアジド化バージョンなど、及びゲルコーティングされた支持体を、例えば化学研磨又は機械研磨によって研磨し、それによって、ウェル内にゲルを保持するが、ウェル間の構造化基材の表面上の間質領域から実質的に全てのゲルを除去するか又は不活性化する。ゲル材料にプライマー核酸を付着させることができる。次いで、修飾標的核酸の溶液を研磨基材と接触させて、個々の修飾標的核酸が、ゲル材料に付着したプライマーとの相互作用を介して個々のウェルに播種されるようにすることができるが、ゲル材料が存在しないか不活性であるため、標的核酸は間質領域を占有しない。修飾標的核酸の増幅は、間質領域内のゲルの不在又は非活性が、増殖する核酸コロニーの外への移動を防止するため、ウェルに限定されるであろう。プロセスは、好都合に製造可能であり、スケール変更可能であり、従来のマイクロ又はナノ製造方法を利用する。 In certain other embodiments, a structured substrate can be created by patterning a solid support material with wells (e.g., microwells or nanocells), coating the patterned support with a gel material (e.g., PAZAM, SFA, or a chemically modified variant thereof), such as an azido-SFA version, and polishing the gel-coated support, e.g., by chemical or mechanical polishing, thereby retaining the gel within the wells but removing or inactivating substantially all of the gel from the interstitial regions on the surface of the structured substrate between the wells. Primer nucleic acids can then be attached to the gel material. A solution of modified target nucleic acids can then be contacted with the polished substrate, such that individual modified target nucleic acids are seeded into individual wells through interaction with primers attached to the gel material, but because the gel material is absent or inactive, the target nucleic acids do not occupy the interstitial regions. Amplification of the modified target nucleic acids will be confined to the wells because the absence or inactivity of gel within the interstitial regions prevents outward migration of growing nucleic acid colonies. The process is conveniently manufacturable and scalable, utilizing conventional micro- or nano-fabrication methods.

本開示は、１つの増幅プライマーのみが固定化される「固相」増幅法（他のプライマーは通常は遊離溶液中に存在する）を包含するが、一実施形態では、固体支持体は、固定化された順方向及び逆方向プライマーの両方とともに提供される。実際には、増幅プロセスは増幅を維持するために過剰なプライマーを必要とするため、「複数」の同一の順方向プライマー及び／又は固体支持体上に固定化された「複数」の同一の逆方向プライマーが存在するであろう。本明細書における順方向及び逆方向プライマーへの言及は、文脈が別段の指示をしない限り、「複数の」そのようなプライマーを包含するものとして解釈されるべきである。 While the present disclosure encompasses "solid-phase" amplification methods in which only one amplification primer is immobilized (the other primer is typically in free solution), in one embodiment, a solid support is provided with both immobilized forward and reverse primers. In practice, because the amplification process requires an excess of primers to maintain amplification, there will be "multiple" identical forward primers and/or "multiple" identical reverse primers immobilized on the solid support. References herein to forward and reverse primers should be construed as encompassing "multiple" such primers, unless the context dictates otherwise.

当業者に理解されるように、任意の所与の増幅反応は、増幅される鋳型に特異的な少なくとも１つのタイプの順方向プライマー及び少なくとも１つのタイプの逆方向プライマーを必要とする。しかしながら、特定の実施形態では、順方向及び逆方向プライマーは、同一配列の鋳型特異的部分を含んでもよく、完全に同一のヌクレオチド配列及び構造（任意の非ヌクレオチド修飾を含む）を有してもよい。換言すれば、１つのタイプのプライマーのみを用いて固相増幅を行うことができ、そのような単一プライマー法は、本開示の範囲内に包含される。他の実施形態は、同一の鋳型特異的配列を含むが、いくつかの他の構造的特徴において異なる順方向及び逆方向プライマーを使用してもよい。例えば、一方のタイプのプライマーは、他方には存在しない非ヌクレオチド修飾を含み得る。 As will be understood by those skilled in the art, any given amplification reaction requires at least one type of forward primer and at least one type of reverse primer specific to the template to be amplified. However, in certain embodiments, the forward and reverse primers may contain template-specific portions of the same sequence and may have the exact same nucleotide sequence and structure (including any non-nucleotide modifications). In other words, solid-phase amplification can be performed using only one type of primer, and such single-primer methods are encompassed within the scope of the present disclosure. Other embodiments may use forward and reverse primers that contain the same template-specific sequence but differ in some other structural feature. For example, one type of primer may contain a non-nucleotide modification that is not present in the other.

固相増幅用プライマーは、好ましくは、プライマーの５’末端又はその付近で固体支持体への単一点共有結合によって固定され、プライマーの鋳型特異的部分をその同族鋳型へのアニーリングのために自由にし、３’ヒドロキシル基をプライマー伸長のために自由にする。当該技術分野において既知の任意の好適な共有結合手段をこの目的のために使用することができる。選択された付着化学的物質は、固体支持体の性質、及びそれに適用される任意の誘導体化又は官能化に依存する。プライマー自体は、付着を促進するために非ヌクレオチド化学修飾であってもよい部分を含んでもよい。特定の実施形態では、プライマーは、５’末端にホスホロチオエート又はチオホスフェートなどの硫黄含有求核剤を含んでもよい。固体に支持されたポリアクリルアミドヒドロゲルの場合、この求核剤はヒドロゲルに存在するブロモアセトアミド基に結合する。プライマー及び鋳型を固体支持体に取り付けるより具体的な手段は、国際公開第０５／０６５８１４号に記載されるように、重合アクリルアミド及びＮ－（５－ブロモアセトアミドイルペンチル）アクリルアミド（ＢＲＡＰＡ）からなるヒドロゲルへの、５’ホスホロチオエート結合を介する。 Primers for solid-phase amplification are preferably immobilized to a solid support by a single-point covalent bond at or near the 5' end of the primer, leaving the template-specific portion of the primer free for annealing to its cognate template and the 3' hydroxyl group free for primer extension. Any suitable covalent attachment means known in the art can be used for this purpose. The attachment chemistry selected will depend on the nature of the solid support and any derivatization or functionalization applied to it. The primer itself may contain a moiety, which may be a non-nucleotide chemical modification, to facilitate attachment. In certain embodiments, the primer may contain a sulfur-containing nucleophile, such as a phosphorothioate or thiophosphate, at the 5' end. In the case of a solid-supported polyacrylamide hydrogel, this nucleophile binds to a bromoacetamide group present in the hydrogel. A more specific means of attaching primers and templates to a solid support is via a 5' phosphorothioate bond to a hydrogel composed of polymerized acrylamide and N-(5-bromoacetamidoylpentyl)acrylamide (BRAPA), as described in WO 05/065814.

本開示の特定の実施形態は、例えば、ポリヌクレオチドなど生体分子への共有結合を可能にする反応基を含む中間材料の層又はコーティングの適用によって「官能化」された不活性基質又はマトリックス（例えば、ガラススライド、ポリマービーズなど）を含む固体支持体を利用することができる。このような支持体の例としては、ガラスなどの不活性基質上に支持されるポリアクリルアミドヒドロゲルが挙げられるが、これに限定されない。このような実施形態では、生体分子（例えば、ポリヌクレオチド）は、中間材料（例えば、ヒドロゲル）に直接共有結合してもよいが、中間材料は、それ自体が基質又はマトリックス（例えば、ガラス基質）に非共有結合してもよい。用語「固体支持体への共有結合」は、このタイプの配置を包含するように適宜解釈されるべきである。 Certain embodiments of the present disclosure may utilize solid supports comprising an inert substrate or matrix (e.g., glass slides, polymeric beads, etc.) that has been "functionalized" by the application of a layer or coating of an intermediate material containing reactive groups that allow for covalent attachment to biomolecules, such as polynucleotides. Examples of such supports include, but are not limited to, polyacrylamide hydrogels supported on an inert substrate such as glass. In such embodiments, the biomolecule (e.g., polynucleotide) may be covalently attached directly to the intermediate material (e.g., hydrogel), or the intermediate material may itself be non-covalently attached to the substrate or matrix (e.g., glass substrate). The term "covalent attachment to a solid support" should be interpreted accordingly to encompass this type of arrangement.

プールされた試料は、ビーズ上で増幅されてもよく、各ビーズは、順方向及び逆方向増幅プライマーを含有する。一実施形態では、修飾標的核酸のライブラリーを使用して、米国特許出願公開第２００５／０１００９００号、米国特許第７，１１５，４００号、国際公開第００／１８９５７号及び国際公開第９８／４４１５１号に記載されているものに類似した、固相増幅、より具体的には固相等温増幅による核酸コロニーのクラスター化アレイを調製する。用語「クラスター」及び「コロニー」は、本明細書において交換可能に使用され、複数の同一の固定化核酸鎖及び複数の同一の固定化された相補的核酸鎖を含む、固体支持体上の別個の部位を指す。用語「クラスター化アレイ」とは、このようなクラスター又はコロニーから形成されるアレイを意味する。この文脈では、用語「アレイ」は、クラスターの順序付けられた配置を必要とするものとして理解されるべきではない。 Pooled samples may be amplified on beads, each bead containing a forward and reverse amplification primer. In one embodiment, a library of modified target nucleic acids is used to prepare clustered arrays of nucleic acid colonies by solid-phase amplification, more specifically solid-phase isothermal amplification, similar to that described in U.S. Patent Application Publication No. 2005/0100900, U.S. Patent No. 7,115,400, WO 00/18957, and WO 98/44151. The terms "cluster" and "colony" are used interchangeably herein and refer to distinct sites on a solid support containing a plurality of identical immobilized nucleic acid strands and a plurality of identical immobilized complementary nucleic acid strands. The term "clustered array" refers to an array formed from such clusters or colonies. In this context, the term "array" should not be understood as requiring an ordered arrangement of the clusters.

「固相」又は「表面」という用語は、プライマーが平坦な表面、例えば、ガラス、シリカ若しくはプラスチック顕微鏡スライド、又は類似のフロー細胞デバイスや、ビーズであって、１つ又は２つのプライマーが付着し、ビーズが増幅される、ビーズに取り付けられている平面アレイか、ビーズが増幅された後の表面上のビーズのアレイのいずれかを意味するために使用される。 The terms "solid phase" or "surface" are used to refer to either a planar array in which primers are attached to a flat surface, such as a glass, silica, or plastic microscope slide, or similar flow cell device or beads, to which one or two primers are attached and the beads are amplified, or an array of beads on a surface after the beads have been amplified.

クラスター化されたアレイは、国際公開第９８／４４１５１号に記載されているような熱サイクルのプロセス、又は温度が一定に維持され、試薬の変化を使用して延伸及び変性のサイクルが行われるプロセスを使用して調整され得る。このような等温増幅法は、国際公開第０２／４６４５６号及び米国特許出願公開第２００８／０００９４２０号に記載されている。 Clustered arrays can be prepared using a process of thermal cycling, such as that described in WO 98/44151, or a process in which the temperature is held constant and cycles of extension and denaturation are performed using varying reagents. Such isothermal amplification methods are described in WO 02/46456 and U.S. Patent Application Publication No. 2008/0009420.

本明細書に記載されるか、又は当技術分野において一般的に既知の増幅方法のいずれも、固定化ＤＮＡ断片を増幅するために、ユニバーサル又は標的特異的なプライマーとともに使用され得ることが理解されるであろう。増幅に好適な方法としては、米国特許第８，００３，３５４号に記載されているように、ポリメラーゼ連鎖反応（ＰＣＲ）、鎖置換増幅（strand displacement amplification、ＳＤＡ）、転写媒介増幅（transcription mediated amplification、ＴＭＡ）、及び核酸配列に基づく増幅（nucleic acid sequence-based amplification、ＮＡＳＢＡ）が挙げられるが、これらに限定されない。上記の増幅方法を用いて、対象とする１つ以上の核酸を増幅することができる。例えば、多重ＰＣＲ、ＳＤＡ、ＴＭＡ、ＮＡＳＢＡなどを含むＰＣＲを利用して、固定化ＤＮＡ断片を増幅することができる。いくつかの実施形態では、対象となるポリヌクレオチドに特異的に指向されるプライマーは、増幅反応に含まれる。 It will be understood that any of the amplification methods described herein or generally known in the art can be used with universal or target-specific primers to amplify immobilized DNA fragments. Suitable methods for amplification include, but are not limited to, polymerase chain reaction (PCR), strand displacement amplification (SDA), transcription-mediated amplification (TMA), and nucleic acid sequence-based amplification (NASBA), as described in U.S. Patent No. 8,003,354. The above amplification methods can be used to amplify one or more nucleic acids of interest. For example, PCR, including multiplex PCR, SDA, TMA, NASBA, etc., can be used to amplify immobilized DNA fragments. In some embodiments, primers specifically directed to polynucleotides of interest are included in the amplification reaction.

ポリヌクレオチドの増幅に好適な他の方法としては、オリゴヌクレオチド伸長及びライゲーション、ローリングサークル増幅（ＲＣＡ）（Ｌｉｚａｒｄｉｅｔａｌ．，Ｎａｔ．Ｇｅｎｅｔ．１９：２２５－２３２（１９９８））、及びオリゴヌクレオチドライゲーションアッセイ（oligonucleotide ligation assay、ＯＬＡ）技術を含み得る（一般に米国特許第７，５８２，４２０号、同第５，１８５，２４３号、同第５，６７９，５２４号、及び同第５，５７３，９０７号、欧州特許第０３２０３０８（Ｂ１）号、欧州特許第０３３６７３１（Ｂ１）号、欧州特許第０４３９１８２（Ｂ１）号、国際公開第９０／０１０６９号、国際公開第８９／１２６９６号、及び国際公開第８９／０９８３５号参照）。これらの増幅方法は、固定化ＤＮＡ断片を増幅するように設計され得ることが理解されるであろう。例えば、いくつかの実施形態では、増幅法は、対象の核酸に特異的に指向されるプライマーを含有するライゲーションプローブ増幅又はオリゴヌクレオチドライゲーションアッセイ（ＯＬＡ）反応を含んでもよい。いくつかの実施形態では、増幅法は、対象の核酸に特異的に指向されるプライマーを含有するプライマー伸長ライゲーション反応を含んでもよい。対象の核酸を増幅するよう特異的に設計され得るプライマー伸長及びライゲーションプライマーの非限定的な例として、増幅は、米国特許第７，５８２，４２０号及び同第７，６１１，８６９号により例示されるように、ＧｏｌｄｅｎＧａｔｅアッセイに使用されるプライマー（Ｉｌｌｕｍｉｎａ，Ｉｎｃ．，ＳａｎＤｉｅｇｏ，ＣＡ）を挙げることができる。 Other suitable methods for amplifying polynucleotides may include oligonucleotide extension and ligation, rolling circle amplification (RCA) (Lizardi et al., Nat. Genet. 19:225-232 (1998)), and oligonucleotide ligation assay (OLA) techniques (see generally U.S. Pat. Nos. 7,582,420, 5,185,243, 5,679,524, and 5,573,907; European Patent Nos. 0 320 308 (B1); 0 336 731 (B1); 0 439 182 (B1); WO 90/01069; WO 89/12696; and WO 89/09835). It will be understood that these amplification methods can be designed to amplify immobilized DNA fragments. For example, in some embodiments, the amplification method can include a ligation probe amplification or oligonucleotide ligation assay (OLA) reaction containing primers specifically directed to the nucleic acid of interest. In some embodiments, the amplification method can include a primer extension ligation reaction containing primers specifically directed to the nucleic acid of interest. Non-limiting examples of primer extension and ligation primers that can be specifically designed to amplify the nucleic acid of interest include the primers used in the GoldenGate assay (Illumina, Inc., San Diego, CA), as exemplified by U.S. Patent Nos. 7,582,420 and 7,611,869.

ＤＮＡナノボールも、本明細書に記載の方法、システム、及び組成物と組み合わせて使用することができる。ゲノムシークエンシングのためのＤＮＡナノブロックを作成及び使用するための方法は、例えば、米国特許及び公報である米国特許第７，９１０，３５４号、米国特許出願公開第２００９／０２６４２９９号、同第２００９／００１１９４３号、同第２００９／０００５２５２号、同第２００９／０１５５７８１号、同第２００９／０１１８４８８号に見出すことができ、例えば、Ｄｒｍａｎａｃら（２０１０，Ｓｃｉｅｎｃｅ３２７（５９６１）：７８－８１）に記載されているように見出すことができる。手短に言うと、非対称標的核酸の生成後、非対称標的核酸は環状化され、ローリングサークル増幅によって増幅される（Ｌｉｚａｒｄｉｅｔａｌ．，１９９８．Ｎａｔ．Ｇｅｎｅｔ．１９：２２５－２３２；米国特許出願公開第２００７／００９９２０８（Ａ１）号）。アンプリコンの伸長された鎖状構造は、コイリングを促進し、それによりコンパクトなＤＮＡナノボールを作成する。ＤＮＡナノボールは、基質上に捕捉され、好ましくは、各ナノボール間の距離が維持され、それによって別個のＤＮＡナノボールのシークエンシングが可能になるように、順序付けられた又はパターン化されたアレイを形成することができる。ＣｏｍｐｌｅｔｅＧｅｎｏｍｉｃｓ（ＭｏｕｎｔａｉｎＶｉｅｗ，Ｃａｌｉｆ．）によって使用されるものなどのいくつかの実施形態では、環状化の前にアダプター付加、増幅及び消化の連続ラウンドを実行して、アダプター配列によって分離されたいくつかの標的核酸を有するヘッドトゥテール構築物を作製する。 DNA nanoballs can also be used in combination with the methods, systems, and compositions described herein. Methods for making and using DNA nanoblocks for genome sequencing can be found, for example, in U.S. patents and publications such as U.S. Patent No. 7,910,354, U.S. Patent Application Publication Nos. 2009/0264299, 2009/0011943, 2009/0005252, 2009/0155781, and 2009/0118488, and as described, for example, in Drmanac et al. (2010, Science 327(5961):78-81). Briefly, after generation of the asymmetric target nucleic acid, the asymmetric target nucleic acid is circularized and amplified by rolling circle amplification (Lizardi et al., 1998. Nat. Genet. 19:225-232; U.S. Patent Application Publication No. 2007/0099208 A1). The extended, chain-like structure of the amplicon promotes coiling, thereby creating compact DNA nanoballs. The DNA nanoballs can be captured on a substrate, preferably forming an ordered or patterned array such that the distance between each nanoball is maintained, thereby enabling sequencing of individual DNA nanoballs. In some embodiments, such as those used by Complete Genomics (Mountain View, Calif.), sequential rounds of adapter addition, amplification, and digestion are performed prior to circularization to generate a head-to-tail construct with several target nucleic acids separated by adapter sequences.

本開示の方法で使用され得る例示的な等温増幅法としては、例えば、Ｄｅａｎｅｔａｌ．，Ｐｒｏｃ．Ｎａｔｌ．Ａｃａｄ．Ｓｃｉ．ＵＳＡ９９：５２６１－６６（２００２）、又は例えば米国特許第６，２１４，５８７号により例示される等温鎖置換核酸増幅によって例示される複数置換増幅（Multiple Displacement Amplification、ＭＤＡ）が挙げられるが、これらに限定されない。本開示で使用され得る他の非ＰＣＲ系方法としては、例えば、Ｗａｌｋｅｒｅｔａｌ．，ＭｏｌｅｃｕｌａｒＭｅｔｈｏｄｓｆｏｒＶｉｒｕｓＤｅｔｅｃｔｉｏｎ，ＡｃａｄｅｍｉｃＰｒｅｓｓＩｎｃ．，１９９５に記載されている鎖置換増幅（ＳＤＡ）、米国特許第５，４５５，１６６号、及び同第５，１３０，２３８号、並びにＷａｌｋｅｒｅｔａｌ．，Ｎｕｃｌ．ＡｃｉｄｓＲｅｓ．２０：１６９１－９６（１９９２）、又は、例えばＬａｇｅｅｔａｌ．，ＧｅｎｏｍｅＲｅｓ．１３：２９４－３０７（２００３）に記載されている過分枝鎖置換増幅が挙げられる。等温増幅法は、例えば、鎖置換Ｐｈｉ２９ポリメラーゼ又はＢｓｔＤＮＡポリメラーゼ大型断片、ゲノムＤＮＡのランダムプライマー増幅のための５’－＞３’エキソで使用することができる。これらのポリメラーゼの使用は、それらの高い加工性及び鎖置換活性の利点を利用する。高い加工性により、ポリメラーゼは、１０－２０ｋｂの長さの断片を生成できる。上記に述べたように、低加工性を有するポリメラーゼ及びＫｌｅｎｏｗポリメラーゼなどの鎖置換活性を有するポリメラーゼを使用して、等温条件下でより小さな断片を生成することができる。増幅反応、条件及び成分の更なる説明は、米国特許第７，６７０，８１０号の開示に詳細に記載されている。 Exemplary isothermal amplification methods that can be used in the methods of the present disclosure include, but are not limited to, multiple displacement amplification (MDA), exemplified by, for example, Dean et al., Proc. Natl. Acad. Sci. USA 99:5261-66 (2002), or isothermal strand displacement nucleic acid amplification, exemplified by, for example, U.S. Patent No. 6,214,587. Other non-PCR-based methods that can be used in the present disclosure include, for example, strand displacement amplification (SDA), as described in, for example, Walker et al., Molecular Methods for Virus Detection, Academic Press Inc., 1995, U.S. Patent Nos. 5,455,166 and 5,130,238, and Walker et al. , Nucl. Acids Res. 20:1691-96 (1992), or hyperbranched strand displacement amplification, as described, for example, in Lage et al., Genome Res. 13:294-307 (2003). Isothermal amplification methods can be used, for example, with strand-displacing Phi29 polymerase or Bst DNA polymerase large fragment, 5'->3' exo for random-primed amplification of genomic DNA. The use of these polymerases takes advantage of their high processivity and strand displacement activity. High processivity allows the polymerase to generate fragments of 10-20 kb in length. As mentioned above, polymerases with low processivity and polymerases with strand displacement activity, such as Klenow polymerase, can be used to generate smaller fragments under isothermal conditions. Further description of the amplification reaction, conditions, and components is described in detail in the disclosure of U.S. Pat. No. 7,670,810.

いくつかの実施形態では、等温増幅は、排除増幅（ＥｘＡｍｐ）とも呼ばれる、結合平衡除外増幅（kinetic exclusion amplification、ＫＥＡ）を使用して行うことができる。本開示の核酸ライブラリーは、増幅試薬を反応させて、部位に播種した個々の標的核酸からそれぞれがアンプリコンの実質的にクローン性集団を含む複数の増幅部位を生成する工程を含む方法を使用して作製することができる。いくつかの実施形態では、増幅反応は、それぞれの増幅部位の容量を満たすのに十分な数のアンプリコンが生成されるまで進行する。このように、既に播種された部位を容量まで満たすと、標的核酸がその部位に着地して増幅するのを阻害し、それによってその部位でアンプリコンのクローン集団を生成する。いくつかの実施形態では、第２の標的核酸がその部位に到達する前に増幅部位が容量まで満たされていなくても、見かけのクローン性を達成することができる。いくつかの条件下では、第１の標的核酸の増幅は、その部位に輸送される第２の標的核酸からのコピーの生成を有効に上回るか又は圧倒するのに十分な数のコピーが作製される点まで進行し得る。例えば、直径５００ｎｍ未満の円形特徴部上でブリッジ増幅プロセスを使用する実施形態では、第１の標的核酸に対する指数増幅の１４サイクル後、同じ部位での第２の標的核酸からの汚染は、Ｉｌｌｕｍｉｎａシークエンシングプラットフォーム上での配列合成分析に悪影響を及ぼすのに不十分な数の汚染アンプリコンを生成することが決定された。 In some embodiments, isothermal amplification can be performed using kinetic exclusion amplification (KEA), also known as exclusion amplification (ExAmp). Nucleic acid libraries of the present disclosure can be created using a method comprising reacting amplification reagents to generate a plurality of amplification sites, each containing a substantially clonal population of amplicons, from individual target nucleic acids seeded at the sites. In some embodiments, the amplification reaction proceeds until a sufficient number of amplicons are generated to fill each amplification site to capacity. In this manner, filling a previously seeded site to capacity inhibits target nucleic acids from landing and amplifying at that site, thereby generating a clonal population of amplicons at that site. In some embodiments, apparent clonality can be achieved even if an amplification site is not filled to capacity before a second target nucleic acid arrives at the site. Under some conditions, amplification of a first target nucleic acid can proceed to a point where a sufficient number of copies are generated to effectively exceed or overwhelm the generation of copies from a second target nucleic acid transported to that site. For example, in an embodiment using a bridge amplification process on circular features less than 500 nm in diameter, it was determined that after 14 cycles of exponential amplification for a first target nucleic acid, contamination from a second target nucleic acid at the same site would generate an insufficient number of contaminating amplicons to adversely affect sequence synthesis analysis on an Illumina sequencing platform.

いくつかの実施形態では、アレイ中の増幅部位は、完全にクローンであることができるが、必ずしもそうである必要はない。むしろ、いくつかの用途では、個々の増幅部位は、主に第１の非対称標的核酸からのアンプリコンで占められ、また、第２の非対称標的核酸からの低レベルの汚染アンプリコンを有し得る。アレイは、汚染レベルがアレイのその後の使用に許容できない影響を有さない限り、低レベルの汚染アンプリコンを有する１つ以上の増幅部位を有することができる。例えば、アレイが検出用途で使用される場合、許容可能なレベルの汚染は、検出技術の信号対雑音比又は分解能に許容できない方法で影響を与えないレベルである。したがって、見かけのクローン性は、一般に、本明細書に記載の方法によって作製されるアレイの特定の使用又は用途に関連する。特定の用途のために個々の増幅部位で許容できる汚染の例示的なレベルとしては、最大で０．１％、０．５％、１％、５％、１０％又は２５％の汚染アンプリコンを含むが、これらに限定されない。アレイは、これらの例示的なレベルの汚染アンプリコンを有する１つ以上の増幅部位を含み得る。例えば、アレイ内の増幅部位の最大５％、１０％、２５％、５０％、７５％、又は更には１００％に、汚染されたアンプリコンが含まれている可能性がある。アレイ又はその他の部位集合において、部位の少なくとも５０％、７５％、８０％、８５％、９０％、９５％又は９９％以上がクローン性であるか、又は見かけでクローン性であり得ることが理解されよう。 In some embodiments, amplification sites in an array can be, but are not necessarily, completely clonal. Rather, in some applications, individual amplification sites may be populated primarily with amplicons from a first asymmetric target nucleic acid and may also have low levels of contaminating amplicons from a second asymmetric target nucleic acid. Arrays can have one or more amplification sites with low levels of contaminating amplicons, as long as the level of contamination does not have an unacceptable effect on subsequent use of the array. For example, if an array is used in a detection application, an acceptable level of contamination is one that does not unacceptably affect the signal-to-noise ratio or resolution of the detection technique. Thus, apparent clonality generally relates to the particular use or application of an array produced by the methods described herein. Exemplary levels of contamination that are acceptable in individual amplification sites for a particular application include, but are not limited to, up to 0.1%, 0.5%, 1%, 5%, 10%, or 25% contaminating amplicons. Arrays can include one or more amplification sites with these exemplary levels of contaminating amplicons. For example, up to 5%, 10%, 25%, 50%, 75%, or even 100% of the amplification sites within an array may contain contaminating amplicons. It will be understood that in an array or other collection of sites, at least 50%, 75%, 80%, 85%, 90%, 95%, or 99% or more of the sites may be clonal or appear clonal.

いくつかの実施形態では、結合平衡除外は、別のイベント又はプロセスが発生することを効果的に排除するために、十分に速い速度でプロセスが生じるときに生じ得る。アレイの部位が溶液からの非対称標的核酸でランダムに播種され、非対称標的核酸のコピーが増幅プロセスで生成されて、播種部位の各々を容量を満たす核酸アレイの作製を例として取り上げる。本開示の結合平衡除外法によれば、播種及び増幅プロセスは、増幅速度が播種速度を超える条件下で同時に進行することができる。したがって、第１の標的核酸によって播種された部位でコピーが作製される比較的速い速度は、増幅のためにその部位を播種することから、第２の核酸を効果的に排除する。結合平衡除外法は、米国特許出願公開第２０１３／０３３８０４２号の開示に詳細に記載されているように実施することができる。 In some embodiments, binding equilibrium exclusion can occur when a process occurs at a rate fast enough to effectively exclude another event or process from occurring. Take, for example, the creation of a nucleic acid array in which array sites are randomly seeded with asymmetric target nucleic acids from solution, and copies of the asymmetric target nucleic acids are generated in an amplification process to fill each seeded site to capacity. According to the binding equilibrium exclusion method of the present disclosure, the seeding and amplification processes can proceed simultaneously under conditions in which the amplification rate exceeds the seeding rate. Thus, the relatively fast rate at which copies are made at a site seeded by a first target nucleic acid effectively excludes a second nucleic acid from seeding that site for amplification. The binding equilibrium exclusion method can be performed as described in detail in the disclosure of U.S. Patent Application Publication No. 2013/0338042.

結合平衡除外は、増幅を開始する比較的遅い速度（例えば、非対称標的核酸の第１のコピーを作製する比較的遅い速度）に対して、非対称標的核酸（又は非対称標的核酸の第１のコピーの）の後続のコピーを作製する比較的速い速度を利用することができる。前段落の例では、結合平衡除外は、比較的遅い速度の非対称標的核酸の播種（例えば、比較的遅い拡散又は輸送）に対して、非対称標的核酸播種のコピーで部位を満たすために増幅が生じる比較的速い速度のために生じる。別の例示的な実施形態では、結合平衡除外は、部位に播種した非対称標的核酸の第１のコピーの形成の遅延（例えば、遅延活性化又は遅い活性化）に対して、部位を満たすために後続のコピーが作製される比較的速い速度のために生じ得る。この例では、個々の部位にいくつかの異なる非対称標的核酸が播種されている可能性がある（例えば、増幅前に、各部位にいくつかの非対称標的核酸が存在し得る）。しかしながら、任意の所与の非対称標的核酸の第１のコピー形成がランダムに活性化され得、これにより、後続のコピーが生成される速度と比較して第１のコピー形成の平均速度が比較的遅くなる。この場合、個々の部位にいくつかの異なる非対称標的核酸が播種されている可能性があるが、結合平衡除外により、それらのうちの１つのみの増幅が可能になる。より具体的には、第１の非対称標的核酸が増幅のために活性化されると、その部位はそのコピーで急速に容量が満たされ、それにより、その部位で第２の非対称標的核酸のコピーが作製されるのが防止される。 Binding equilibrium exclusion can take advantage of a relatively slow rate of initiating amplification (e.g., a relatively slow rate of creating the first copy of the asymmetric target nucleic acid) versus a relatively fast rate of creating subsequent copies of the asymmetric target nucleic acid (or of the first copy of the asymmetric target nucleic acid). In the example of the previous paragraph, binding equilibrium exclusion occurs because of a relatively fast rate of amplification occurring to fill sites with seeded copies of the asymmetric target nucleic acid versus a relatively slow rate of seeding of the asymmetric target nucleic acid (e.g., relatively slow diffusion or transport). In another exemplary embodiment, binding equilibrium exclusion can occur because of a delayed formation of the first copy of the asymmetric target nucleic acid seeded at a site (e.g., delayed activation or slow activation) versus a relatively fast rate at which subsequent copies are created to fill sites. In this example, several different asymmetric target nucleic acids may be seeded at individual sites (e.g., several asymmetric target nucleic acids may be present at each site prior to amplification). However, first copy formation of any given asymmetric target nucleic acid may be activated randomly, resulting in a relatively slow average rate of first copy formation compared to the rate at which subsequent copies are generated. In this case, although an individual site may be seeded with several different asymmetric target nucleic acids, binding equilibrium exclusion allows amplification of only one of them. More specifically, when a first asymmetric target nucleic acid is activated for amplification, the site is rapidly filled to capacity with copies of that nucleic acid, thereby preventing copies of a second asymmetric target nucleic acid from being made at that site.

一実施形態では、本方法は、（ｉ）平均輸送速度で増幅部位に非対称標的核酸を輸送すること、（ｉｉ）平均増幅速度で増幅部位にある非対称標的核酸を増幅することを同時に行うため、平均増幅速度が平均輸送速度を超える（米国特許第９，１６９，５１３号）。したがって、このような実施形態では、比較的遅い輸送速度を使用することによって、結合平衡除外を達成することができる。例えば、濃度が低いと、輸送の平均速度が遅くなるので、濃度が十分に低い非対称標的核酸を選択して、所望の平均輸送速度を達成することができる。代替的に又は追加的に、溶液中の高粘度溶液及び／又は分子クラウディング試薬の存在を使用して、輸送速度を低下させることができる。有用な分子クラウディング試薬の例としては、ポリエチレングリコール（polyethylene glycol、ＰＥＧ）、フィコール、デキストラン、又はポリビニルアルコールが挙げられるが、これらに限定されない。例示的な分子クラウディング試薬及び製剤は、参照により本明細書に組み込まれる米国特許第７，３９９，５９０号に記載されている。所望の輸送速度を達成するように調節することができる別の因子は、標的核酸の平均サイズである。 In one embodiment, the method simultaneously (i) transports the asymmetric target nucleic acid to the amplification site at an average transport rate and (ii) amplifies the asymmetric target nucleic acid at the amplification site at an average amplification rate, such that the average amplification rate exceeds the average transport rate (U.S. Patent No. 9,169,513). Therefore, in such an embodiment, binding equilibrium exclusion can be achieved by using a relatively slow transport rate. For example, a sufficiently low concentration of the asymmetric target nucleic acid can be selected to achieve the desired average transport rate, since low concentrations result in a slow average transport rate. Alternatively or additionally, a high viscosity solution and/or the presence of a molecular crowding agent in the solution can be used to reduce the transport rate. Examples of useful molecular crowding agents include, but are not limited to, polyethylene glycol (PEG), Ficoll, dextran, or polyvinyl alcohol. Exemplary molecular crowding agents and formulations are described in U.S. Patent No. 7,399,590, incorporated herein by reference. Another factor that can be adjusted to achieve the desired transport rate is the average size of the target nucleic acid.

増幅試薬は、アンプリコン形成を促進する更なる成分を含むことができ、場合によってはアンプリコン形成の速度を増加させる。一実施例は、リコンビナーゼである。リコンビナーゼは、反復的な浸潤／伸長を可能にすることによって、アンプリコン形成を促進することができる。より具体的には、リコンビナーゼは、ポリメラーゼによる非対称標的核酸の浸潤と、非対称標的核酸をアンプリコン形成の鋳型として使用するポリメラーゼによるプライマーの伸長と、を促進することができる。このプロセスは、浸潤／伸長の各ラウンドから生成されたアンプリコンが後続のラウンドで鋳型として機能する鎖反応として繰り返すことができる。変性サイクル（例えば、加熱又は化学変性による）は必要とされないため、このプロセスは標準的なＰＣＲよりも迅速に行うことができる。したがって、リコンビナーゼ促進増幅は、等温的に行うことができる。増幅を促進するために、リコンビナーゼ促進増幅試薬中に、ＡＴＰ、又は他のヌクレオチド（又は場合によってはその非加水分解性類似体）を含めることが望ましい。リコンビナーゼと一本鎖結合（single stranded binding、ＳＳＢ）タンパク質の混合物は、ＳＳＢが増幅を更に促進できるため、特に有用である。リコンビナーゼ促進増幅のための代表的な製剤としては、ＴｗｉｓｔＤｘ社（Ｃａｍｂｒｉｄｇｅ，ＵＫ）によりＴｗｉｓｔＡｍｐキットとして市販されているものが挙げられる。リコンビナーゼ促進増幅試薬の有用な成分及び反応条件は、米国特許第５，２２３，４１４号及び同第７，３９９，５９０号に記載されている。 Amplification reagents can include additional components that promote amplicon formation, potentially increasing the rate of amplicon formation. One example is a recombinase. Recombinases can promote amplicon formation by enabling repeated invasion/extension. More specifically, recombinases can promote the invasion of an asymmetric target nucleic acid by a polymerase and the extension of a primer by the polymerase, using the asymmetric target nucleic acid as a template for amplicon formation. This process can be repeated as a chain reaction, with amplicons generated from each round of invasion/extension serving as templates in subsequent rounds. Because no denaturation cycles (e.g., by heating or chemical denaturation) are required, this process can be performed more rapidly than standard PCR. Therefore, recombinase-promoted amplification can be performed isothermally. To promote amplification, it is desirable to include ATP or other nucleotides (or, in some cases, non-hydrolyzable analogs thereof) in the recombinase-promoted amplification reagent. A mixture of recombinase and single-stranded binding (SSB) protein is particularly useful, as SSB can further enhance amplification. Exemplary formulations for recombinase-promoted amplification include those commercially available as TwistAmp kits from TwistDx (Cambridge, UK). Useful components and reaction conditions for recombinase-promoted amplification reagents are described in U.S. Patent Nos. 5,223,414 and 7,399,590.

アンプリコン形成を促進し、場合によってはアンプリコン形成の速度を増加させるために増幅試薬に含めることができる成分の別の例は、ヘリカーゼである。ヘリカーゼは、アンプリコン形成の連鎖反応を可能にすることによって、アンプリコン形成を促進することができる。変性サイクル（例えば、加熱又は化学変性による）は必要とされないため、このプロセスは標準的なＰＣＲよりも迅速に行うことができる。したがって、ヘリカーゼ促進増幅は、等温的に行うことができる。ヘリカーゼと一本鎖結合（ＳＳＢ）タンパク質の混合物は、ＳＳＢが増幅を更に促進できるため、特に有用である。ヘリカーゼ促進増幅のための代表的な製剤としては、Ｂｉｏｈｅｌｉｘ社（Ｂｅｖｅｒｌｙ，ＭＡ）からＩｓｏＡｍｐキットとして市販されているものが挙げられる。更に、ヘリカーゼタンパク質を含む有用な製剤の例は、米国特許第７，３９９，５９０号及び同第７，８２９，２８４号に記載されている。 Another example of a component that can be included in an amplification reagent to facilitate and, in some cases, increase the rate of amplicon formation is a helicase. Helicases can facilitate amplicon formation by enabling a chain reaction of amplicon formation. Because no denaturation cycles (e.g., by heating or chemical denaturation) are required, this process can be more rapid than standard PCR. Helicase-promoted amplification can therefore be performed isothermally. A mixture of helicase and single-strand binding (SSB) protein is particularly useful, as SSB can further facilitate amplification. Exemplary formulations for helicase-promoted amplification include those commercially available as IsoAmp kits from Biohelix, Inc. (Beverly, MA). Further examples of useful formulations containing helicase proteins are described in U.S. Patent Nos. 7,399,590 and 7,829,284.

アンプリコン形成を促進し、場合によってはアンプリコン形成の速度を増加させるために増幅試薬に含めることができる成分の更に別の例は、起点結合タンパク質である。 Another example of a component that can be included in an amplification reagent to facilitate amplicon formation, and in some cases increase the rate of amplicon formation, is an origin binding protein.

シークエンシング方法
非対称標的核酸の表面への結合に続いて、固定化され増幅された非対称標的核酸の配列が決定される。シークエンシングは、任意の好適なシークエンシング技術を使用して実施することができ、鎖再合成を含む、固定化され、増幅された非対称修飾標的核酸の配列を決定するための方法は、当技術分野において既知であり、例えば、Ｂｉｇｎｅｌｌｅｔａｌ．（米国特許第８，０５３，１９２号）、Ｇｕｎｄｅｒｓｏｎｅｔａｌ．（国際公開第２０１６／１３０７０４号）、Ｓｈｅｎｅｔａｌ．（米国特許第８，８９５，２４９号）、及びＰｉｐｅｎｂｕｒｇｅｔａｌ．（米国特許第９，３０９，５０２号）に記載されている。 Following the binding of the asymmetric target nucleic acid to the surface, the sequence of the immobilized and amplified asymmetric target nucleic acid is determined.Sequencing can be carried out using any suitable sequencing technology, and the method for determining the sequence of the immobilized and amplified asymmetric modified target nucleic acid, including strand resynthesis, is known in the art, and is described in, for example, Bignell et al. (US Pat. No. 8,053,192), Gunderson et al. (WO 2016/130704), Shen et al. (US Pat. No. 8,895,249) and Pipenberg et al. (US Pat. No. 9,309,502).

本明細書に記載の方法は、様々な核酸シークエンシング方法と併せて使用することができる。特に適用可能な技術は、核酸が、それらの相対的位置が変化しないようにアレイ内の固定位置に取り付けられ、アレイが繰り返し撮像されるものである。例えば、１つのヌクレオチド塩基型を別のヌクレオチド塩基型と区別するために使用される異なる標識と一致する異なる色チャネルで画像が得られる実施形態は、特に適用可能である。いくつかの実施形態では、非対称標的核酸のヌクレオチド配列を決定するプロセスは、自動プロセスであり得る。好ましい実施形態としては、合成によるシークエンシング（「ＳＢＳ（sequencing-by-synthesis）」）技術が挙げられる。 The methods described herein can be used in conjunction with various nucleic acid sequencing methods. Particularly applicable techniques are those in which nucleic acids are attached to fixed positions within an array so that their relative positions do not change, and the array is repeatedly imaged. For example, embodiments in which images are obtained in different color channels corresponding to different labels used to distinguish one nucleotide base type from another are particularly applicable. In some embodiments, the process of determining the nucleotide sequence of an asymmetric target nucleic acid can be an automated process. Preferred embodiments include sequencing-by-synthesis ("SBS") techniques.

ＳＢＳ技術は、一般に、鋳型鎖に対するヌクレオチドの反復的付加による、新生核酸鎖の酵素的伸長を伴う。ＳＢＳの従来の方法では、単一のヌクレオチドモノマーが、各送達においてポリメラーゼの存在下で標的ヌクレオチドに提供され得る。しかしながら、本明細書に記載の方法では、送達中のポリメラーゼの存在下で、複数のタイプのヌクレオチドモノマーを標的核酸に提供することができる。 SBS techniques generally involve the enzymatic extension of a nascent nucleic acid strand by the repetitive addition of nucleotides to a template strand. In traditional methods of SBS, a single nucleotide monomer may be provided to the target nucleic acid in the presence of a polymerase during each delivery. However, the methods described herein allow for multiple types of nucleotide monomers to be provided to the target nucleic acid in the presence of a polymerase during each delivery.

一実施形態では、ヌクレオチドモノマーは、ロックされた核酸（locked nucleic acid、ＬＮＡ）又は架橋核酸（bridged nucleic acid、ＢＮＡ）を含む。ヌクレオチドモノマーにおけるＬＮＡ又はＢＮＡの使用は、ヌクレオチドモノマーと固定化された非対称修飾標的核酸上に存在するシークエンシングプライマー配列との間のハイブリダイゼーション強度を増加させる。 In one embodiment, the nucleotide monomer comprises a locked nucleic acid (LNA) or a bridged nucleic acid (BNA). The use of an LNA or BNA in the nucleotide monomer increases the hybridization strength between the nucleotide monomer and a sequencing primer sequence present on the immobilized asymmetrically modified target nucleic acid.

ＳＢＳは、ターミネーター部分を有するヌクレオチドモノマー、又はターミネーター部分を欠くヌクレオチドモノマーを使用することができる。ターミネーターを含まないヌクレオチドモノマーを使用する方法としては、例えば、本明細書で更に詳細に記載されるように、γ－リン酸標識ヌクレオチドを用いたピロシークエンシング及びシークエンシングが挙げられる。ターミネーターを含まないヌクレオチドモノマーを使用する方法では、各サイクルに付加されるヌクレオチドの数は、一般に可変であり、鋳型配列及びヌクレオチド送達のモードに依存する。ターミネーター部分を有するヌクレオチドモノマーを利用するＳＢＳ技術では、ターミネーターは、ジデオキシヌクレオチドを利用する従来のＳａｎｇｅｒシークエンシングの場合のように使用されるシークエンシング条件下で有効に不可逆的であり得るか、又はターミネーターは、Ｓｏｌｅｘａ（現在はＩｌｌｕｍｉｎａ，Ｉｎｃ．）によって開発されたシークエンシング方法の場合のように可逆的であり得る。 SBS can use nucleotide monomers that have terminator moieties or that lack terminator moieties. Methods that use nucleotide monomers that do not contain terminators include, for example, pyrosequencing and sequencing using γ-phosphate-labeled nucleotides, as described in further detail herein. In methods that use nucleotide monomers that do not contain terminators, the number of nucleotides added in each cycle is generally variable and depends on the template sequence and the mode of nucleotide delivery. In SBS techniques that utilize nucleotide monomers that have terminator moieties, the terminators can be effectively irreversible under the sequencing conditions used, as in traditional Sanger sequencing that uses dideoxynucleotides, or the terminators can be reversible, as in the sequencing method developed by Solexa (now Illumina, Inc.).

ＳＢＳ技術は、標識部分を有するヌクレオチドモノマー、又は標識部分を欠くヌクレオチドモノマーを使用することができる。したがって、標識の蛍光などの標識の特性、分子量又は電荷などのヌクレオチドモノマーの特性、ピロリン酸の放出などのヌクレオチドの組み込みの副生成物などに基づいて、組み込みイベントを検出することができる。２つ以上の異なるヌクレオチドがシークエンシング試薬中に存在する実施形態では、異なるヌクレオチドは互いに区別可能であってもよく、あるいは２つ以上の異なる標識は、使用される検出技術の下で区別可能であり得る。例えば、シークエンシング試薬中に存在する異なるヌクレオチドは、異なる標識を有することができ、それらは、Ｓｏｌｅｘａ社（現Ｉｌｌｕｍｉｎａ社）によって開発されたシークエンシング方法によって例示される適切な光学系を使用して区別することができる。 SBS techniques can use nucleotide monomers that have a label moiety or lack a label moiety. Thus, incorporation events can be detected based on properties of the label, such as fluorescence of the label; properties of the nucleotide monomer, such as molecular weight or charge; by-products of nucleotide incorporation, such as the release of pyrophosphate; and the like. In embodiments in which two or more different nucleotides are present in the sequencing reagent, the different nucleotides may be distinguishable from one another, or the two or more different labels may be distinguishable under the detection technique used. For example, different nucleotides present in the sequencing reagent can have different labels, which can be distinguished using appropriate optical systems, as exemplified by the sequencing method developed by Solexa (now Illumina).

好ましい実施形態としては、ピロシークエンシング技術が挙げられる。パイロシークエンシングは、特定のヌクレオチドが新生鎖に組み込まれるときに無機ピロリン酸塩（ＰＰｉ）の放出を検出する（Ｒｏｎａｇｈｉ，Ｍ．，Ｋａｒａｍｏｈａｍｅｄ，Ｓ．，Ｐｅｔｔｅｒｓｓｏｎ，Ｂ．，Ｕｈｌｅｎ，Ｍ．ａｎｄＮｙｒｅｎ，Ｐ．（１９９６）「Ｒｅａｌ－ｔｉｍｅＤＮＡｓｅｑｕｅｎｃｉｎｇｕｓｉｎｇｄｅｔｅｃｔｉｏｎｏｆｐｙｒｏｐｈｏｓｐｈａｔｅｒｅｌｅａｓｅ．」ＡｎａｌｙｔｉｃａｌＢｉｏｃｈｅｍｉｓｔｒｙ２４２（１），８４－９、Ｒｏｎａｇｈｉ，Ｍ．（２００１）「ＰｙｒｏｓｅｑｕｅｎｃｉｎｇｓｈｅｄｓｌｉｇｈｔｏｎＤＮＡｓｅｑｕｅｎｃｉｎｇ．」ＧｅｎｏｍｅＲｅｓ．１１（１），３－１１、Ｒｏｎａｇｈｉ，Ｍ．，Ｕｈｌｅｎ，Ｍ．ａｎｄＮｙｒｅｎ，Ｐ．（１９９８）「Ａｓｅｑｕｅｎｃｉｎｇｍｅｔｈｏｄｂａｓｅｄｏｎｒｅａｌ－ｔｉｍｅｐｙｒｏｐｈｏｓｐｈａｔｅ．」Ｓｃｉｅｎｃｅ２８１（５３７５），３６３、米国特許第６，２１０，８９１号、同第６，２５８，５６８号及び同第６，２７４，３２０号）。ピロシークエンシングにおいて、放出されたＰＰｉは、ＡＴＰスルフラーゼによってアデノシン三リン酸（ＡＴＰ）に即座に変換されることによって検出することができ、生成されたＡＴＰのレベルはルシフェラーゼで生成された光子を介して検出される。シークエンシングされる核酸は、アレイ中の特徴部に付着させることができ、アレイは、アレイの特徴部にヌクレオチドを組み込むことにより生成される化学発光シグナルを捕捉するために画像化することができる。アレイを特定のヌクレオチド型（例えば、Ｔ、Ｃ、又はＧ）で処理した後に、画像を得ることができる。各ヌクレオチド型の付加後に得られる画像は、アレイ内のどの特徴部が検出されるかに関して異なる。画像内のこれらの差異は、アレイ上の特徴部の異なる配列コンテンツを反映する。しかしながら、各特徴部の相対的な位置は、画像内で変わらないままである。画像は、本明細書に記載の方法を使用して記憶、処理、及び分析することができる。例えば、アレイを各異なるヌクレオチド型で処理した後に得られる画像は、可逆的ターミネーターベースのシークエンシング方法のための異なる検出チャネルから得られる画像について、本明細書に例示されるものと同じ方法で処理することができる。 A preferred embodiment is pyrosequencing technology. Pyrosequencing detects the release of inorganic pyrophosphate (PPi) when a specific nucleotide is incorporated into a nascent strand (Ronaghi, M., Karamohamed, S., Petersson, B., Uhlen, M. and Nyren, P. (1996) "Real-time DNA sequencing using detection of pyrophosphate release." Analytical Biochemistry 242(1), 84-9; Ronaghi, M. (2001) "Pyrosequencing sheds light on DNA sequencing." Genome Res. 11(1), 3-11; Ronaghi, M., Uhlen, M. and Nyren, P. (1998) "A sequencing method based on real-time pyrophosphate." Science 281(5375), 363; U.S. Patent Nos. 6,210,891, 6,258,568 and 6,274,320. In pyrosequencing, the released PPi can be detected by its immediate conversion to adenosine triphosphate (ATP) by ATP sulfurase, and the level of ATP produced is detected via luciferase-generated photons. Nucleic acids to be sequenced can be attached to features in an array, and the array can be imaged to capture chemiluminescent signals generated by the incorporation of nucleotides into the features of the array. Images can be obtained after treating the array with a particular nucleotide type (e.g., T, C, or G). The images obtained after the addition of each nucleotide type differ in terms of which features in the array are detected. These differences in the images reflect the different sequence content of the features on the array. However, the relative positions of each feature remain unchanged in the image. Images can be stored, processed, and analyzed using methods described herein. For example, images obtained after treating the array with each different nucleotide type can be processed in the same manner as exemplified herein for images obtained from different detection channels for reversible terminator-based sequencing methods.

別の例示的な種類のＳＢＳでは、サイクルシークエンシングは、例えば、国際公開第０４／０１８４９７号及び米国特許第７，０５７，０２６号に記載されているような開裂可能な又は光漂白可能な染料標識を含む可逆的ターミネーターヌクレオチドを段階的に付加することによって達成される。この手法は、Ｓｏｌｅｘａ社（現在Ｉｌｌｕｍｉｎａ社）によって商品化されており、国際公開第９１／０６６７８号及び同第０７／１２３，７４４号にも記載されている。終端の両方を逆転させることができ、蛍光標識が開裂された蛍光標識ターミネーターの可用性は、効率的な循環可逆的終端（ＣＲＴ）シークエンシングを容易にする。ポリメラーゼはまた、これらの修飾されたヌクレオチドを効率的に組み込み、かつそこから伸長するように共操作することもできる。 In another exemplary type of SBS, cycle sequencing is achieved by the stepwise addition of reversible terminator nucleotides containing cleavable or photobleachable dye labels, as described, for example, in WO 04/018497 and U.S. Pat. No. 7,057,026. This approach has been commercialized by Solexa (now Illumina) and is also described in WO 91/06678 and WO 07/123,744. The availability of fluorescently labeled terminators, both of whose termini can be reversed and from which the fluorescent labels are cleaved, facilitates efficient cyclic reversible termination (CRT) sequencing. Polymerases can also be co-engineered to efficiently incorporate and extend from these modified nucleotides.

いくつかの可逆的ターミネーターベースのシークエンシング実施形態では、標識は、ＳＢＳ反応条件下での伸長を実質的に阻害しない。しかしながら、検出標識は、例えば、開裂又は分解によって取り外し可能であり得る。画像は、アレイ化された核酸特徴部への標識の組み込み後に捕捉することができる。特定の実施形態では、各サイクルは、アレイへの４つの異なるヌクレオチド型の同時送達を伴い、各ヌクレオチド型は、スペクトル的に異なる標識を有する。次に、４つの異なる標識の１つに選択的な検出チャネルをそれぞれ使用して、４つの画像を得ることができる。あるいは、異なるヌクレオチド型を順次追加することができ、各追加工程の間にアレイの画像を得ることができる。このような実施形態では、各画像は、特定の型のヌクレオチドを組み込んだ核酸特徴部を示す。各特徴部の配列コンテンツが異なるため、様々な画像に様々な特徴部が存在するか、存在しない。しかしながら、特徴部の相対的な位置は、画像内で変わらないままである。このような可逆的ターミネーター－ＳＢＳ法から得られる画像は、本明細書に記載されるように保存、処理、及び分析することができる。画像捕捉工程に続いて、標識を除去することができ、その後のヌクレオチド付加及び検出のサイクルのために可逆的ターミネーター部分を除去することができる。特定のサイクルで検出された後、及び後続のサイクルの前に標識を除去すると、サイクル間のバックグラウンド信号及びクロストークを低減できるという利点がある。有用な標識及び除去方法の例を本明細書に記載する。 In some reversible terminator-based sequencing embodiments, the label does not substantially inhibit extension under SBS reaction conditions. However, the detection label may be removable, for example, by cleavage or degradation. Images can be captured after incorporation of the label into arrayed nucleic acid features. In certain embodiments, each cycle involves the simultaneous delivery of four different nucleotide types to the array, with each nucleotide type bearing a spectrally distinct label. Four images can then be acquired, each using a detection channel selective for one of the four different labels. Alternatively, different nucleotide types can be added sequentially, with images of the array being acquired between each addition step. In such embodiments, each image shows nucleic acid features incorporating a particular type of nucleotide. Because the sequence content of each feature varies, different features are present or absent in different images. However, the relative positions of the features remain unchanged within the image. Images obtained from such reversible terminator-SBS methods can be stored, processed, and analyzed as described herein. Following the image capture step, the label can be removed, and the reversible terminator moiety can be removed for subsequent cycles of nucleotide addition and detection. Removing the label after detection in a particular cycle and before the subsequent cycle has the advantage of reducing background signal and crosstalk between cycles. Examples of useful labeling and removal methods are described herein.

特定の実施形態では、ヌクレオチドモノマーの一部又は全ては、可逆的ターミネーターを含み得る。このような実施形態では、可逆的ターミネーター／開裂可能なフルオロフォアは、３’エステル結合（Ｍｅｔｚｋｅｒ，ＧｅｎｏｍｅＲｅｓ．１５：１７６７－１７７６（２００５））を介してリボース部分に結合されたフルオロフォアを含み得る。他の手法は、蛍光標識（Ｒｕｐａｒｅｌｅｔａｌ．，ＰｒｏｃＮａｔｌＡｃａｄＳｃｉＵＳＡ１０２：５９３２－７（２００５））からターミネーターの化学的物質を分離した。Ｒｕｐａｒｅｌらは、少量の３’アリル基を使用して伸長をブロックするが、パラジウム触媒で短時間処理することで簡単にブロックを解除できる可逆性ターミネーターの開発について説明している。フルオロフォアは、長波長ＵＶ光への３０秒の曝露によって容易に開裂することができる光開裂可能リンカーを介して基に付着された。したがって、ジスルフィド還元又は光開裂のいずれかを開裂可能なリンカーとして使用することができる。可逆的終端への別の手法は、ｄＮＴＰ上に嵩高な染料を配置した後に続く自然終端の使用である。ｄＮＴＰ上の帯電した嵩高な染料の存在は、立体障害及び／又は静電障害を介して効果的なターミネーターとして作用することができる。１つの組み込みイベントの存在は、染料が除去されない限り、それ以上の結合を防止する。染料の開裂は、フルオロフォアを除去し、終端を効果的に逆転させる。修飾ヌクレオチドの例は、米国特許第７，４２７，６７３号及び同第７，０５７，０２６号にも記載されている。 In certain embodiments, some or all of the nucleotide monomers may contain reversible terminators. In such embodiments, the reversible terminator/cleavable fluorophore may comprise a fluorophore attached to the ribose moiety via a 3' ester bond (Metzker, Genome Res. 15:1767-1776 (2005)). Another approach has separated the terminator chemistry from the fluorescent label (Ruparel et al., Proc Natl Acad Sci USA 102:5932-7 (2005)). Ruparel et al. describe the development of a reversible terminator that uses a small 3' allyl group to block extension but can be easily unblocked by brief treatment with a palladium catalyst. The fluorophore was attached to the group via a photocleavable linker that can be easily cleaved by 30 seconds of exposure to long-wavelength UV light. Thus, either disulfide reduction or photocleavage can be used as the cleavable linker. Another approach to reversible termination is the use of a natural termination followed by the placement of a bulky dye on the dNTP. The presence of a charged, bulky dye on the dNTP can act as an effective terminator through steric and/or electrostatic hindrance. The presence of one incorporation event prevents further binding unless the dye is removed. Cleavage of the dye removes the fluorophore, effectively reversing the termination. Examples of modified nucleotides are also described in U.S. Patent Nos. 7,427,673 and 7,057,026.

本明細書に記載の方法及びシステムとともに利用することができる追加の例示的なＳＢＳシステム及び方法は、米国特許出願公開第２００７／０１６６７０５号、同第２００６／０１８８９０１号、同第２００６／０２４０４３９号、同第２００６／０２８１１０９号、同第２０１２／０２７０３０５号、及び同第２０１３／０２６０３７２号、米国特許第７，０５７，０２６号、及び国際公開第０５／０６５８１４号、米国特許出願公開第２００５／０１００９００号、及び国際公開第０６／０６４１９９号及び同第０７／０１０，２５１号に記載されている。 Additional exemplary SBS systems and methods that can be utilized with the methods and systems described herein are described in U.S. Patent Application Publication Nos. 2007/0166705, 2006/0188901, 2006/0240439, 2006/0281109, 2012/0270305, and 2013/0260372, U.S. Patent No. 7,057,026, and International Publication No. WO 05/065814, U.S. Patent Application Publication No. 2005/0100900, and International Publication Nos. WO 06/064199 and WO 07/010,251.

いくつかの実施形態は、４つ未満の異なる標識を使用する４つの異なるヌクレオチドの検出を使用することができる。例えば、ＳＢＳは、組み込まれた資料である米国特許公開第２０１３／００７９２３２号に記載される方法及びシステムを使用して実施することができる。第１の例として、ヌクレオチド型の対は、同じ波長で検出することができるが、対のうちの一方のメンバーの他方と比較した強度の差に基づいて、又は、対のうちの他方のメンバーについて検出されたシグナルと比較して明らかなシグナルを出現又は消失させる、対のうちの一方のメンバーへの変化（例えば、化学修飾、光化学修飾、又は物理的修飾を行うことを介して）に基づいて区別され得る。第２の例として、４つの異なるヌクレオチド型のうちの３つを特定の条件下で検出することができ、一方、第４のヌクレオチド型は、それらの条件下で検出可能な標識がないか、又はそれらの条件下で最小限に検出される（例えば、バックグラウンド蛍光による最小限の検出など）。最初の３つのヌクレオチド型を核酸に組み込むことは、それらの対応するシグナルの存在に基づいて決定することができ、第４のヌクレオチド型を核酸に組み込むことは、任意のシグナルの不在又は最小限の検出に基づいて決定することができる。第３の例として、１つのヌクレオチド型は、２つの異なるチャネルで検出される標識を含むことができ、一方、他のヌクレオチド型は、チャネルのうちの１つ以下で検出される。前述の３つの例示的な構成は、相互に排他的であるとはみなされず、様々な組み合わせで使用することができる。３つ全ての実施例を組み合わせた例示的な実施形態は、第１のチャネルで検出される第１のヌクレオチド型（例えば、第１の励起波長によって励起されたときに第１のチャネルで検出される標識を有するｄＡＴＰ）、第２のチャネルで検出される第２のヌクレオチド型（例えば、第２の励起波長によって励起されたときに第２のチャネルで検出される標識を有するｄＣＴＰ）、第１及び第２のチャネルの両方において検出される第３のヌクレオチド型（例えば、第１及び／又は第２の励起波長によって励起されたときに両方のチャネルで検出される少なくとも１つの標識を有するｄＴＴＰ）、及びいずれのチャネルでも検出されないか、又は最小限に検出される、標識のない第４のヌクレオチド型（例えば、標識のないｄＧＴＰ）を使用する蛍光ベースのＳＢＳ法である。 Some embodiments may employ detection of four different nucleotides using fewer than four different labels. For example, SBS may be performed using the methods and systems described in the incorporated document, U.S. Patent Publication No. 2013/0079232. As a first example, pairs of nucleotide types may be detected at the same wavelength but may be distinguished based on differences in intensity of one member of the pair compared to the other, or based on a change to one member of the pair (e.g., via chemical, photochemical, or physical modification) that results in the appearance or disappearance of a distinct signal compared to the signal detected for the other member of the pair. As a second example, three of the four different nucleotide types may be detected under certain conditions, while the fourth nucleotide type may have no detectable label under those conditions or be minimally detected under those conditions (e.g., minimal detection due to background fluorescence). Incorporation of the first three nucleotide types into a nucleic acid may be determined based on the presence of their corresponding signals, and incorporation of the fourth nucleotide type into a nucleic acid may be determined based on the absence or minimal detection of any signal. As a third example, one nucleotide type can include a label that is detected in two different channels, while the other nucleotide type is detected in one or less of the channels. The three exemplary configurations described above are not considered mutually exclusive and can be used in various combinations. An exemplary embodiment that combines all three examples is a fluorescence-based SBS method that uses a first nucleotide type (e.g., dATP having a label that is detected in the first channel when excited with a first excitation wavelength), a second nucleotide type (e.g., dCTP having a label that is detected in the second channel when excited with a second excitation wavelength), a third nucleotide type (e.g., dTTP having at least one label that is detected in both channels when excited with the first and/or second excitation wavelengths) that is detected in both the first and second channels, and a fourth nucleotide type (e.g., unlabeled dGTP) that is not or minimally detected in either channel.

更に、米国特許出願公開第２０１３／００７９２３２号に記載のように、シークエンシングデータは、単一のチャネルを使用して得ることができる。このようないわゆる１つの染料シークエンシング方法では、第１のヌクレオチド型は標識されるが、第１の画像が生成された後に標識が除去され、第２のヌクレオチド型は、第１の画像が生成された後にのみ標識される。第３のヌクレオチド型は、第１及び第２の画像の両方においてその標識を保持し、第４のヌクレオチド型は、両方の画像において標識されていないままである。 Furthermore, as described in U.S. Patent Application Publication No. 2013/0079232, sequencing data can be obtained using a single channel. In this so-called single-dye sequencing method, a first nucleotide type is labeled but the label is removed after the first image is generated, and a second nucleotide type is labeled only after the first image is generated. A third nucleotide type retains its label in both the first and second images, and a fourth nucleotide type remains unlabeled in both images.

いくつかの実施形態は、ライゲーション技術によるシークエンシングを使用することができる。このような技術は、ＤＮＡリガーゼを使用してオリゴヌクレオチドを組み込み、そのようなオリゴヌクレオチドの組み込みを識別する。オリゴヌクレオチドは、典型的には、オリゴヌクレオチドがハイブリダイズする配列中の特定のヌクレオチドの同一性と相関する異なる標識を有する。他のＳＢＳ方法と同様に、標識されたシークエンシング試薬で核酸配列のアレイを処理した後、画像を得ることができる。各画像は、特定の型の標識を組み込んだ核酸特徴部を示す。各特徴部の配列コンテンツが異なるため、様々な画像に様々な特徴部が存在するか、存在しないが、特徴部の相対的な位置は、画像内で変わらないままである。ライゲーションベースのシークエンシング方法から得られる画像は、本明細書に記載されるように保存、処理、及び分析することができる。本明細書に記載の方法及びシステムとともに利用することができる例示的なＳＢＳシステム及び方法は、米国特許第６，９６９，４８８号、同第６，１７２，２１８号、及び同第６，３０６，５９７号に記載されている。 Some embodiments may use sequencing by ligation techniques. Such techniques use DNA ligase to incorporate and identify oligonucleotides. The oligonucleotides typically have different labels that correlate with the identity of specific nucleotides in the sequence to which the oligonucleotides hybridize. As with other SBS methods, images can be obtained after treating an array of nucleic acid sequences with labeled sequencing reagents. Each image shows nucleic acid features that incorporate a particular type of label. Because the sequence content of each feature varies, different features may or may not be present in different images, but the relative positions of the features remain constant within the image. Images obtained from ligation-based sequencing methods can be stored, processed, and analyzed as described herein. Exemplary SBS systems and methods that can be utilized with the methods and systems described herein are described in U.S. Patent Nos. 6,969,488, 6,172,218, and 6,306,597.

いくつかの実施形態は、ナノ細孔シークエンシングを使用することができる（Ｄｅａｍｅｒ，Ｄ．Ｗ．＆Ａｋｅｓｏｎ，Ｍ．「Ｎａｎｏｐｏｒｅｓａｎｄｎｕｃｌｅｉｃａｃｉｄｓ：ｐｒｏｓｐｅｃｔｓｆｏｒｕｌｔｒａｒａｐｉｄｓｅｑｕｅｎｃｉｎｇ．」、ＴｒｅｎｄｓＢｉｏｔｅｃｈｎｏｌ．１８，１４７－１５１（２０００）、Ｄｅａｍｅｒ，Ｄ．ａｎｄＤ．Ｂｒａｎｔｏｎ，「Ｃｈａｒａｃｔｅｒｉｚａｔｉｏｎｏｆｎｕｃｌｅｉｃａｃｉｄｓｂｙｎａｎｏｐｏｒｅａｎａｌｙｓｉｓ」，Ａｃｃ．Ｃｈｅｍ．Ｒｅｓ．３５：８１７－８２５（２００２）、Ｌｉ，Ｊ．，Ｍ．Ｇｅｒｓｈｏｗ，Ｄ．Ｓｔｅｉｎ，Ｅ．Ｂｒａｎｄｉｎ，ａｎｄＪ．Ａ．Ｇｏｌｏｖｃｈｅｎｋｏ，「ＤＮＡｍｏｌｅｃｕｌｅｓａｎｄｃｏｎｆｉｇｕｒａｔｉｏｎｓｉｎａｓｏｌｉｄ－ｓｔａｔｅｎａｎｏｐｏｒｅｍｉｃｒｏｓｃｏｐｅ」Ｎａｔ．Ｍａｔｅｒ．２：６１１－６１５（２００３））。そのような実施形態では、非対称標的核酸は、ナノ細孔を通過する。ナノ細孔は、α－ヘモリジンなどの合成孔又は生体膜タンパク質であり得る。非対称標的核酸がナノ細孔を通過するとき、各塩基対は、細孔の電気コンダクタンスの変動を測定することによって識別することができる。（米国特許第７，００１，７９２号、Ｓｏｎｉ，Ｇ．Ｖ．＆Ｍｅｌｌｅｒ，「Ａ．ＰｒｏｇｒｅｓｓｔｏｗａｒｄｕｌｔｒａｆａｓｔＤＮＡｓｅｑｕｅｎｃｉｎｇｕｓｉｎｇｓｏｌｉｄ－ｓｔａｔｅｎａｎｏｐｏｒｅｓ．」Ｃｌｉｎ．Ｃｈｅｍ．５３，１９９６－２００１（２００７）、Ｈｅａｌｙ，Ｋ．「Ｎａｎｏｐｏｒｅ－ｂａｓｅｄｓｉｎｇｌｅ－ｍｏｌｅｃｕｌｅＤＮＡａｎａｌｙｓｉｓ．」Ｎａｎｏｍｅｄ．２，４５９－４８１（２００７）、Ｃｏｃｋｒｏｆｔ，Ｓ．Ｌ．，Ｃｈｕ，Ｊ．，Ａｍｏｒｉｎ，Ｍ．＆Ｇｈａｄｉｒｉ，Ｍ．Ｒ．「Ａｓｉｎｇｌｅ－ｍｏｌｅｃｕｌｅｎａｎｏｐｏｒｅｄｅｖｉｃｅｄｅｔｅｃｔｓＤＮＡｐｏｌｙｍｅｒａｓｅａｃｔｉｖｉｔｙｗｉｔｈｓｉｎｇｌｅ－ｎｕｃｌｅｏｔｉｄｅｒｅｓｏｌｕｔｉｏｎ．」Ｊ．ＡｍＣｈｅｍ．Ｓｏｃ．１３０，８１８－８２０（２００８）。ナノ細孔シークエンシングから得られるデータは、本明細書に記載されるように、保存、処理、及び分析することができる。具体的には、データは、本明細書に記載される光学画像及び他の画像の例示的な処理に従って、画像として処理することができる。 Some embodiments can use nanopore sequencing (Deamer, D.W. & Akeson, M. "Nanopores and nucleic acids: prospects for ultrarapid sequencing." Trends Biotechnol. 18, 147-151 (2000); Deamer, D. and D. Branton, "Characterization of nucleic acids by nanopore sequencing"). (See, for example, Li, J., M. Gershow, D. Stein, E. Brandin, and J. A. Golovchenko, "DNA molecules and configurations in a solid-state nanopore microscope," Nat. Mater. 2:611-615 (2003)). In such embodiments, an asymmetric target nucleic acid passes through a nanopore. The nanopore can be a synthetic pore or a biological membrane protein, such as α-hemolysin. As the asymmetric target nucleic acid passes through the nanopore, each base pair can be distinguished by measuring the fluctuation in the electrical conductance of the pore. (U.S. Pat. No. 7,001,792, Soni, G.V. & Meller, “A. Progress toward ultrafast DNA sequencing using solid-state Clin. Chem. 53, 1996-2001 (2007), Healy, K. "A single-molecule nanopore" "Device detects DNA polymerase activity with single-nucleotide resolution." J. Am Chem. Soc. 130, 818-820 (2008). Data obtained from nanopore sequencing can be stored, processed, and analyzed as described herein. Specifically, the data can be processed as images according to the exemplary processing of optical and other images described herein.

いくつかの実施形態は、ＤＮＡポリメラーゼ活性のリアルタイムモニタリングを含む方法を使用することができる。ヌクレオチドの組み込みは、例えば、米国特許第７，３２９，４９２号及び同第７，２１１，４１４号に記載されているようなフルオロフォア含有ポリメラーゼとγ－リン酸標識ヌクレオチドとの間の蛍光共鳴エネルギー移動（fluorescence resonance energy transfer、ＦＲＥＴ）相互作用を介して検出することができ、又はヌクレオチドの組み込みは、例えば、米国特許第７，３１５，０１９号に記載されているようなゼロモード導波路、並びに、例えば、米国特許第７，４０５，２８１号及び米国特許出願公開第２００８／０１０８０８２号に記載されているような蛍光ヌクレオチド類似体及び操作ポリメラーゼを使用して検出することができる。照明は、蛍光標識されたヌクレオチドの組み込みが低バックグラウンドで観察され得るように、表面繋留ポリメラーゼの周囲のゼプトリットルスケールの体積に制限することができる（Ｌｅｖｅｎｅ，Ｍ．Ｊ．ｅｔａｌ．「Ｚｅｒｏ－ｍｏｄｅｗａｖｅｇｕｉｄｅｓｆｏｒｓｉｎｇｌｅ－ｍｏｌｅｃｕｌｅａｎａｌｙｓｉｓａｔｈｉｇｈｃｏｎｃｅｎｔｒａｔｉｏｎｓ．」Ｓｃｉｅｎｃｅ，２９９，６８２－６８６（２００３）、Ｌｕｎｄｑｕｉｓｔ，Ｐ．Ｍ．ｅｔａｌ．「Ｐａｒａｌｌｅｌｃｏｎｆｏｃａｌｄｅｔｅｃｔｉｏｎｏｆｓｉｎｇｌｅｍｏｌｅｃｕｌｅｓｉｎｒｅａｌｔｉｍｅ．」Ｏｐｔ．Ｌｅｔｔ．３３，１０２６－１０２８（２００８）、Ｋｏｒｌａｃｈ，Ｊ．ｅｔａｌ．「ＳｅｌｅｃｔｉｖｅａｌｕｍｉｎｕｍｐａｓｓｉｖａｔｉｏｎｆｏｒｔａｒｇｅｔｅｄｉｍｍｏｂｉｌｉｚａｔｉｏｎｏｆｓｉｎｇｌｅＤＮＡｐｏｌｙｍｅｒａｓｅｍｏｌｅｃｕｌｅｓｉｎｚｅｒｏ－ｍｏｄｅｗａｖｅｇｕｉｄｅｎａｎｏｓｔｒｕｃｔｕｒｅｓ．」Ｐｒｏｃ．Ｎａｔｌ．Ａｃａｄ．Ｓｃｉ．ＵＳＡ１０５、１１７６－１１８１（２００８））。このような方法から得られる画像は、本明細書に記載されるように、記憶、処理、及び分析することができる。 Some embodiments can utilize methods involving real-time monitoring of DNA polymerase activity. Nucleotide incorporation can be detected via fluorescence resonance energy transfer (FRET) interactions between a fluorophore-containing polymerase and a γ-phosphate-labeled nucleotide, as described, for example, in U.S. Pat. Nos. 7,329,492 and 7,211,414, or nucleotide incorporation can be detected using zero-mode waveguides, as described, for example, in U.S. Pat. No. 7,315,019, and fluorescent nucleotide analogs and engineered polymerases, as described, for example, in U.S. Pat. No. 7,405,281 and U.S. Patent Application Publication No. 2008/0108082. Illumination can be restricted to a zeptoliter-scale volume around the surface-tethered polymerase so that incorporation of fluorescently labeled nucleotides can be observed with low background (Levene, M.J. et al. "Zero-mode waveguides for single-molecule analysis at high concentration." Science, 299, 682-686 (2003); Lundquist, P.M. et al. "Parallel confocal detection of single molecules in real time." Opt. Lett. 33, 1026-1028 (2008); Korlach, J. et al. al. "Selective aluminum passivation for targeted immobilization of single DNA polymerase molecules in zero-mode waveguide nanostructures." Proc. Natl. Acad. Sci. USA 105, 1176-1181 (2008)). Images obtained from such methods can be stored, processed, and analyzed as described herein.

いくつかのＳＢＳ実施形態は、伸長生成物へのヌクレオチドの組み込み時に放出されるプロトンの検出を含む。例えば、放出されたプロトンの検出に基づくシークエンシングは、ＩｏｎＴｏｒｒｅｎｔ社（Ｇｕｉｌｆｏｒｄ，ＣＴ、ＬｉｆｅＴｅｃｈｎｏｌｏｇｉｅｓ社子会社）から市販されている電気検出器及び関連技術、又は米国特許出願公開第２００９／００２６０８２号、同第２００９／０１２７５８９号、同第２０１０／０１３７１４３号、及び同第２０１０／０２８２６１７号に記載のシークエンシング方法及びシステムを使用することができる。結合平衡除外を使用して標的核酸を増幅するための本明細書に記載の方法は、プロトンを検出するために使用される基質に容易に適用することができる。より具体的には、本明細書に記載の方法を使用して、プロトンを検出するために使用されるアンプリコンのクローン集団を生成することができる。 Some SBS embodiments involve the detection of protons released upon incorporation of a nucleotide into an extension product. For example, sequencing based on detection of released protons can use commercially available electrical detectors and related technology from Ion Torrent (Guilford, CT, a Life Technologies subsidiary), or the sequencing methods and systems described in U.S. Patent Application Publication Nos. 2009/0026082, 2009/0127589, 2010/0137143, and 2010/0282617. The methods described herein for amplifying target nucleic acids using equilibrium exclusion can be readily adapted to substrates used to detect protons. More specifically, the methods described herein can be used to generate clonal populations of amplicons used to detect protons.

上記のＳＢＳ方法は、複数の異なる非対称標的核酸が同時に操作されるように、多重形式で有利に実施することができる。特定の実施形態では、異なる非対称標的核酸は、共通の反応容器又は特定の基質の表面で処理することができる。これにより、シークエンシング試薬の簡便な送達、未反応試薬の除去、及び組み込みイベントの検出を多重に可能になる。表面結合された標的核酸を使用する実施形態では、非対称標的核酸は、アレイ形式であり得る。アレイ形式では、非対称標的核酸は、典型的には、空間的に区別可能な様式で表面に結合され得る。非対称標的核酸は、直接共有結合、ビーズ若しくは他の粒子への付着、又は表面に付着したポリメラーゼ若しくは他の分子への結合によって結合され得る。アレイは、各部位（特徴部とも呼ばれる）に非対称標的核酸の単一コピーを含むか、又は同じ配列を有する複数のコピーが、各部位若しくは特徴部に存在することができる。複数のコピーは、本明細書で更に詳細に記載されるブリッジ増幅又はエマルジョンＰＣＲなどの増幅方法によって生成することができる。 The SBS methods described above can be advantageously performed in a multiplex format, allowing multiple different asymmetric target nucleic acids to be manipulated simultaneously. In certain embodiments, the different asymmetric target nucleic acids can be processed in a common reaction vessel or on the surface of a specific substrate. This allows for convenient delivery of sequencing reagents, removal of unreacted reagents, and multiplexed detection of incorporation events. In embodiments using surface-bound target nucleic acids, the asymmetric target nucleic acids can be in an array format. In an array format, the asymmetric target nucleic acids can typically be bound to a surface in a spatially distinguishable manner. The asymmetric target nucleic acids can be bound by direct covalent binding, attachment to beads or other particles, or binding to a polymerase or other molecule attached to a surface. Arrays can contain a single copy of the asymmetric target nucleic acid at each site (also called a feature), or multiple copies with the same sequence can be present at each site or feature. Multiple copies can be generated by amplification methods such as bridge amplification or emulsion PCR, as described in further detail herein.

本明細書に記載の方法は、例えば、少なくとも約１０個の特徴部／ｃｍ^２、１００個の特徴部／ｃｍ^２、５００個の特徴部／ｃｍ^２、１，０００個の特徴部／ｃｍ^２、５，０００個の特徴部／ｃｍ^２、１０，０００個の特徴部／ｃｍ^２、５０，０００個の特徴部／ｃｍ^２、１００，０００個の特徴部／ｃｍ^２、１，０００，０００個の特徴部／ｃｍ^２、５，０００，０００個の特徴部／ｃｍ^２、又はそれ以上を含む、様々な密度のいずれかの特徴部を有するアレイを使用することができる。 The methods described herein can use arrays having any of a variety of densities of features, including, for example, at least about 10 features/cm ² , 100 features/cm ² , 500 features/cm ² , 1,000 features/cm ² , 5,000 features/cm ² , 10,000 features/cm ² , 50,000 features/cm ² , 100,000 features/cm ² , 1,000,000 features/cm ² , 5,000,000 features/ ^cm 2 , or more.

本明細書に記載の方法の利点は、複数のｃｍ^２の迅速かつ効率的で、並行な検出を提供することである。したがって、本開示は、本明細書に例示されるものなどの当技術分野において既知の技術を使用して核酸を調製及び検出することができる統合システムを提供する。したがって、本開示の統合システムは、増幅試薬及び／又はシークエンシング試薬を１つ以上の固定化された非対称標的核酸に送達することができる流体構成要素を含むことができ、システムは、ポンプ、弁、リザーバー、流体ラインなどの構成要素を含む。フローセルは、標的核酸を検出するための統合システムで構成及び／又は使用することができる。例示的なフロー細胞は、例えば、米国特許第８，２４１，５７３号及び米国特許第８，９５１，７８１号に記載されている。フローセルについて例示されるように、統合システムの流体成分の１つ以上を増幅方法及び検出方法に使用することができる。核酸シークエンシングの実施形態を一例としてとると、統合システムの流体構成要素の１つ以上を、本明細書に記載の増幅方法、及び上記に例示したようなシークエンシング方法におけるシークエンシング試薬の送達に使用することができる。あるいは、統合システムは、増幅方法を実行し、検出方法を実行するための別個の流体システムを含み得る。増幅された核酸を作成し、又核酸の配列を決定することができる統合シークエンシングシステムの例としては、ＭｉＳｅｑＴＭプラットフォーム（Ｉｌｌｕｍｉｎａ，Ｉｎｃ．，ＳａｎＤｉｅｇｏ，ＣＡ）、及び米国特許第８，９５１，７８１号に記載の装置が挙げられるが、これらに限定されない。 An advantage of the methods described herein is that they provide rapid, efficient, and parallel detection of multiple ^cm2 . Accordingly, the present disclosure provides an integrated system capable of preparing and detecting nucleic acids using techniques known in the art, such as those exemplified herein. Thus, the integrated system of the present disclosure can include fluidic components capable of delivering amplification and/or sequencing reagents to one or more immobilized asymmetric target nucleic acids, including components such as pumps, valves, reservoirs, and fluid lines. A flow cell can be configured and/or used in the integrated system for detecting target nucleic acids. Exemplary flow cells are described, for example, in U.S. Pat. Nos. 8,241,573 and 8,951,781. As exemplified for the flow cell, one or more of the fluidic components of the integrated system can be used for the amplification and detection methods. Taking the nucleic acid sequencing embodiment as an example, one or more of the fluidic components of the integrated system can be used for the amplification methods described herein and for delivering sequencing reagents in the sequencing methods exemplified above. Alternatively, the integrated system can include separate fluidic systems for performing the amplification method and the detection method. Examples of integrated sequencing systems that can generate amplified nucleic acids and also determine the sequence of nucleic acids include, but are not limited to, the MiSeq™ platform (Illumina, Inc., San Diego, CA) and the device described in U.S. Pat. No. 8,951,781.

組成物
本開示によって提供される方法の実施中に、いくつかの組成物が生じ得る。例えば、トランスポソーム複合体及び損傷不耐性ＤＮＡポリメラーゼを含む組成物が生じ得る。トランスポソームは、アダプターを含むトランスポゾン配列に結合したトランスポザーゼを含み得る。アダプターは、１つ以上のＤＮＡ損傷、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含み得る。組成物は、標的核酸を更に含むことができる。任意選択的に、組成物は、損傷耐性ＤＮＡポリメラーゼを含み得る。 Compositions During the implementation of the methods provided by the present disclosure, several compositions may be produced. For example, a composition may be produced that includes a transposome complex and a damage-intolerant DNA polymerase. The transposome may include a transposase bound to a transposon sequence that includes an adapter. The adapter may include one or more DNA lesions, one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof. The composition may further include a target nucleic acid. Optionally, the composition may include a damage-tolerant DNA polymerase.

別の実施形態では、組成物は、複数の一本鎖修飾標的核酸、プライマー、及び損傷不耐性ＤＮＡポリメラーゼを有することになり得る。例えば、標的核酸は、５’から３’に、第１のアダプター、標的核酸、及び第１のアダプターの相補体を含み得る。第１のアダプターは、１つ以上のＤＮＡ損傷、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含むことができる。一実施形態では、ユニバーサル配列は、トランスポザーゼ認識部位を含み得る。プライマーは、５’から３’に、第２のアダプター、及び第１のアダプターの相補体にアニーリングするヌクレオチド配列を含み得る。第２のアダプターは、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含むことができる。プライマーは、任意選択的に、ブロック３’末端を含み得、任意選択的に少なくとも１つの改変ヌクレオチドを含み得る。一実施形態では、プライマーは、一本鎖修飾標的核酸にアニーリングされる。 In another embodiment, the composition may comprise a plurality of single-stranded modified target nucleic acids, primers, and a damage-intolerant DNA polymerase. For example, the target nucleic acid may comprise, from 5' to 3', a first adapter, the target nucleic acid, and the complement of the first adapter. The first adapter may comprise one or more DNA lesions, one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof. In one embodiment, the universal sequence may comprise a transposase recognition site. The primer may comprise, from 5' to 3', a second adapter and a nucleotide sequence that anneals to the complement of the first adapter. The second adapter may comprise one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof. The primer may optionally comprise a blocked 3' end and may optionally comprise at least one modified nucleotide. In one embodiment, the primer anneals to the single-stranded modified target nucleic acid.

別の実施形態では、組成物は、トランスポソーム複合体を含む。トランスポソーム複合体は、トランスポザーゼ及びトランスポゾンを含むが、これらに限定されない。一実施形態では、トランスポゾンは、アダプターを含む。アダプターは、例えば、５’から３’に少なくとも１つのユニバーサル配列、少なくとも１つのインデックス配列、少なくとも１つのＵＭＩ、又はそれらの組み合わせ、ＤＮＡ損傷、及びトランスポザーゼ認識配列を有する第１の鎖を含み得る。一実施形態では、トランスポザーゼ認識配列は、モザイク要素を含む。アダプターは、例えば、トランスポザーゼ認識配列の少なくとも一部に相補的なヌクレオチドを有する第２の鎖を含み得る。一実施形態では、第１の鎖はまた、５’末端に捕捉剤を含むか、又は第２の鎖も、３’末端に捕捉剤を含む。一実施形態では、切断可能なリンカーは、捕捉剤と第１の鎖の５’末端との間に位置する。一実施形態では、切断可能なリンカーは、捕捉剤と第２の鎖の３’末端との間に位置する。一実施形態では、組成物は、固体表面を更に含み、トランスポザーゼ複合体が、固体表面に付着している。別の実施形態では、組成物は、固体表面を更に含み、トランスポゾンはトランスポザーゼに関連しておらず、トランスポゾンは固体表面に結合している。 In another embodiment, the composition comprises a transposome complex. The transposome complex includes, but is not limited to, a transposase and a transposon. In one embodiment, the transposon includes an adapter. The adapter can include, for example, a first strand having, from 5' to 3', at least one universal sequence, at least one index sequence, at least one UMI, or a combination thereof, DNA damage, and a transposase recognition sequence. In one embodiment, the transposase recognition sequence includes a mosaic element. The adapter can include, for example, a second strand having nucleotides complementary to at least a portion of the transposase recognition sequence. In one embodiment, the first strand also includes a capture agent at its 5' end, or the second strand also includes a capture agent at its 3' end. In one embodiment, a cleavable linker is positioned between the capture agent and the 5' end of the first strand. In one embodiment, the cleavable linker is positioned between the capture agent and the 3' end of the second strand. In one embodiment, the composition further includes a solid surface, and the transposase complex is attached to the solid surface. In another embodiment, the composition further comprises a solid surface, wherein the transposon is not associated with a transposase, and the transposon is bound to the solid surface.

キット
本開示はまた、本明細書で提供される方法の１つ以上の態様を実施するためのキットを提供する。キットは、標的核酸のライブラリーを生成するために使用することができる。一実施形態では、キットは、対称標的核酸のライブラリーを生成するために使用することができる。キットは、別個の容器を、トランスポソーム複合体及び損傷不耐性ＤＮＡポリメラーゼを含見得る。トランスポソームは、トランスポゾン配列に結合したトランスポザーゼを含むことができ、トランスポゾン配列は、アダプター及びＤＮＡ損傷を含む。一実施形態では、キットは、対称ライブラリーを非対称ライブラリーに変換するために使用することができる。この実施形態では、キットは、プライマーを更に含むことができる。一実施形態では、プライマーは、５’から３’に第２のアダプター及び第１のアダプターの相補体にアニーリングするヌクレオチド配列を含む。 Kits The present disclosure also provides kits for carrying out one or more aspects of the methods provided herein. The kits can be used to generate libraries of target nucleic acids. In one embodiment, the kits can be used to generate libraries of symmetric target nucleic acids. The kits can contain a transposome complex and a damage-intolerant DNA polymerase in separate containers. The transposomes can contain a transposase bound to a transposon sequence, and the transposon sequence contains an adapter and DNA damage. In one embodiment, the kits can be used to convert a symmetric library into an asymmetric library. In this embodiment, the kits can further contain primers. In one embodiment, the primers contain a nucleotide sequence that anneals 5' to 3' to the second adapter and the complement of the first adapter.

キットの成分は、少なくとも１つのライブラリーを生成するのに十分な量で好適なパッケージ材料中に存在し得る。任意選択的に、緩衝液（調製されたもの、又はその構成成分中に存在するかのいずれかであり、構成成分のうちの１つ以上が予め混合されていてもよく、又は構成成分の全てが別々であってもよい）などの他の試薬もまた含まれる。典型的には、パッケージされた構成要素の使用説明書も含まれる。 The components of the kit may be present in suitable packaging materials in amounts sufficient to generate at least one library. Optionally, other reagents, such as buffers (either prepared or present in their components, one or more of the components may be premixed, or all of the components may be separate), are also included. Instructions for use of the packaged components are also typically included.

本明細書で使用するとき、「パッケージ材料」という語句は、キットの内容物を収容するために使用される１つ以上の物理的構造を指す。パッケージ材料は、好ましくは無菌の、汚染物質を含まない環境を提供するために、既知の方法によって構築される。パッケージ材料は、シークエンシングライブラリーを生成するために構成成分が使用され得ることを示すラベルを有してよい。加えて、パッケージ材料は、本明細書で提供される方法の１つ以上の態様を実施するためにキット内の材料がどのように用いられるかを示す説明書を含む。本明細書で使用され場合、用語「パッケージ」とは、キットの１つ以上の構成成分を一定限度内に保持することができる、ガラス、プラスチック、紙、箔などの固体マトリックス又は材料を指す。「使用説明書」は、典型的には、試薬濃度、又は混合する試薬及び試料の相対量、試薬／試料混合物の維持期間、温度、緩衝条件など少なくとも１つのアッセイ法パラメータを説明する具体的な表現を含む。 As used herein, the phrase "packaging material" refers to one or more physical structures used to house the contents of the kit. The packaging material is preferably constructed by known methods to provide a sterile, contaminant-free environment. The packaging material may bear a label indicating that the components can be used to generate a sequencing library. In addition, the packaging material includes instructions indicating how the materials in the kit are to be used to practice one or more aspects of the methods provided herein. As used herein, the term "package" refers to a solid matrix or material, such as glass, plastic, paper, or foil, capable of holding one or more components of the kit within certain limits. "Instructions for use" typically include specific language describing at least one assay method parameter, such as reagent concentrations or relative amounts of reagents and sample to be mixed, duration of the reagent/sample mixture, temperature, buffer conditions, etc.

本発明は、特許請求の範囲に定義される。しかしながら、以下に非限定的な例示的態様の非網羅的なリストを提供する。これらの態様の特徴のうちの任意の１つ以上は、本明細書に記載される別の実施例、実施形態、又は態様のうちの任意の１つ以上の特徴と組み合わせることができる。 The present invention is defined in the claims. However, the following provides a non-exhaustive list of non-limiting exemplary aspects. Any one or more of the features of these aspects may be combined with any one or more features of any other example, embodiment, or aspect described herein.

例示的な態様
態様１は、シークエンシングライブラリーを生成するための方法であって、
各末端に第１のアダプター配列を含む複数の対称修飾標的核酸を提供することであって、第１のアダプター配列が、ＤＮＡ損傷を含む、提供することと、
修飾標的核酸を損傷不耐性ポリメラーゼで伸長して、各鎖の５’末端に第１のアダプター配列と、各鎖の３’末端に第１のアダプターの一部の相補体とを含む複数の非対称修飾標的核酸を生成することと、を含む、方法である。 Exemplary Embodiments Embodiment 1 is a method for generating a sequencing library, comprising:
providing a plurality of symmetrically modified target nucleic acids comprising a first adaptor sequence at each end, the first adaptor sequence comprising a DNA lesion;
extending the modified target nucleic acid with a damage-intolerant polymerase to generate a plurality of asymmetrically modified target nucleic acids comprising a first adapter sequence at the 5' end of each strand and a complement of a portion of the first adapter at the 3' end of each strand.

態様２は、複数の対称修飾標的核酸が、二本鎖であり、各鎖が、５’から３’にＤＮＡ損傷を含む第１のアダプター配列と、標的核酸と、少なくとも１つのヌクレオチドを含むギャップと、ＤＮＡ損傷を含まない第１のアダプター配列の相補体と、を含む、態様１に記載の方法である。 Aspect 2 is the method of aspect 1, wherein the plurality of symmetrically modified target nucleic acids are double-stranded, each strand comprising a first adapter sequence comprising a DNA lesion 5' to 3', the target nucleic acid, a gap comprising at least one nucleotide, and the complement of the first adapter sequence that does not comprise the DNA lesion.

態様３は、伸長が、ギャップで開始する、態様１又は２に記載の方法である。 Aspect 3 is the method of aspect 1 or 2, wherein the extension begins at a gap.

態様４は、
プライマーを複数の非対称修飾標的核酸にアニーリングすることであって、プライマーが、５’から３’に第２のアダプター配列及びアニーリングドメインを含み、アニーリングドメインが、複数の非対称修飾標的核酸の第１のアダプターの一部の相補体にアニーリングするヌクレオチド配列を含む、アニーリングすることと、
アニーリングされた非対称修飾標的核酸の３’末端を、損傷不耐性ポリメラーゼで伸長することであって、伸長が、５’から３’に（ｉ）第１のアダプター、（ｉｉ）標的核酸、（ｉｉｉ）第１のアダプターの一部の相補体、及び（ｉｖ）第２のアダプターの相補体を含む、複数の非対称修飾標的核酸をもたらす、伸長することと、を更に含む、態様２又は３のいずれかの方法である。 Aspect 4 is
annealing a primer to a plurality of asymmetrically modified target nucleic acids, the primer comprising a second adapter sequence from 5' to 3' and an annealing domain, the annealing domain comprising a nucleotide sequence that anneals to a complement of a portion of the first adapter of the plurality of asymmetrically modified target nucleic acids;
4. The method of any of aspects 2 or 3, further comprising extending the 3′ end of the annealed asymmetrically modified target nucleic acid with a damage-intolerant polymerase, wherein the extension results in a plurality of asymmetrically modified target nucleic acids comprising, from 5′ to 3′, (i) the first adaptor, (ii) the target nucleic acid, (iii) a complement of a portion of the first adaptor, and (iv) a complement of the second adaptor.

態様５は、アニーリングされた非対称修飾標的核酸の３’末端の伸長が少なくとも３回繰り返される、態様１～４のいずれか１つに記載の方法である。 Aspect 5 is a method according to any one of aspects 1 to 4, wherein the extension of the 3' end of the annealed asymmetrically modified target nucleic acid is repeated at least three times.

態様６は、ＤＮＡ損傷が、脱塩基部位、修飾塩基、ミスマッチ、一本鎖切断、又は架橋ヌクレオチドのうちの少なくとも１つを含む、態様１～５のいずれか１つに記載の方法である。 Aspect 6 is the method of any one of aspects 1 to 5, wherein the DNA damage includes at least one of an abasic site, a modified base, a mismatch, a single-strand break, or a crosslinked nucleotide.

態様７は、ＤＮＡ損傷が、少なくとも１つのウラシルを含む、態様１～６のいずれか１つに記載の方法である。 Aspect 7 is a method according to any one of aspects 1 to 6, wherein the DNA damage comprises at least one uracil.

態様８は、プライマーのアニーリングドメインが、対応する天然ＤＮＡヌクレオチドと比較して、融解温度を増加させる少なくとも１つの改変ヌクレオチドを含む、態様１～７のいずれか１つに記載の方法である。 Aspect 8 is a method according to any one of aspects 1 to 7, wherein the annealing domain of the primer comprises at least one modified nucleotide that increases the melting temperature compared to the corresponding natural DNA nucleotide.

態様９は、改変ヌクレオチドが、ロックド核酸、ＰＮＡ、又はＲＮＡを含む、態様１～８のいずれか１つに記載の方法である。 Aspect 9 is the method of any one of aspects 1 to 8, wherein the modified nucleotide comprises a locked nucleic acid, PNA, or RNA.

態様１０は、プライマーの３’末端がブロックされている、態様１～９のいずれか１つに記載の方法である。 Aspect 10 is the method of any one of aspects 1 to 9, wherein the 3' end of the primer is blocked.

態様１１は、第１のアダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のユニバーサル分子識別子、又はそれらの組み合わせを含む、態様１～１０のいずれか１つに記載の方法である。 Aspect 11 is the method of any one of aspects 1 to 10, wherein the first adapter comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.

態様１２は、１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子のうちの少なくとも１つが、ＤＮＡ損傷と標的核酸の遠位のアダプターの末端との間のアダプターに位置する、態様１～１１のいずれか１つに記載の方法である。 Aspect 12 is the method of any one of aspects 1 to 11, wherein at least one of the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers is located in an adaptor between the DNA lesion and the end of the adaptor distal to the target nucleic acid.

態様１３は、第２のアダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のユニバーサル分子識別子、又はそれらの組み合わせを含む、態様１～１２のいずれか１つに記載の方法である。 Aspect 13 is the method of any one of aspects 1 to 12, wherein the second adapter comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.

態様１４は、第１のアダプターの１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子が、第２のアダプターの１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子と比較して固有である、態様１～１３のいずれか１つに記載の方法である。 Aspect 14 is the method of any one of aspects 1 to 13, wherein the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the first adaptor are unique compared to the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the second adaptor.

態様１５は、第１のアダプターの１つ以上のインデックス配列が、区画特異的である、態様１～１４のいずれか１つに記載の方法である。 Aspect 15 is the method of any one of aspects 1 to 14, wherein one or more index sequences of the first adapter are compartment-specific.

態様１６は、第２のアダプターの１つ以上のインデックス配列が、区画特異的である、態様１～１５のいずれか１つに記載の方法である。 Aspect 16 is the method of any one of aspects 1 to 15, wherein one or more index sequences of the second adapter are compartment-specific.

態様１７．第１のアダプターが、トランスポザーゼ認識部位を含む、態様１～１６のいずれか１つに記載の方法である。 Aspect 17. The method of any one of Aspects 1 to 16, wherein the first adapter comprises a transposase recognition site.

態様１８は、標的核酸が、単一細胞に由来する核酸由来である、態様１～１７のいずれか１つに記載の方法である。 Aspect 18 is the method of any one of aspects 1 to 17, wherein the target nucleic acid is derived from nucleic acid derived from a single cell.

態様１９は、標的核酸が、複数の細胞に由来する核酸由来である、態様１～１８のいずれか１つに記載の方法である。 Aspect 19 is the method of any one of Aspects 1 to 18, wherein the target nucleic acid is derived from nucleic acids derived from multiple cells.

態様２０は、単一細胞又は複数の細胞に由来する標的核酸が、ＲＮＡを含む、態様１～１９のいずれか１つに記載の方法である。 Aspect 20 is the method of any one of aspects 1 to 19, wherein the target nucleic acid derived from a single cell or multiple cells comprises RNA.

態様２１は、ＲＮＡが、ｍＲＮＡを含む、態様１～２０のいずれか１つに記載の方法である。 Aspect 21 is the method of any one of aspects 1 to 20, wherein the RNA comprises mRNA.

態様２２は、単一細胞又は複数の細胞に由来する標的核酸が、ＤＮＡを含む、態様１～２１のいずれか１つに記載の方法である。 Aspect 22 is the method of any one of Aspects 1 to 21, wherein the target nucleic acid derived from a single cell or multiple cells comprises DNA.

態様２３は、ＤＮＡが、全細胞ゲノムＤＮＡを含む、態様１～２２のいずれか１つに記載の方法である。 Aspect 23 is the method of any one of aspects 1 to 22, wherein the DNA comprises whole cell genomic DNA.

態様２４は、全細胞ゲノムＤＮＡが、ヌクレオソームを含む、態様１～２３のいずれか１つに記載の方法である。 Aspect 24 is the method of any one of aspects 1 to 23, wherein the total cellular genomic DNA comprises nucleosomes.

態様２５は、標的核酸が無細胞ＤＮＡに由来する核酸由来である、態様１～２４７のいずれか１つに記載の方法である。 Aspect 25 is the method of any one of aspects 1 to 247, wherein the target nucleic acid is derived from nucleic acid derived from cell-free DNA.

態様２６は、方法が、コンビナトリアルインデックス付きを含む、態様１～２５のいずれか１つに記載の方法である。 Aspect 26 is the method of any one of aspects 1 to 25, wherein the method includes combinatorial indexing.

態様２７は、非対称修飾標的核酸を増幅することを更に含み、増幅が、第２のプライマー及び損傷耐性ポリメラーゼを含み、第２のプライマーが、第１のアダプター配列又はその相補体にアニーリングするヌクレオチド配列を含む、態様１～２６のいずれか１つに記載の方法である。 Aspect 27 is the method of any one of aspects 1 to 26, further comprising amplifying the asymmetrically modified target nucleic acid, wherein the amplification comprises a second primer and a damage-tolerant polymerase, and the second primer comprises a nucleotide sequence that anneals to the first adapter sequence or its complement.

態様２８は、第２のプライマーが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のユニバーサル分子識別子、又はそれらの組み合わせを更に含む、態様１～２７のいずれか１つに記載の方法である。 Aspect 28 is the method of any one of aspects 1 to 27, wherein the second primer further comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.

態様２９は、第２のプライマーの１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子が、第１のアダプター及び第２のアダプターの１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子と比較して固有である、態様１～２８のいずれか１つに記載の方法である。 Aspect 29 is the method of any one of aspects 1 to 28, wherein the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the second primer are unique compared to the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the first adapter and the second adapter.

態様３０は、複数の非対称修飾標的核酸のサブセットが、複数の区画内に存在し、（ｉ）第１のアダプターが、第１の区画特異的インデックスを含むか、（ｉｉ）第２のアダプターが、第２の区画特異的インデックスを含むか、又は（ｉ）及び（ｉｉ）の両方のいずれかである、態様１～２９のいずれか１つに記載の方法である。 Aspect 30 is the method of any one of aspects 1 to 29, wherein a subset of the multiple asymmetrically modified target nucleic acids is present in multiple compartments, and either (i) the first adapter comprises a first compartment-specific index, (ii) the second adapter comprises a second compartment-specific index, or both (i) and (ii).

態様３１．異なる区画からの非対称修飾標的核酸を組み合わせて、プールされたインデックス付き非対称修飾標的核酸を生成することを更に含む、態様１～３０のいずれか１つに記載の方法である。 Aspect 31. The method of any one of aspects 1 to 30, further comprising combining asymmetrically modified target nucleic acids from different compartments to generate pooled indexed asymmetrically modified target nucleic acids.

態様３２は、
プールされたインデックス付き非対称修飾標的核酸のサブセットを第２の複数の区画に分配し、インデックス付き非対称修飾標的核酸を修飾することを更に含み、修飾が、各サブセット中に存在するインデックス付き非対称修飾標的核酸に追加の区画特異的インデックス配列を付加して、インデックス付きＤＮＡ核酸をもたらすことを含み、修飾が、ライゲーション又は伸長を含む、態様１～３１のいずれか１つに記載の方法である。 Aspect 32 is
32. The method of any one of aspects 1-31, further comprising distributing a subset of the pooled indexed asymmetrically modified target nucleic acids into a second plurality of compartments and modifying the indexed asymmetrically modified target nucleic acids, wherein the modification comprises adding an additional compartment-specific index sequence to the indexed asymmetrically modified target nucleic acids present in each subset to result in indexed DNA nucleic acids, and wherein the modification comprises ligation or extension.

態様３３は、区画が、ウェル又は液滴を含む、態様１～３２のいずれか１つに記載の方法である。 Aspect 33 is the method of any one of aspects 1 to 32, wherein the compartment comprises a well or a droplet.

態様３４は、提供が、複数のＤＮＡ断片を第１のアダプターと、ＤＮＡ断片の両端に第１のアダプターをライゲートする条件下で、接触させることを含む、態様１～３３のいずれか１つに記載の方法である。 Aspect 34 is a method according to any one of aspects 1 to 33, wherein providing comprises contacting a plurality of DNA fragments with first adaptors under conditions that ligate the first adaptors to both ends of the DNA fragments.

態様３５は、ＤＮＡ断片が二本鎖及び平滑末端である、態様１～３４のいずれか１つに記載の方法である。 Aspect 35 is the method of any one of aspects 1 to 34, wherein the DNA fragment is double-stranded and blunt-ended.

態様３６は、第１のアダプターが、二本鎖ＤＮＡオリゴヌクレオチドである、態様１～３５のいずれか１つに記載の方法である。 Aspect 36 is the method of any one of aspects 1 to 35, wherein the first adapter is a double-stranded DNA oligonucleotide.

態様３７は、第１の伸長オリゴヌクレオチドの一方の３’末端が、ブロックされている、態様１～３６のいずれか一項に記載の方法である。 Aspect 37 is the method of any one of aspects 1 to 36, wherein one 3' end of the first extension oligonucleotide is blocked.

態様３８は、ＤＮＡ断片が二本鎖であり、一方又は両方の３’末端に一本鎖領域を含む、態様１～３７のいずれか１つに記載の方法である。 Aspect 38 is the method of any one of aspects 1 to 37, wherein the DNA fragment is double-stranded and contains a single-stranded region at one or both 3' ends.

態様３９は、第１のアダプターが、一方の末端に一本鎖領域を含む二本鎖ＤＮＡオリゴヌクレオチドであり、一本鎖領域が、ＤＮＡ断片上に存在する一本鎖領域にアニーリングすることができる、態様１～３８のいずれか１つに記載の方法である。 Aspect 39 is the method of any one of aspects 1 to 38, wherein the first adaptor is a double-stranded DNA oligonucleotide comprising a single-stranded region at one end, the single-stranded region being capable of annealing to a single-stranded region present on the DNA fragment.

態様４０は、アダプターがフォーク型アダプターである、態様１～３８のいずれか１つに記載の方法である。 Aspect 40 is the method of any one of aspects 1 to 38, wherein the adapter is a forked adapter.

態様４１は、提供が、ＤＮＡをトランスポソーム複合体と接触させることを含み、トランスポソーム複合体が、トランスポザーゼと、第１のアダプターと、を含み、接触が、第１のアダプターのＤＮＡへのライゲーションに好適な条件下で生じて、対称修飾標的核酸を生成する、態様１～４０のいずれか１つに記載の方法である。一態様では、トランスポソーム複合体は、態様６７～７１のいずれか１つに記載のトランスポソーム複合体である。 Aspect 41 is the method of any one of aspects 1 to 40, wherein providing comprises contacting the DNA with a transposome complex, the transposome complex comprising a transposase and a first adapter, and the contacting occurs under conditions suitable for ligation of the first adapter to the DNA to produce a symmetrically modified target nucleic acid. In one aspect, the transposome complex is the transposome complex of any one of aspects 67 to 71.

態様４２は、生成された対称修飾標的核酸が、ライゲートされた第１のアダプターと標的核酸との間で１つの鎖中に少なくとも１つのヌクレオチドのギャップを含む、態様１～４１のいずれか１つに記載の方法である。 Aspect 42 is the method of any one of aspects 1 to 41, wherein the generated symmetrically modified target nucleic acid comprises a gap of at least one nucleotide in one strand between the ligated first adaptor and the target nucleic acid.

態様４３は、ＤＮＡが複数の区画内に存在し、各区画内の第１のアダプターが、区画特異的インデックスを含む、態様１～４２のいずれか１つに記載の方法である。 Aspect 43 is a method according to any one of aspects 1 to 42, wherein the DNA is present in multiple compartments and the first adapter in each compartment comprises a compartment-specific index.

態様４４は、異なる区画からの一本鎖修飾標的核酸を組み合わせて、プールされた対称修飾標的核酸を生成することと、対称修飾標的核酸を第２の複数の区画に分配することと、を更に含む、態様１～４３のいずれか１つに記載の方法である。 Aspect 44 is the method of any one of aspects 1 to 43, further comprising combining single-stranded modified target nucleic acids from different compartments to generate pooled symmetrically modified target nucleic acids, and distributing the symmetrically modified target nucleic acids into a second plurality of compartments.

態様４５は、方法が、全細胞ゲノムＤＮＡの断片化を更に含む、態様１～４４のいずれか１つに記載の方法である。 Aspect 45 is the method of any one of aspects 1 to 44, wherein the method further comprises fragmenting the whole cell genomic DNA.

態様４６は、断片化が、制限エンドヌクレアーゼを用いた全細胞ゲノムＤＮＡの消化を含む、態様１～４５のいずれか１つに記載の方法である。 Aspect 46 is a method according to any one of aspects 1 to 45, wherein fragmenting comprises digestion of the total cellular genomic DNA with a restriction endonuclease.

態様４７は、断片化されたＤＮＡが、キメラ標的核酸を結合するための近接ライゲーションに供される、態様１～４６のいずれか１つに記載の方法である。 Aspect 47 is a method according to any one of aspects 1 to 46, wherein the fragmented DNA is subjected to proximity ligation to attach the chimeric target nucleic acid.

態様４８は、アダプターのシトシン残基が、５－メチルシトシンで置き換えられる、態様１～４７のいずれか１つに記載の方法である。 Aspect 48 is a method according to any one of aspects 1 to 47, wherein the cytosine residue of the adapter is replaced with 5-methylcytosine.

態様４９は、対称又は非対称標的核酸が、化学的又は酵素的メチル化変換に供される、態様１～４８のいずれか１つに記載の方法である。 Aspect 49 is a method according to any one of aspects 1 to 48, wherein a symmetric or asymmetric target nucleic acid is subjected to chemical or enzymatic methylation conversion.

態様５０は、提供が、単離された核を固定することと、単離された核を、ゲノムＤＮＡからヌクレオソームを解離させる条件に供することと、ゲノムＤＮＡを断片化することと、キメラ標的核酸を結合する近接ライゲーションに断片を供することと、ライゲートされた断片をトランスポソーム複合体に接触させることと、を含み、トランスポソーム複合体が、トランスポザーゼと、第１のアダプターと、を含み、接触が、第１のアダプターのＤＮＡへのライゲーションに好適な条件下で生じて、対称修飾標的核酸を生成する、態様１～４９のいずれか１つに記載の方法である。 Aspect 50 is the method of any one of aspects 1 to 49, wherein providing comprises fixing isolated nuclei, subjecting the isolated nuclei to conditions that dissociate nucleosomes from genomic DNA, fragmenting the genomic DNA, subjecting the fragments to proximity ligation that joins a chimeric target nucleic acid, and contacting the ligated fragments with a transposome complex, wherein the transposome complex comprises a transposase and a first adapter, and wherein the contacting occurs under conditions suitable for ligation of the first adapter to the DNA to generate a symmetrically modified target nucleic acid.

態様５１は、断片化が、制限エンドヌクレアーゼでの消化を含む、態様１～５０のいずれか１つに記載の方法である。 Aspect 51 is a method according to any one of aspects 1 to 50, wherein the fragmentation comprises digestion with a restriction endonuclease.

態様５２は、
複数の増幅部位を含む表面を提供することであって、
増幅部位が、遊離３’末端を有する結合した一本鎖捕捉オリゴヌクレオチドの少なくとも２つの集団を含む、提供することと、
個々の非対称修飾標的核酸からのアンプリコンのクローン集団を各々含む複数の増幅部位を生成するのに好適な条件下で、増幅部位を含む表面を、複数の非対称修飾標的核酸と接触させることと、を更に含む、態様１～５１のいずれか１つに記載の方法である。 Aspect 52 is
providing a surface comprising a plurality of amplification sites,
providing an amplification site comprising at least two populations of linked single-stranded capture oligonucleotides having free 3'ends;
Aspect 52. The method of any one of aspects 1-51, further comprising contacting the surface comprising the amplification sites with a plurality of asymmetrically modified target nucleic acids under conditions suitable to generate a plurality of amplification sites, each comprising a clonal population of amplicons from an individual asymmetrically modified target nucleic acid.

態様５３は、トランスポソーム複合体及びＤＮＡポリメラーゼを含む組成物であって、トランスポソームが、トランスポゾン配列に結合したトランスポザーゼを含み、トランスポゾン配列が、アダプター及びＤＮＡ損傷を含み、ＤＮＡポリメラーゼが、損傷不耐性ポリメラーゼである、組成物である。 Aspect 53 is a composition comprising a transposome complex and a DNA polymerase, wherein the transposome comprises a transposase bound to a transposon sequence, the transposon sequence comprises an adapter and DNA damage, and the DNA polymerase is a damage-intolerant polymerase.

態様５４は、アダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含む、態様５３に記載の組成物である。 Aspect 54 is the composition of aspect 53, wherein the adapter comprises one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof.

態様５５は、損傷耐性ＤＮＡポリメラーゼを更に含む、態様５３又は５４に記載の組成物である。 Aspect 55 is the composition according to Aspect 53 or 54, further comprising a damage-tolerant DNA polymerase.

態様５６は、
５’～３’にＤＮＡ損傷を含む第１のアダプター、標的核酸、及び第１のアダプターの相補体を含む複数の修飾標的核酸と、
５’から３’に第２のアダプターと、アニーリングドメインと、を含むプライマーであって、アニーリングドメインが、第１のアダプターの相補体にアニーリングするヌクレオチド配列を含む、プライマーと、
損傷不耐性ＤＮＡポリメラーゼと、を含む、組成物である。 Aspect 56 is
a plurality of modified target nucleic acids comprising a first adaptor comprising a 5' to 3' DNA lesion, a target nucleic acid, and a complement of the first adaptor;
a primer comprising a second adaptor from 5' to 3' and an annealing domain, wherein the annealing domain comprises a nucleotide sequence that anneals to the complement of the first adaptor;
and a damage-intolerant DNA polymerase.

態様５７は、プライマーが、対応する天然ＤＮＡヌクレオチドと比較して、融解温度を増加させる少なくとも１つの改変ヌクレオチドを含む、態様５６に記載の組成物である。 Aspect 57 is the composition of aspect 56, wherein the primer comprises at least one modified nucleotide that increases the melting temperature compared to the corresponding natural DNA nucleotide.

態様５８は、プライマーが標的核酸にアニーリングされる、態様５６又は５７に記載の組成物である。 Aspect 58 is the composition of aspect 56 or 57, wherein the primer is annealed to the target nucleic acid.

態様５９は、プライマーの３’末端が、ブロックされている、態様５６～５８のいずれか１つに記載の組成物である。 Aspect 59 is the composition of any one of aspects 56 to 58, wherein the 3' end of the primer is blocked.

態様６０は、第１のアダプターが、トランスポザーゼ認識部位を含む、態様５６～５９のいずれか１つに記載の組成物である。 Aspect 60 is a composition according to any one of aspects 56 to 59, wherein the first adapter comprises a transposase recognition site.

態様６１は、トランスポソーム複合体及びＤＮＡポリメラーゼを別々の容器に、並びに使用説明書を含むキットであって、トランスポソームが、トランスポゾン配列に結合したトランスポザーゼを含み、トランスポゾン配列が、第１のアダプター及びＤＮＡ損傷を含み、ＤＮＡポリメラーゼが、損傷不耐性ポリメラーゼである、キットである。 Aspect 61 is a kit comprising a transposome complex and a DNA polymerase in separate containers, as well as instructions for use, wherein the transposome comprises a transposase bound to a transposon sequence, the transposon sequence comprises a first adapter and DNA damage, and the DNA polymerase is a damage-intolerant polymerase.

態様６２は、第２のＤＮＡポリメラーゼを更に含み、第２のＤＮＡポリメラーゼが、損傷耐性ポリメラーゼである、態様６１に記載のキットである。 Aspect 62 is the kit according to Aspect 61, further comprising a second DNA polymerase, wherein the second DNA polymerase is a damage-tolerant polymerase.

態様６３は、プライマーを更に含み、プライマーが、５’から３’に第２のアダプター及びアニーリングドメインを含み、アニーリングドメインが、第１のアダプターの相補体にアニーリングするヌクレオチド配列を含む、態様６１又は６２に記載のキットである。 Aspect 63 is a kit according to Aspect 61 or 62, further comprising a primer, the primer comprising a second adapter and an annealing domain 5' to 3' from the second adapter, the annealing domain comprising a nucleotide sequence that anneals to the complement of the first adapter.

態様６４は、プライマーの３’末端が、ブロックされている、態様６１～６３のいずれか１つに記載のキットである。 Aspect 64 is a kit according to any one of aspects 61 to 63, wherein the 3' end of the primer is blocked.

態様６５は、第１のアダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含む、態様６１～６４のいずれか１つに記載のキットである。 Aspect 65 is a kit according to any one of aspects 61 to 64, wherein the first adapter comprises one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof.

態様６６は、第２のアダプタープライマーが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを更に含む、態様６１～６５のいずれか１つに記載のキットである。 Aspect 66 is a kit according to any one of aspects 61 to 65, wherein the second adapter primer further comprises one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof.

態様６７は、トランスポザーゼと、第１の鎖上に、５’から３’に、少なくとも１つのユニバーサル配列、少なくとも１つのインデックス配列、少なくとも１つのＵＭＩ、又はそれらの組み合わせ、ＤＮＡ損傷、トランスポザーゼ認識配列を含み、第２の鎖上に、トランスポザーゼ認識配列の少なくとも一部に相補的なヌクレオチドを含むアダプターを含む核酸を含むトランスポゾンと、を含む、トランスポソーム複合体である。 Aspect 67 is a transposome complex comprising a transposase and a transposon comprising a nucleic acid that includes, on a first strand, from 5' to 3', at least one universal sequence, at least one index sequence, at least one UMI, or a combination thereof, DNA damage, or a transposase recognition sequence, and, on a second strand, an adapter that includes nucleotides complementary to at least a portion of the transposase recognition sequence.

態様６８は、第１の鎖が、第１の鎖の５’末端に捕捉剤を更に含む、態様６７に記載のトランスポソーム複合体である。 Aspect 68 is a transposome complex described in Aspect 67, wherein the first strand further comprises a capture agent at the 5' end of the first strand.

態様６９は、第１の鎖が、捕捉剤と５’末端との間に位置する切断可能なリンカーを更に含む、態様６７又は６８に記載のトランスポソーム複合体である。 Aspect 69 is a transposome complex described in Aspect 67 or 68, wherein the first strand further comprises a cleavable linker positioned between the capture agent and the 5' end.

態様７０は、第２の鎖が、第２の鎖の３’末端に捕捉剤を更に含む、態様６７～６９のいずれか１つに記載のトランスポソーム複合体である。 Aspect 70 is a transposome complex described in any one of aspects 67 to 69, wherein the second strand further comprises a capture agent at the 3' end of the second strand.

態様７１は、第２の鎖が、捕捉剤と３’末端との間に位置する切断可能なリンカーを更に含む、態様７０に記載のいずれか１つのトランスポソーム複合体である。 Aspect 71 is any one of the transposome complexes described in aspect 70, wherein the second strand further comprises a cleavable linker positioned between the capture agent and the 3' end.

本開示は、以下の実施例によって例示される。特定の実施例、材料、量、及び手順は、本明細書に記載の本開示の範囲及び趣旨に従って広く解釈されるべきであることを理解されたい。 The present disclosure is illustrated by the following examples. It is understood that the specific examples, materials, amounts, and procedures are to be interpreted broadly in accordance with the scope and spirit of the present disclosure as set forth herein.

実施例１
対称標的核酸の非対称標的核酸断片への変換の概念実証。 Example 1
Proof of concept for conversion of symmetric target nucleic acids into asymmetric target nucleic acid fragments.

タグメンテーションによる対称標的核酸の生成及び非対称標的核酸への変換の実験アプローチ。シークエンシングライブラリーは、各末端に同じアダプターを有する標的核酸を生成するための単一のトランスポゾンとのトランスポソーム複合体を使用してＤＮＡのタグメンテーションによって調製し、次いで、アダプターのうちの１つを修飾して非対称標的核酸をもたらすための条件に曝露した。 Experimental approach for generating symmetric target nucleic acids by tagmentation and converting them to asymmetric target nucleic acids. Sequencing libraries were prepared by tagmentation of DNA using transposome complexes with a single transposon to generate target nucleic acids with the same adapter at each end, which were then exposed to conditions to modify one of the adapters, resulting in an asymmetric target nucleic acid.

細胞／核のプロトコル。 Cell/Nuclear Protocols.

キットは、以下を含む：９６ウェルインデックス付きＴＳＭプレート、３８４ウェルインデックス付きＰＣＲプレート、５×タグメンテーション緩衝液ＴＢ１、ＥｘＴＢ（１．７ｍｌのスクリューキャップチューブ、ＬＮＡ＋ＴＸ１００中５００ｕｌ）、タグメンテーション後の洗浄緩衝液（１５ｍｌコニカルチューブ中１０ｍｌ）、再懸濁緩衝液（Resuspension buffer、ＲＳＢ）（１５ｍｌのコニカルチューブ中１０ｍｌ）、及び０．５％ＳＤＳ（１．７ｍｌのスクリューキャップチューブ中５００ｕｌ）。 The kit includes: a 96-well indexed TSM plate, a 384-well indexed PCR plate, 5x Tagmentation Buffer TB1, ExTB (500ul in a 1.7ml screw-cap tube, LNA+TX100), Post-Tagmentation Wash Buffer (10ml in a 15ml conical tube), Resuspension Buffer (RSB) (10ml in a 15ml conical tube), and 0.5% SDS (500ul in a 1.7ml screw-cap tube).

使用者により調製されたもの：Ｑ５２×マスターミックス（ＮＥＢ、Ｍ０４９２Ｌ）、Ｑ５Ｕ２×マスターミックス（ＮＥＢ、Ｍ０５９７Ｌ）、８０％ＥｔＯＨ、及びＡＭＰｕｒｅＸＰビーズ（ＢｅｃｋｍａｎＣｏｕｌｔｅｒ、Ａ６３８８０）。 User prepared: Q5 2x Master Mix (NEB, M0492L), Q5U 2x Master Mix (NEB, M0597L), 80% EtOH, and AMPure XP beads (Beckman Coulter, A63880).

機器及び消耗プラスチック：細胞カウンター（ＴｈｅｒｍｏＦｉｓｈｅｒＣｏｕｎｔｅｓｓＩＩＦＬ自動細胞カウンター、ＡＭＱＡＦ１０００）、Ｃｏｕｎｔｅｓｓ細胞カウントチャンバースライド（ＴｈｅｒｍｏＦｉｓｈｅｒ、ＰＮＣ１０２２８）、温度制御を備えたプレート用遠心分離機、温度制御を備えたベンチトップ型遠心分離機、バイオアナライザー（Ａｇｉｌｅｎｔ、ＰＮＧ２９３９ＢＡ）、ＡｇｉｌｅｎｔＨｉｇｈＳｅｎｓｉｔｉｖｉｔｙＤＮＡキット（５０６７－４６２６）、９６ウェルプレート（Ｅｐｐｅｎｄｏｒｆｔｗｉｎ．ｔｅｃＰＣＲプレート９６ＬｏＢｉｎｄ、スカート型、ＰＮ００３０１２９５１２）、３８４ウェルプレート（Ｅｐｐｅｎｄｏｒｆｔｗｉｎ．ｔｅｃＰＣＲプレート３８４ＬｏＢｉｎｄ、スカート型、ＰＮ００３０１２９５４７）、使い捨て試薬リザーバー（ＶＷＲ、ＰＮ８９０９４－６５８）又は同等品、ビーズ収集用マグネットスタンド、９６及び３８４ウェルプレート用サーマルサイクラー、プレートシェーカー、並びにＦａｌｃｏｎ１５ｍＬコレクションチューブ（ＴｈｅｒｍｏＦｉｓｈｅｒ、ＰＮ１４－９５９－５３Ａ又はＳＡＲＳＴＥＤＴ、ＰＮ６２．５５４．２０５）。 Equipment and consumable plastics: cell counter (ThermoFisher Countess II FL automated cell counter, AMQAF1000), Countess cell counting chamber slides (ThermoFisher, PN C10228), temperature-controlled plate centrifuge, temperature-controlled benchtop centrifuge, bioanalyzer (Agilent, PN G2939BA), Agilent High Sensitivity DNA Kit (5067-4626), 96-well plates (Eppendorf twin.tec PCR Plate 96 LoBind, skirted, PN 0030129512), 384-well plates (Eppendorf twin.tec PCR Plate 384 LoBind, skirted, PN 0030129547), disposable reagent reservoir (VWR, PN 89094-658) or equivalent, magnetic stand for bead collection, thermal cycler for 96- and 384-well plates, plate shaker, and Falcon 15mL collection tubes (ThermoFisher, PN 14-959-53A or SARSTEDT, PN 62.554.205).

核調製のための試薬：Ｐｉｅｒｃｅ（商標）１６％ホルムアルデヒド（ｗ／ｖ）、メタノール不含（ＴｈｅｒｍｏＦｉｓｈｅｒ、ＰＮ２８９０６）、ＴｒｙＰＬＥ（ＦｉｓｈｅｒＳｃｉｅｎｔｉｆｉｃ、ＰＮ１２－６０４－０３９）、ＰＢＳ緩衝液（Ｓｉｇｍａ、ＰＮ＃８０６５５２－１Ｌ）、Ｐｉｅｒｃｅプロテアーゼ阻害剤ミニ錠剤、ＥＤＴＡ不含（ＰＮＡ３２９５５）、及びトリパンブルー溶液（ＴｈｅｒｍｏＦｉｓｈｅｒ、ＰＮ１５２５００６１）。 Reagents for nuclei preparation: Pierce™ 16% formaldehyde (w/v), methanol-free (ThermoFisher, PN28906), TryPLE (Fisher Scientific, PN12-604-039), PBS buffer (Sigma, PN#806552-1L), Pierce protease inhibitor mini-tablets, EDTA-free (PN A32955), and trypan blue solution (ThermoFisher, PN15250061).

細胞株に推奨される緩衝液。溶解緩衝液は、１０ｍＭのＨＥＰＥＳ、１０ｍＭのＮａＣｌ、３ｍＭのＭｇＣｌ２、０．１％のＩｇｅｐａｌ、０．１％のＴｗｅｅｎ（登録商標）、及びプロテアーゼ阻害剤である。ＮＩＢ緩衝液は、１０ｍＭのＴｒｉｓ（ｐＨ７．５）、１０ｍＭのＮａＣｌ、３ｍＭのＭｇＣｌ２、０．１％のＴｗｅｅｎ、及びプロテアーゼ阻害剤である。１０×Ｘｌｉｎｋ緩衝液は、５ＭのＮａＣｌ、１ＭのＴｒｉｓＨＣｌ（ｐＨ７．５）、１ＭのＭｇＣｌ２及び１００ｎｇ／ｕＬのＢＳＡである。 Recommended buffers for cell lines. Lysis buffer is 10 mM HEPES, 10 mM NaCl, 3 mM MgCl2, 0.1% Igepal, 0.1% Tween®, and protease inhibitors. NIB buffer is 10 mM Tris (pH 7.5), 10 mM NaCl, 3 mM MgCl2, 0.1% Tween, and protease inhibitors. 10x Xlink buffer is 5 M NaCl, 1 M Tris HCl (pH 7.5), 1 M MgCl2, and 100 ng/uL BSA.

核調製及びヌクレオソーム枯渇。細胞は、前日にＴ２５フラスコ（ＰＮ）に１×１０^６で播種し、採取時にはサブコンフルエントであった。細胞を、５ｍＬの氷冷ＰＢＳを用いてフラスコ内で洗浄し、ＴｒｙｐＬＥを用いてトリプシン処理し（１ｍＬ、３７℃で５分間）、４℃で３分間５００ｒｃｆで回転させて収集し、１ｍＬの氷冷ＰＢＳで洗浄して、核単離に進めた。 Nuclei preparation and nucleosome depletion. Cells were seeded at 1 x ^{10 cells} in T25 flasks (PN) the day before and were subconfluent at the time of harvest. Cells were washed in the flask with 5 mL of ice-cold PBS, trypsinized with TrypLE (1 mL, 5 min at 37°C), harvested by spinning at 500 rcf for 3 min at 4°C, washed with 1 mL of ice-cold PBS, and proceeded to nuclear isolation.

核単離。細胞を、４℃で３分間５００ｒｃｆで遠心沈殿させ、１ｍＬの溶解緩衝液中に再懸濁し、氷上で１０分間インキュベートした。細胞を、４℃で３分間５００ｒｃｆで遠心沈殿させ、３００ｕＬの溶解緩衝液に再懸濁した。核を１：５希釈（２ｕＬの試料＋８ｕＬの溶解緩衝液＋１０ｕＬのトリパンブルー溶液）を使用してカウントした。１×１０^６を固定のために等分した。 Nuclei isolation. Cells were spun down at 500 rcf for 3 minutes at 4°C, resuspended in 1 mL of lysis buffer, and incubated on ice for 10 minutes. Cells were spun down at 500 rcf for 3 minutes at 4°C, and resuspended in 300 μL of lysis buffer. Nuclei were counted using a 1:5 dilution (2 μL sample + 8 μL lysis buffer + 10 μL trypan blue solution). 1 × 10 ^cells were aliquoted for fixation.

核固定。体積を５ｍＬの溶解緩衝液まで増加させ、新たに開封したアンプルから１６％ホルムアルデヒド２４６μＬを添加した（総計０．７５％ホルムアルデヒド、０．５％～０．７５％の範囲が許容可能）。穏やかに振とうしながら室温で１０分間インキュベートし、４℃で３分間５００ｒｃｆで遠心分離してペレット化し、１ｍＬの氷冷ＮＩＢで洗浄し、４℃で３分間５００ｒｃｆで回転させ、２００ｕＬの１×Ｘｌｉｎｋ緩衝液（氷冷）で洗浄した。洗浄中、より良好なペレットのために核を１．５ｍＬチューブに移し、４℃で３分間５００ｒｃｆで回転させた。 Nuclei fixation. The volume was increased to 5 mL of lysis buffer and 246 μL of 16% formaldehyde was added from a freshly opened ampoule (total 0.75% formaldehyde; a range of 0.5% to 0.75% is acceptable). Incubate for 10 minutes at room temperature with gentle shaking, pellet by centrifugation at 500 rcf for 3 minutes at 4°C, wash with 1 mL of ice-cold NIB, spin at 500 rcf for 3 minutes at 4°C, and wash with 200 μL of 1x Xlink buffer (ice-cold). During the wash, nuclei were transferred to a 1.5 mL tube for better pelleting and spun at 500 rcf for 3 minutes at 4°C.

ヌクレオソーム枯渇（全ゲノムシークエンシング用）。ペレットを４０μＬの１％ＳＤＳで７６０μＬの１×Ｘｌｉｎｋ緩衝液に再懸濁し、３７℃で２０分間振とうしながら（４００ｒｐｍ）インキュベートした。（０．０５％最終ＳＤＳ）、４℃で３分間５００ｒｃｆで回転させ、２００ｕＬ１×ＮＩＢで洗浄し、４℃で３分間５００ｒｃｆで回転させて、５０～１００ｕＬのＮＩＢに再懸濁した。２ｕＬの試料に８ｕＬのＮＩＢ及び１０ｕＬのトリパンブルーを添加し、１０ｕＬを細胞カウンターにロードした。核は、ピペッティングアウトのために、必要に応じて、５００核／ｕｌに濃縮又は希釈した。 Nucleosome depletion (for whole genome sequencing). The pellet was resuspended in 760 μL of 1x Xlink buffer with 40 μL of 1% SDS and incubated at 37°C for 20 minutes with shaking (400 rpm). (0.05% final SDS) spun at 500 rcf for 3 minutes at 4°C, washed with 200 μL of 1x NIB, spun at 500 rcf for 3 minutes at 4°C, and resuspended in 50-100 μL of NIB. To a 2 μL sample, 8 μL of NIB and 10 μL of trypan blue were added, and 10 μL was loaded onto a cell counter. Nuclei were concentrated or diluted to 500 nuclei/μL, as needed, for pipetting out.

プレートベースのコンビナトリアルインデックス付けワークフローのためのプロトコル（図１０）。 Protocol for plate-based combinatorial indexing workflow (Figure 10).

タグメンテーション。核を緩衝液と混合する：３５０ｕｌ（約１００Ｋ）核、５００ｕｌの５×タグメンテーション緩衝液（ＴＢ１）、及び１３５０ｕｌのＨ２Ｏ。９６ＴＳＭプレートの各ウェルに２０ｕｌを添加し、サーマルサイクラー上で５５℃で１５分間インキュベートする。１００ｕｌの２００ｍＭのＥＤＴＡを１５ｍＬのコレクションチューブに添加し、核を９６ウェルプレートから氷上の１５ｍＬコレクションチューブにプールする（総計２５ｕｌ×９６＋１００ｕｌ＝２．５ｍＬ）。核を４℃、５００ｒｃｆでペレット化し、核を洗浄緩衝液５００ｕｌに懸濁する。２ｕＬの試料を取り出し、８ｕＬのＮＩＢ及び１０ｕＬのトリパンブルーを添加し、１０ｕＬを細胞カウンターにロードすることによって核の濃度を決定する。次いで、核を核／ｕＬに希釈し、４ｕＬをプレートの各ウェルにロードする。 Tagmentation. Mix nuclei with buffer: 350 ul (approximately 100K) nuclei, 500 ul of 5x Tagmentation Buffer (TB1), and 1350 ul of H2O. Add 20 ul to each well of a 96-well TSM plate and incubate at 55°C on a thermal cycler for 15 minutes. Add 100 ul of 200 mM EDTA to a 15 mL collection tube and pool the nuclei from the 96-well plate into a 15 mL collection tube on ice (total 25 ul x 96 + 100 ul = 2.5 mL). Pellet the nuclei at 4°C and 500 rcf and suspend the nuclei in 500 ul of wash buffer. Determine the concentration of nuclei by removing a 2 uL sample, adding 8 uL of NIB and 10 uL of trypan blue, and loading 10 uL into a cell counter. The nuclei are then diluted to nuclei/uL and 4uL is loaded into each well of the plate.

伸長：以下の順序で試薬を添加する：１ｕｌの０．５％ＳＤＳを添加し、５５℃で１０分間加熱し、２ｕｌのＥｘＴＢ、７ｕｌの２×Ｑ５マスターミックス（ＮＥＢ）を合計１４ｕｌ添加する。ウェルを混合し、サーモサイクラー上でプログラムを実行する。１．７２℃で１０分間、２．９８℃で３０秒間、３．９８℃で１０秒間、４．５９℃で２０秒間、５．７２℃で１０秒間、６．工程３～５を総計１０サイクル繰り返す、７．７２℃で２分間及び８．１０℃保持温度。 Extension: Add reagents in the following order: Add 1 ul of 0.5% SDS, heat to 55°C for 10 minutes, add 2 ul of ExTB, and 7 ul of 2x Q5 Master Mix (NEB) for a total of 14 ul. Mix wells and run the program on the thermocycler: 1. 72°C for 10 minutes, 2. 98°C for 30 seconds, 3. 98°C for 10 seconds, 4. 59°C for 20 seconds, 5. 72°C for 10 seconds, 6. Repeat steps 3-5 for a total of 10 cycles, 7. 72°C for 2 minutes, and 8. Hold temperature at 10°C.

インデックス付きＰＣＲ：伸長から１ｕｌのＰＣＲプライマーを３８４ウェルＰＣＲプレートから核プレートに移す。１５ｕｌ２×ＮＥＢＱ５Ｕを添加し、サーマルサイクラー上でＰＣＲプログラムを実行する：１．９８℃で３０秒間、２．９８℃で１０秒間、３．５５℃で２０秒間、４．７２℃で３０秒間、５．工程２～４を総計２０サイクル繰り返す、６．７２℃で２分間、及び７．１０℃保持温度。ライブラリーは通常、１２～１４サイクルの間で増幅される。 Indexed PCR: Transfer 1 ul of PCR primer from the extension well from the 384-well PCR plate to a nuclei plate. Add 15 ul 2x NEB Q5U and run the PCR program on a thermal cycler: 1. 98°C for 30 seconds, 2. 98°C for 10 seconds, 3. 55°C for 20 seconds, 4. 72°C for 30 seconds, 5. Repeat steps 2-4 for a total of 20 cycles, 6. 72°C for 2 minutes, and 7. 10°C hold temperature. Libraries are typically amplified between 12-14 cycles.

ライブラリーのクリーンアップ：１ウェル当たり１０ｕｌをプールし、総計３８４０ｕｌを１５ｍＬのコレクションチューブ（ＰＮ）に、ＱｉａｇｅｎＰＣＲクリーンアップカラム（ＰＮ）を通して濃縮し、５０ｕｌに溶出し、５０ｕｌのＡｍｐｕｒｅＸＰビーズを添加し、１００ｕｌの８０％ＥｔＯＨで２回洗浄し、２０ｕｌのＲＳＢに溶出し、ＢｉｏａｎａｌｙｚｅｒＤＮＡＨＳキット（ＰＮ）で定量化する。 Library cleanup: Pool 10 ul per well and place a total of 3840 ul in a 15 mL collection tube (PN), concentrate through a Qiagen PCR cleanup column (PN), elute to 50 ul, add 50 ul of Ampure XP beads, wash twice with 100 ul of 80% EtOH, elute into 20 ul of RSB, and quantify using the Bioanalyzer DNA HS kit (PN).

ゲノムＤＮＡ（genomic DNA、ｇＤＮＡ）のためのＡＡ→ＡＢ（対称対非対称）プロトコル。 AA → AB (symmetric vs. asymmetric) protocol for genomic DNA (gDNA).

Ｔｎｐアセンブリ：５ｕｌの１０×アニーリング緩衝液、５ｕｌのＳＢＳ１２－Ｕ－ＭＥ（ＭｏｓａｉｃＥｌｅｍｅｎｔ）１００ｕＭ、５ｕｌのＭＥ’１００ｕＭ、及び３５ｕｌのＨ_２Ｏを総計で５０ｕｌを添加する。サーモサイクラー上で実行する：９５℃で１分間、８０℃で３０秒間、サイクルごとに１℃ずつ２０℃まで下げて、２０℃で１時間、１０℃保持温度。 Tnp assembly: Add 50ul total: 5ul 10x annealing buffer, 5ul SBS12-U-ME (Mosaic Element) 100uM, 5ul ME' 100uM, and 35ul H ₂ O. Run on thermocycler: 95°C for 1 minute, 80°C for 30 seconds, ramp down to 20°C by 1°C per cycle, 20°C for 1 hour, 10°C hold temperature.

ＴＳＭアセンブリ：７９ｕｌのＳＤＢ緩衝液、１ｕｌのＴｎ５２００ｕＭ、及び２０ｕｌのＴｎｐを、総計１００ｕｌでＴｎｐアセンブリから添加する。３７℃で一晩インキュベートし、ＳＤＢ緩衝液中で４倍希釈して５００ｕＭのＴＳＭにする。 TSM Assembly: Add 79 ul of SDB buffer, 1 ul of Tn5 200 uM, and 20 ul of Tnp for a total of 100 ul from the Tnp assembly. Incubate overnight at 37°C and dilute 4x in SDB buffer to 500 uM TSM.

ｇＤＮＡに対するタグメンテーション：４ｕｌのｇＤＮＡ２０ｎｇ、５ｕｌの２×ＴＤ緩衝液（タグメンション緩衝液）、及びＴＳＭアセンブリから１ｕｌのＴＳＭを総計１０ｕｌで添加し、５５℃で１０分間インキュベートする。 Tagmentation of gDNA: Add 4 ul of 20 ng of gDNA, 5 ul of 2x TD buffer (tagmentation buffer), and 1 ul of TSM from the TSM assembly for a total of 10 ul and incubate at 55°C for 10 minutes.

ＡＡ→ＡＢ変換：１ｕｌの１％ＳＤＳを添加し、５５℃で１０分間インキュベートし、１ｕＭのＬＮＡ－ＭＥ＿Ａ１４オリゴと混合した２ｕｌ１０％Ｔｒｉｔｏｎ（登録商標）－Ｘ１００を加え、２ｕｌの２×ＮＰＭマスターミックス（Ｉｌｌｕｍｉｎａ）を総計１５ｕｌで添加する。サーモサイクラー上で実行する：１．７２℃で１０分間、２．９８℃で３０秒間、３．９８℃で１０秒間、４．５９℃で２０秒間、５．７２℃で１０秒間、６．工程３～５を総計１０サイクル繰り返す、７．７２℃で２分間、及び８．１０℃保持温度。 AA → AB conversion: Add 1 ul of 1% SDS and incubate at 55°C for 10 minutes. Add 2 ul of 10% Triton®-X100 mixed with 1 uM LNA-ME_A14 oligo, and add 2 ul of 2x NPM Master Mix (Illumina) for a total of 15 ul. Run on a thermocycler: 1. 72°C for 10 minutes, 2. 98°C for 30 seconds, 3. 98°C for 10 seconds, 4. 59°C for 20 seconds, 5. 72°C for 10 seconds, 6. Repeat steps 3-5 for a total of 10 cycles, 7. 72°C for 2 minutes, and 8. 10°C hold temperature.

ＰＣＲ：２５ｕＭのＳＢＳ１２を１ｕｌ、２５ｕＭのＡ１４を１ｕｌ、Ｈ_２Ｏを８ｕｌ、及び２５ｕｌの２×ＮＥＢＱ５Ｕマスターミックスを総計５０ｕｌで添加する。サーモサイクラー上で実行する：１．９８℃で３０秒間、２．９８℃で１０秒間、３．５５℃で２０秒間、４．７２℃で３０秒間、５．工程２～４を総計２０サイクル繰り返す、６．７２℃で２分間、及び７．１０℃保持温度。ライブラリーは通常、１２～１４サイクルの間で増幅される。ライブラリーは、５ｕｌのＰＣＲ生成物を１．２％のＬｏｎｚａａｇｒｏｓｅゲルにロードし、１８０ｖで１５分間、生成物を分解することによってチェックすることができる。 PCR: Add 1 ul of 25 uM SBS12, 1 ul of 25 uM A14, 8 ul of _H2O , and 25 ul of 2x NEB Q5U master mix for a total of 50 ul. Run on a thermocycler: 1. 98°C for 30 seconds, 2. 98°C for 10 seconds, 3. 55°C for 20 seconds, 4. 72°C for 30 seconds, 5. Repeat steps 2-4 for a total of 20 cycles, 6. 72°C for 2 minutes, and 7. 10°C hold temperature. Libraries are typically amplified for 12-14 cycles. Libraries can be checked by loading 5 ul of PCR product onto a 1.2% Lonza agrose gel and resolving the product at 180V for 15 minutes.

ＤＮＡ損傷のサイズの影響。 Effect of DNA damage size.

ｇＤＮＡに対するＡＡ→ＡＢアプローチの概念実証データ。ＭＥとインデックス間のＤＮＡ損傷として、異なる数のウラシルを含む３つの異なるＴＳＭ、すなわちＵ、ＵＵ、又はＵＵＵを試験した。第１の伸長を１０回繰り返した。全てのＴＳＭは、機能的であり、異なる効率ではあるがライブラリーを生成し、単一のＵが最も効率的であった（図１１）。ＡＢシステムを対照と比較し、ＳＢＳ１２－ＭＥＴＳＭを、Ａ１４－ＭＥをロードしたＴＳＭと混合した。ｑＰＣＲにより、ＡＡ→ＡＢシステムは、標準的なＡＢシステムと比較して、鋳型を約４倍増加させた。ＬＮＡ－ＭＥ濃度の滴定（本明細書ではデータは不図示）は、第２の伸長について１００ｎＭが効率的であることを示している。 Proof-of-concept data for the AA → AB approach to gDNA. Three different TSMs containing different numbers of uracils were tested as the DNA lesion between the ME and index: U, UU, or UUU. The first extension was repeated 10 times. All TSMs were functional and generated libraries with varying efficiencies, with a single U being the most efficient (Figure 11). The AB system was compared to a control, where SBS12-ME TSM was mixed with A14-ME-loaded TSM. qPCR demonstrated that the AA → AB system increased template yield by approximately 4-fold compared to the standard AB system. Titration of LNA-ME concentration (data not shown herein) indicated that 100 nM was efficient for the second extension.

アダプターを付加するための伸長に対する改変ヌクレオチドの影響。 Effect of modified nucleotides on extension to add adapters.

標準的なＡ１４－ＭＥオリゴ（プライマー中にロック核酸（ＬＮＡ）が存在しない）が、ＡＡ－＞ＡＢ変換において不十分に実施されたことをデータは示している。ＳＢＳ１２－ＭＥ及びＡ１４－ＭＥＴＳＭを含むＡＢシステムを対照として比較した（図１２）。ＬＮＡ－Ａ１４の代わりに、通常の塩基で作製されたオリゴを第２の伸長に適用した。ＰＣＲによる最終ライブラリー収率は、大幅に低減し、図１１と比較して、幅広いスメアを示す。 The data show that standard A14-ME oligos (no locked nucleic acid (LNA) in the primer) performed poorly in AA->AB conversion. An AB system containing SBS12-ME and A14-ME TSM was compared as a control (Figure 12). Instead of LNA-A14, oligos made with normal bases were applied in the second extension. The final library yield from PCR was significantly reduced and exhibited a broad smear compared to Figure 11.

ＬＮＡ－ＭＥは、第２の伸長を増強する。ＬＮＡ－ＭＥ伸長のサイクル数を増加させると、収率が向上し、１０サイクルは、ほぼ理論的最大値に達する（図１３）。ＬＮＡ修飾を有する修飾Ａ１４－ＭＥオリゴを使用したほぼ完全なライブラリー変換と比較すると、標準的なＡ１４－ＭＥオリゴ（ＬＮＡなし）との不十分なライブラリー生成の差は、驚くべきかつ予想外であり、重大な利点であった。加えて、ＡＡ→ＡＢ及びＡＢシステム間の２倍の収率差異は、ほぼ完全な最大変換が得られたことを示している。 LNA-ME enhances the second extension. Increasing the number of cycles of LNA-ME extension improves yield, reaching nearly the theoretical maximum at 10 cycles (Figure 13). Compared to the nearly complete library conversion using modified A14-ME oligos with LNA modifications, the difference in insufficient library generation with standard A14-ME oligos (no LNA) was surprising and unexpected, a significant advantage. Additionally, the two-fold yield difference between the AA→AB and AB systems indicates that nearly complete maximum conversion was achieved.

アニーリング温度の影響。 Effect of annealing temperature.

核ＡＴＡＣバルクアッセイにおけるＬＮＡ－ＭＥアニーリング温度滴定。ＳＢＳ１２及びＡ１４ＭＥを含むＡＢシステムを対照として使用した。同じ数の核内のゲノムＤＮＡをＴＳＭによって転置した。第２の伸長のＡＡ→ＡＢワークフローは、異なるアニーリング温度で行った。約５９．５℃では、最適な効率を示し、ＡＢ対照と比較して、ｑＰＣＲに従って、増幅可能な鋳型を約５倍に増強することができる（図１４）。 LNA-ME annealing temperature titration in the nuclear ATAC bulk assay. An AB system containing SBS12 and A14ME was used as a control. Genomic DNA in the same number of nuclei was transposed by TSM. The second extension AA→AB workflow was performed at different annealing temperatures. Approximately 59.5°C showed optimal efficiency, enhancing amplifiable template by approximately 5-fold compared to the AB control according to qPCR (Figure 14).

実施例２
単一細胞コンビナトリアルインデックス付けの改善
単一細胞オミックスの主要な課題は、各細胞についてのゲノム特性のシークエンシングライブラリーへの効率的な変換である。本明細書では、複数のアッセイに一般化可能であり、カスタムシークエンシング化学を必要としない単一細胞コンビナトリアルインデックス付けワークフロー（ｓｃｉ）のアダプター切り替え戦略について説明する。この技術では、対称鎖ｓｃｉ（ｓ３）は、クロマチンアクセシビリティ（ｓ３－ＡＴＡＣ）、全ゲノムシークエンシング（ｓ３－ＷＧＳ）、及びゲノムとクロマチンの立体配座（ｓ３－ＧＣＣ）を含む様々な特性について、細胞ごとに得られる読み取りにおいて１～２桁の改善を提供する。 Example 2
Improved Single-Cell Combinatorial Indexing A major challenge in single-cell omics is the efficient conversion of genomic features for each cell into a sequencing library. Herein, we describe an adapter-switching strategy for a single-cell combinatorial indexing workflow (sci) that is generalizable to multiple assays and does not require custom sequencing chemistry. In this technology, symmetric stranded sci (s3) provides one to two orders of magnitude improvement in the reads obtained per cell for a variety of features, including chromatin accessibility (s3-ATAC), whole-genome sequencing (s3-WGS), and genome and chromatin conformation (s3-GCC).

主な
単一細胞ゲノミクスアッセイは、ライフサイエンス分野全範囲の複雑な生物学的システムを問い合わせるための有力なプラットフォームに急速になっている。単一細胞レベルで様々な特性を捕捉するためのプラットフォームは、典型的に、細胞スループットと細胞ごとに取得され得る情報の深度との間のトレードオフを被る。本発明者らは、ハイスループットで様々なゲノム特性を評価するためのトランスポザーゼベースのライブラリー構築^２を活用する単一細胞コンビナトリアルインデックス付け（ｓｃｉ）^１を利用するワークフローを説明した。転位反応自体（タグメンテーション）は、非常に効率的であるが、順方向又は逆方向の一次配列の形態で異なるアダプターが分子の各末端に組み込まれている場合のみ、実行可能なシークエンシングライブラリー分子が生成される。タグメンテーション反応の間、２つの配列の各々を組み込む可能性が等しいため、分子の半分が順方向－順方向又は逆方向－逆方向のアダプター組み合わせになり、理論的収率が５０％に低減する。この効率に対抗するために、アダプター種のより大きな相補体の使用^３、ＲＮＡ中間体を通過するためのＴ７プロモーター配列の組み込み^４～６、又は標的化^７若しくはランダムプライミング^８を使用する第２のアダプターの組み込みを含むいくつかの戦略が開発されてきた。本明細書では、アダプター交換を利用して、上下両鎖の順方向及び逆方向アダプターの両方でタグ付けされたライブラリー分子を生成する代替戦略を提示する。更に、この形式により、トランスポザーゼアダプター複合体内に埋め込まれたＤＮＡインデックス配列の使用が可能になり、１回目は転位段階で、２回目はＰＣＲ段階での２回のインデクシングが実行される単一細胞コンビナトリアルインデックス付け（ｓｃｉ）適用が可能になる^{１，９，１０}。 Single-cell genomics assays are rapidly becoming a powerful platform for interrogating complex biological systems across the life sciences. Platforms for capturing various properties at the single-cell level typically suffer from a trade-off between cellular throughput and the depth of information that can be obtained per cell. We have described a workflow using single-cell combinatorial indexing (SCI) ¹ ^that leverages transposase-based library construction2 to assess various genomic properties in a high-throughput manner. The transposition reaction itself (tagmentation) is highly efficient, but viable sequencing library molecules are generated only if different adapters, in the form of forward or reverse primary sequences, are incorporated at each end of the molecule. During the tagmentation reaction, there is an equal probability of incorporating each of the two sequences, resulting in half of the molecules having forward-forward or reverse-reverse adapter combinations, reducing the theoretical yield to 50%. To combat this efficiency, several strategies have been developed, including the use of larger complements of adapter species, ³ the incorporation of T7 promoter sequences to pass RNA intermediates, ^4-6 or the incorporation of a second adapter using ^targeted7 or random ^priming.8 Herein, we present an alternative strategy that utilizes adapter exchange to generate library molecules tagged with both forward and reverse adapters on both the top and bottom strands. Furthermore, this format allows for the use of DNA index sequences embedded within the transposase-adapter complex, enabling single-cell combinatorial indexing (sci) applications in which two rounds of indexing are performed: once at the transposition step and a second time at the PCR ^step.1,9,10

この技術の対称ストランドｓｃｉ（ｓ３）は、ユニバーサルモザイク末端配列及び区画特異的ＤＮＡバーコードに加えて、順方向プライマー配列を組み込むための単一アダプター転位の効率を活用する。アダプターは、タグメンテーション反応中に共有結合で組み込まれる、得られる生成物の上鎖上のトランスポザーゼ認識配列（モザイク末端）の直後にウラシル塩基が存在するように設計されている。ウラシル不耐性酵素を用いたポリメラーゼ伸長は、ＤＮＡバーコード又は順方向プライマー配列への伸長なしに、下鎖上のモザイク末端配列のコピーをもたらす。ウラシル耐性ポリメラーゼとともに逆方向プライマー配列を含有するモザイク末端ロック核酸（ＬＮＡ）鋳型のその後の変性及び付加により、ライブラリー分子の伸長が追加の配列に組み込まれることが可能になる。最大効率を確保するために、鋳型オリゴヌクレオチドを伸長からブロックして、プライマーとしてのその作用を防止し、線形伸長反応を複数回実行することが可能になる（図７）。ｓ３プラットフォームの更なる利点は、アダプター配列が、ｓｃｉ技術に必要なカスタムワークフロー及びプライマーの代わりに、標準的なシークエンシングレシピが使用され得るように設計されていることである。１細胞当たりの通過読み取りの中央値が１６倍改善された単一細胞クロマチンアクセシビリティライブラリー（ｓ３－ＡＴＡＣ）、以前のＳＣＩ－ｓｅｑ／ｓｃｉ－ＤＮＡ－ｓｅｑ^１１に比べて１２６倍改善された単一細胞全ゲノムシークエンシング（ｓ３－ＷＧＳ）、及び以前のコンビナトリアルインデックスＨｉ－Ｃ法^１２よりも高い割合のクロマチン相互作用シグナルにより単一細胞におけるゲノム配列及びクロマチン立体配座情報（ｓ３－ＧＣＣ）の両方を捕捉する新しい技術を生成するためのこのワークフローを実証する。 This technology, symmetric-strand Sci (S3), leverages the efficiency of single-adapter transposition to incorporate a forward primer sequence in addition to a universal mosaic end sequence and a compartment-specific DNA barcode. The adapter is designed so that a uracil base is present immediately after the transposase recognition sequence (mosaic end) on the top strand of the resulting product, which is covalently incorporated during the tagmentation reaction. Polymerase extension using a uracil-intolerant enzyme results in a copy of the mosaic end sequence on the bottom strand without extension to the DNA barcode or forward primer sequence. Subsequent denaturation and addition of a mosaic end-locked nucleic acid (LNA) template containing the reverse primer sequence with a uracil-resistant polymerase allows extension of the library molecule to incorporate additional sequences. To ensure maximum efficiency, the template oligonucleotide is blocked from extension to prevent its action as a primer, allowing multiple rounds of linear extension reactions to be performed (Figure 7). An additional advantage of the S3 platform is that the adapter sequence is designed so that standard sequencing recipes can be used instead of the custom workflow and primers required for Sci technology. We demonstrate this workflow for generating single-cell chromatin accessibility libraries (s3-ATAC) with a 16-fold improvement in median number of passage reads per cell, single-cell whole genome sequencing (s3-WGS) with a 126-fold improvement over previous SCI-seq/sci-DNA- ^seq ¹¹ , and novel techniques that capture both genomic sequence and chromatin conformation information in single cells (s3-GCC) with a higher proportion of chromatin interaction signals than previous combinatorial index Hi-C methods 12 .

本技術の新規成分の前に、核の前処理が最小限であるため、クロマチンアクセシビリティを評価するｓ３技術を確立することを最初に求めた。ｓ３－ＡＴＡＣにおいて核を単離し、次いで従来のｓｃｉ－ＡＴＡＣ－ｓｅｑにおけるようにタグメント化するが、代わりに、シングルエンドインデックス付きトランスポソームを使用し、次いでアダプタースイッチングｓ３のワークフローを介して実行する（図７）。他の核からのゲノム混入がなく、かつバーコード衝突が最小限の真の単一細胞ライブラリーを達成することを確実にするために、一次凍結ヒト皮質組織及び凍結マウス全脳組織に対して「バーンヤード」試験としても知られている混合種実験を行った図１５。クロスセル混入の割合をより正確に捕捉するために、理想化された細胞株設定ではなく、一次組織試料に対してこの試験を実行することを選択した。本発明者らは更に、タグメンテーション段階及びＰＣＲ段階でタグメンテーション前及び後の２つの試料からの核を混合することによって、導入の可能性のある両点でのクロストークのレベルを評価するように実験を設計した。更に、単一細胞コンビナトリアルインデックス付けワークフローの固有の試料多重化能力を活用することにより、純種ライブラリーを生成した。総計で、３０８８６及び２６，５３０の固有の中央値、それぞれヒト及びマウスの１細胞当たり、染色体第１～第２２、第２３（ヒト）、Ｘ及びＹに整列させた高マッピング品質の読み取り（以下、「通過読み取り」と呼ぶ）を有する、１，３６６のヒト及び１０５４のマウスの単一細胞ＡＴＡＣ－ｓｅｑプロファイルを生成した。特に、ライブラリーは非常に複雑であり、一意として細胞に割り当てられた読み取りの中央値が６９．０５％であり、追加の配列深度が現在シークエンシングされている深度のカバレッジを超えて得られ、それを大幅に増加させることを示している。更なるシークエンシング時に予測推定値が経験的データの２％以内に収まる^９、１細胞当たりの固有の読み取りを投影するための確立された方法を使用して、本発明者らは、我々のライブラリーが、それぞれヒト皮質及びマウス全脳試料について９５％ライブラリー飽和で１細胞当たり１２８，１４４及び１７４，８５８の通過読み取りの中央値に達することを見出した。次に、我々のマウス脳試料の現在の深度並びに投影を、同等の組織並びに可能な場合はそれらのライブラリーの投影の公開されているデータセットと比較した。我々のライブラリーは、任意の他のライブラリー又は自己報告ライブラリー投影よりも桁違いに改善していることを見出した図１６。 Prior to the novel components of this technology, we first sought to establish an s3 technique for assessing chromatin accessibility, as it requires minimal nuclear preprocessing. In s3-ATAC, nuclei are isolated and then tagmented as in traditional sci-ATAC-seq, but instead, single-end indexed transposomes are used and then run through the adapter-switching s3 workflow (Figure 7). To ensure true single-cell libraries were achieved without genomic contamination from other nuclei and with minimal barcode collisions, we performed mixed-species experiments, also known as "barnyard" experiments, on primary frozen human cortical tissue and frozen whole mouse brain tissue (Figure 15). To more accurately capture the rate of cross-cell contamination, we chose to perform this test on primary tissue samples rather than an idealized cell line setting. We further designed the experiment to assess the level of crosstalk at both points of potential introduction by mixing nuclei from two samples, pre- and post-tagmentation, at the tagmentation and PCR stages. Furthermore, we generated pure-species libraries by leveraging the inherent sample multiplexing capabilities of the single-cell combinatorial indexing workflow. In total, we generated 1,366 human and 1,054 mouse single-cell ATAC-seq profiles with a median of 30,886 and 26,530 unique, high-mapping-quality reads (hereafter referred to as "pass-through reads") aligned to chromosomes 1-22, 23 (human), X, and Y per cell in human and mouse, respectively. Notably, the libraries were highly complex, with a median of 69.05% of reads assigned to cells as unique, indicating that additional sequencing depth can be obtained beyond and significantly increase the coverage of the currently sequenced depth. Using established methods for projecting unique reads per cell, we found that our libraries reached a median of ^128,144 and 174,858 pass-through reads per cell at 95% library saturation for human cortex and mouse whole-brain samples, respectively. We then compared the current depth and projections of our mouse brain samples to publicly available datasets of comparable tissues and, where available, their library projections, and found that our library represents an order of magnitude improvement over any other library or self-reported library projection (Fig. 16).

本発明者らは、改善がインデックスの重複又はゲノムクロストークによるものではないことを確認するために、ヒトマウス組み合わせ参照ゲノムに対応する一意の読み取り数を評価することによって、試料の純度を実証した。任意の処理の前、すなわち、タグメンテーション前、核を混合した実験条件下では、５．１２％の衝突割合が観察され（図１７、２×２．５６％の検出されたヒト－マウスの衝突）、十分に許容レベル内である。予想されるように、タグメンテーション後の実験条件下でゼロ衝突が観察されたが、タグメンテーション前の実験で観察された衝突は、クロストーク又は周囲クロマチンとは対照的に二重サンプリングに起因することを示唆している。また、本発明者らは、それぞれ、２．７７及び３．９３の中央値で転写開始点（transcription start site、ＴＳＳ）濃縮を評価し、ヒト及びマウス試料についてはそれぞれ、１４．６０％及び１９．４０％でピークと呼ばれる読み取りの割合（fraction of reads in called peak、ＦＲＩＰ）を評価することによって、読み取りの増加が、過剰なバックグラウンドに起因しない生物学的シグナルを実際に捕捉しており、両測定基準が一致する組織型に関して他のプラットフォームに匹敵することを確認した。次に、本発明者らは、十分なシグナルを使用して、試料内に存在する細胞型を識別することを求めた。各種について、集計データ上の呼び出されたピークを使用して、カウントマトリックスを構築し、続いてトピックモデリングツールのｃｉｓＴｏｐｉｃ^１３を使用した次元削減を構築し、次いで、ＵＭＡＰ^１４を使用して可視化し、最終的に、トピックレベルでグラフベースのクラスタリングを行った。本発明者らは、ヒト皮質及びマウス全脳の両試料について各クラスター内の細胞型特異的遺伝子における明確なシグナルとともに、視覚化空間並びに同定されたクラスターの両方における細胞型の明確な分離を見出した（図１８～図２１）。 To confirm that the improvement was not due to index duplication or genomic crosstalk, we verified the purity of the samples by assessing the number of unique reads corresponding to the combined human-mouse reference genome. Before any treatment, i.e., before tagmentation, under experimental conditions in which nuclei were mixed, a collision rate of 5.12% was observed (Figure 17; 2 × 2.56% of detected human-mouse collisions), well within acceptable levels. As expected, zero collisions were observed under experimental conditions after tagmentation, suggesting that the collisions observed in the pre-tagmentation experiments were due to double sampling as opposed to crosstalk or surrounding chromatin. We also confirmed that the increase in reads indeed captured biological signal and not due to excessive background by assessing transcription start site (TSS) enrichment at median values of 2.77 and 3.93, respectively, and fraction of reads in called peak (FRIP) at 14.60% and 19.40% for human and mouse samples, respectively, with both metrics comparable to other platforms for matched tissue types. Next, we sought to use sufficient signal to identify the cell types present within the samples. For each species, we used the called peaks on the aggregated data to construct a count matrix, followed by dimensionality reduction using the topic modeling tool ^cisTopic13 , visualization using ^UMAP14 , and finally graph-based clustering at the topic level. We found clear separation of cell types in both the visualization space and the identified clusters, with distinct signals in cell type-specific genes within each cluster for both human cortex and mouse whole brain samples (Figures 18-21).

次に、ｓ３－ＡＴＡＣによって生成されたデータ品質の改善は、ハイスループットローパス単細胞ゲノムシークエンシング^９のために以前に報告された我々のｓｃｉ－ＤＮＡ－ｓｅｑ法を含む、他の単一細胞コンビナトリアルインデックス付けワークフローに翻訳可能であるべきであると推論した。ｓ３ワークフロー（図７）を使用することに加えて、均一なカバレッジを得るために使用される技術のヌクレオソーム枯渇成分に対する他の改善も調査した。最初に、対照リンパ芽球細胞株（ＧＭ１２８７８）上でｓ３－ＷＧＳ（図６）を展開し、最適化されたバージョンの界面活性剤ベースのヌクレオソーム枯渇（×ＳＤＳ）が、最高の均一性及び読み取り数を提供し、その状態における９２の細胞が、１細胞当たりの通過読み取りが３７．１２％のゲノム捕捉率の中央値に変換される、６，５８４，６０２の中央値を示していることを明らかにした。本発明者らはまた、０．１～０．３（中央値０．１８）内に収まる絶対偏差の中央値（ＭＡＤ）を評価することによって、カバレッジが均一であり、他の単一細胞ゲノムシークエンシング技術に匹敵することを確認した。この最適化されたプロトコルを使用して、次に、ｓ３－ＷＧＳを展開し、最小限の継代数後に原発性膵管腺がん（pancreatic ductal adenocarcinoma、ＰＤＡＣ）腫瘍に由来する２つの細胞株のシークエンシングを行った。 We next reasoned that the improvements in data quality generated by s3-ATAC should be ^translatable to other single-cell combinatorial indexing workflows, including our previously reported sci-DNA-seq method for high-throughput, low-pass single-cell genome sequencing. In addition to using the s3 workflow (Figure 7), we also explored other improvements to the nucleosome depletion component of the technique used to obtain uniform coverage. We first deployed s3-WGS (Figure 6) on a control lymphoblastoid cell line (GM12878) and found that an optimized version of detergent-based nucleosome depletion (×SDS) provided the highest uniformity and read count, with 92 cells in that condition exhibiting a median of 6,584,602 pass-through reads per cell, translating to a median genome capture rate of 37.12%. We also confirmed that coverage was uniform and comparable to other single-cell genome sequencing technologies by assessing the median absolute deviation (MAD), which fell within 0.1-0.3 (median 0.18). Using this optimized protocol, we next deployed s3-WGS to sequence two cell lines derived from primary pancreatic ductal adenocarcinoma (PDAC) tumors after a minimal number of passages.

ＰＤＡＣは、典型的には進行した段階で現れるがんの壊滅的な形態であり、早期検出及び検査が腫瘍進行の鍵となる。ＰＤＡＣ検査は、生検試料中のがん細胞画分が少ないことに悩まされるために、精製された腫瘍からの低継代由来の連続再生細胞株（continuously-regenerating cell line、ＣＲＣ）を使用した。この方法は、核型分析によって証明されるように、腫瘍試料に存在する不均一性の大部分を維持しながら、特性評価及び摂動の複数のモダリティを可能にする^１５。がん遺伝子ＫＲＡＳの２つの異なるサブクローンミスセンス変異（ｐ．Ｇ１２Ｄ及びｐ．Ｇ１２Ｃ）と、Ｇバンディングベース及びスペクトル核型分析によって測定される重大なゲノム不安定性と、を有する２つの系統（ＰＤＡＣ－１及びＰＤＡＣ－２と呼ばれる）を標的とした。これらの系統については、それぞれ、ＰＤＡＣ－１及びＰＤＡＣ－２について、２，０９６，２０７及び１，４４５，３８１の予測通過読み取り数の中央値を示した７０９及び２６７の単一細胞ライブラリーを得た（図２２～図２４）。初期のＧＭ１２８７８対照試料よりも低いが、それは、従来の方法によって得られたカバレッジを大幅に超える。２つの系統のＭＡＤスコア（図２４）は、０．２８及び０．３２の中央値を有するＧＭ１２８７８の比較的正常な核型のものよりも高かったが、しかしながら、これは、試料に存在する広範なコピー数変化を考慮すると、予想されることである。本発明者らは、ＰＤＡＣ－１原発性腫瘍、正常血液、及びＣＲＣ株からの一対の全エクソームシークエンシング及びコピー数呼び出しでこの期待値を検証し、特徴的なゲノム不安定性の強力な証拠を明らかにした。次に、単一細胞コピー数プロファイリングを行い、２つの系統の各々内の高度に変化したゲノムランドスケープを特定した。限定された核型分析データ及び全エクソームデータに従って、マルチメガベースサイズのコピー数異常の細胞ごとの同様のパターンを確認する。ＧＭ１２８７８、及び２つのＰＤＡＣ系統の３つの試料について、ゲノムウィンドウ内の推測コピー数プロファイルを使用して、階層的及びＫ平均クラスタリングを行い、複数のクローンゲノム配置を明らかにした。 PDAC is a devastating form of cancer that typically presents at an advanced stage, making early detection and testing key to tumor progression. Because PDAC testing suffers from low cancer cell fractions in biopsy samples, we used continuously regenerating cell lines (CRCs) derived at low passages from purified tumors. This method allows for multiple modalities of characterization and perturbation while preserving much of the heterogeneity present in tumor samples, as evidenced by karyotyping. ¹⁵ We targeted two lines (termed PDAC-1 and PDAC-2) harboring two distinct subclonal missense mutations (p.G12D and p.G12C) in the oncogene KRAS and significant genomic instability as measured by G-banding-based and spectral karyotyping. For these lines, we obtained 709 and 267 single-cell libraries with median expected pass-through reads of 2,096,207 and 1,445,381 for PDAC-1 and PDAC-2, respectively (Figures 22-24). While lower than the initial GM12878 control sample, it significantly exceeds the coverage achieved by conventional methods. The MAD scores for the two lines (Figure 24) were higher than those for the relatively normal karyotype of GM12878, with median scores of 0.28 and 0.32, respectively; however, this is expected given the extensive copy number alterations present in the samples. We validated this expectation with paired whole-exome sequencing and copy number calling from PDAC-1 primary tumors, normal blood, and CRC lines, revealing strong evidence of characteristic genomic instability. Next, we performed single-cell copy number profiling to identify highly altered genomic landscapes within each of the two lines. According to limited karyotyping and whole-exome data, we confirm similar cell-by-cell patterns of multimegabase copy number aberrations. Hierarchical and K-means clustering was performed using inferred copy number profiles within genomic windows for GM12878 and three samples from two PDAC lines, revealing multiple clonal genomic arrangements.

単一細胞分解能を考慮して、既知のＰＤＡＣ関連がん遺伝子及び腫瘍抑制因子コピー数異常の発生を評価することができた。患者間の違いの例として、ＰＤＡＣ－２試料のみで占められているクラスター７は、ＴＧＦβＲ２及びＰＢＲＭ１を含むゲノム領域の特有の増幅を示し、領域は、細胞増殖に関連し、以前は、ＰＤＡＣ患者における高いがん細胞腫瘍画分と関連していた。ＰＤＡＣ－１試料は、がん遺伝子ＭＹＣ（絶対コピー数２．２６±２．３６）を含むゲノム領域の不均一な増幅を明らかにする。更に、ＰＤＡＣ症例の９０％超で発生することが知られているがん遺伝子ＫＲＡＳと重複するゲノム範囲の限局的増幅を明らかにした。クラスター１は、ＫＲＡＳ増幅による細胞数が最も少ない（２３．３％、４０／１７２細胞）が、クラスター５は、ＫＲＡＳコピー数増加の頻度が最も高い（８２．６％、１３８／１６７細胞）ことが判明した。本発明者らは、ジェノタイピング及びデジタル液滴ＰＣＲに全エクソームデータを利用することによって、この不均一なコピー数異常を検証し、ＰＤＡＣ－１ＣＲＣ株からサンプリングしたＫＲＡＳ対立遺伝子の５３％が過剰発現に関連する変異ＫＲＡＳ対立遺伝子を示していることを見出した。 Given the single-cell resolution, we were able to assess the occurrence of copy number aberrations in known PDAC-associated oncogenes and tumor suppressors. As an example of interpatient differences, cluster 7, occupied exclusively by PDAC-2 samples, showed unique amplification of genomic regions including TGFβR2 and PBRM1, regions associated with cell proliferation and previously associated with a high fraction of cancer cells in PDAC patients. PDAC-1 samples revealed heterogeneous amplification of a genomic region including the oncogene MYC (absolute copy number 2.26 ± 2.36). Furthermore, we found focal amplification of a genomic region overlapping with the oncogene KRAS, known to occur in >90% of PDAC cases. While cluster 1 contained the fewest cells with KRAS amplification (23.3%, 40/172 cells), cluster 5 was found to have the highest frequency of KRAS copy number gain (82.6%, 138/167 cells). We validated this heterogeneous copy number abnormality by utilizing whole-exome data for genotyping and digital droplet PCR and found that 53% of KRAS alleles sampled from PDAC-1 CRC lines displayed mutant KRAS alleles associated with overexpression.

重複及び欠失は、がん細胞増殖において競合的な利点を誘導し得るゲノム再編成の唯一の形態ではない。ゲノム反転は、標準的な核型分析及び染色体ペインティング法を介して評価することが困難であるが、ブレークポイントを捕捉する読み取りのみが裏付けになる証拠を提供するため、染色体転座は、全ゲノム増幅法で明らかにすることが困難である。これらの制限の両方に対処するために、本発明者らは、追加の前処理ワークフローとともにｓ３－ＷＧＳ技術を利用し、固定及びヌクレオソーム枯渇後に制限消化し、次いで（ＨｉＣ法のように、だがビオチン化塩基を組み込むことなく）、ｓ３ライブラリー調製に続いて再ライゲーションした。この追加の処理により、遠位クロマチン接触点を示すキメラライゲーション結合部にわたる読み取りの一部がもたらされ、残りの読み取りが、全ゲノムシークエンシングデータとして機能し、ゲノム及びクロマチン立体配座の両方を可能にする（ｓ３－ＧＣＣ）ことが推論された（図９）。ｓ３－ＷＧＳ実験におけるように同じ２つのＰＤＡＣ細胞株でｓ３－ＧＣＣを実行し（図２５～図２８）、ＰＤＡＣ－１及びＰＤＡＣ－２について、１細胞当たりの予測通過読み取り中央値がそれぞれ、１，０３４，０１４及び１，２４５，２６６で同等である、２２及び９３の細胞プロファイルを生成した。次いで、コピー数呼び出しを実行し、結果をｓ３－ＷＧＳライブラリーと比較して、細胞株群内のもう片方内に散在する各方法のプロファイルと同様のパターンを明らかにした。クロマチン立体配座シグナルの初期測定値を得るために、染色体間読み取り対の割合を評価し、両方のｓ３－ＧＣＣ調製物が、それらのｓ３－ＷＧＳ対応物よりも６８．９１及び５８．９１倍の増加で過剰を含有していた。次いで、１ｋｂｐを超えるインサートサイズを有する読み取りの割合を測定し、各系統についてはそれぞれ、１５．６％及び１７．０％の中央値、平均１６％であり、ｓ３－ＷＧＳに対して３６１倍及び４０２倍の倍数濃縮で、ここでも中央値が同等であった。１細胞当たりの予測される固有の総染色体接触点を評価するために、最初に、クロマチン接触の読み取り数投影が、標準的なゲノムシークエンシング読み取りを表すデータのバルクと同じように実行されると仮定し、総通過読み取り数のパーセンテージをとることができた。これにより、ＰＤＡＣ－１については２０，４５１及びＰＤＡＣ－２については２０，６１１の１細胞当たりの接触点の予測中央値が生成された。更に、本発明者らは、クロマチン接触を表す読み取りの部分に特異的に読み取り数投影を行い、２４４，７２８及び２４５，５６０の同様の値を得た。次いで、本発明者らは、比較的浅いシークエンシング深度から得られた接触点を使用し、異なるトポロジーパターンを示す凝集体プロファイルによりクロマチン接触マップを生成する能力を実証した。本発明者らは、ｓｃＨｉＣｌｕｓｔｅｒを介してそれらの遠位接触の情報によって単一細胞を分離し、３つの異なるクラスターを観察した。特に、この低いシークエンシング深度でさえ、本発明者らは、細胞株の低密度の接触プロファイルを確実に見分けることができる。サンプリングされた細胞にわたる固有の転座及び反転イベントを評価するために、クラスター０（ＰＤＡＣ－１のみが占有している）及び１の集約された接触マップ間の差異を調べた。本発明者らは、単一細胞接触データが、ＰＤＡＣ－１試料について特に明らかになっていないｔ（３；１４）（ｑ２４－２６；ｑ２１－２４）転座の例において、スペクトル核型分析（ＳＫＹ）データから報告された染色体腕スケールの転座を複製することを見出した。本発明者らはまた、ｓ３－ＷＧＳデータに見られる第３染色体のＴＧＦβＲ２及びＰＢＲＭ１領域間の染色体間の接触頻度が高まり、第２及び第４染色体に向かって、コピー数増加の異常なゲノム区画化を示唆している。 Duplications and deletions are not the only forms of genomic rearrangements that can induce a competitive advantage in cancer cell proliferation. Genomic inversions are difficult to assess via standard karyotyping and chromosome painting methods, while chromosomal translocations are difficult to reveal with whole-genome amplification methods because only reads capturing the breakpoints provide supporting evidence. To address both of these limitations, we utilized the s3-WGS technique with an additional preprocessing workflow: restriction digestion after fixation and nucleosome depletion, followed by religation following s3 library preparation (as in the HiC method, but without incorporating biotinylated bases). We reasoned that this additional processing would yield a portion of the reads spanning chimeric ligation junctions, representing distal chromatin contact points, while the remaining reads would serve as whole-genome sequencing data, enabling both genomic and chromatin conformation analysis (s3-GCC) (Figure 9). s3-GCC was performed on the same two PDAC cell lines as in the s3-WGS experiments (Figures 25-28), generating 22 and 93 cell profiles with comparable median predicted reads per cell of 1,034,014 and 1,245,266 for PDAC-1 and PDAC-2, respectively. Copy number calling was then performed, and the results were compared to the s3-WGS library, revealing similar patterns of interspersed profiles from each method within the other within the cell line panel. To obtain an initial measure of chromatin conformation signal, the proportion of interchromosomal read pairs was assessed, and both s3-GCC preparations contained an excess over their s3-WGS counterparts, with increases of 68.91 and 58.91 fold. We then measured the percentage of reads with insert sizes greater than 1 kbp, with a median of 15.6% and 17.0%, respectively, and a mean of 16%, for each lineage, representing 361-fold and 402-fold enrichments over s3-WGS, again with comparable medians. To assess the predicted total unique chromosome contacts per cell, we first assumed that read count projections of chromatin contacts were performed identically to the bulk of the data, representing standard genome sequencing reads, and took the percentage of the total pass-through reads. This produced predicted median contacts per cell of 20,451 for PDAC-1 and 20,611 for PDAC-2. We further performed read count projections specifically on the portion of reads representing chromatin contacts, yielding similar values of 244,728 and 245,560. We then demonstrated the ability to generate chromatin contact maps using contacts obtained from relatively shallow sequencing depths with aggregate profiles exhibiting distinct topological patterns. We separated single cells by their distal contact information via scHiCluster and observed three distinct clusters. Notably, even at this low sequencing depth, we could reliably distinguish the sparse contact profiles of cell lines. To assess unique translocation and inversion events across sampled cells, we examined differences between the aggregated contact maps of clusters 0 (occupied only by PDAC-1) and 1. We found that single-cell contact data replicated the chromosome arm-scale translocations reported from spectral karyotyping (SKY) data, in the case of the t(3;14)(q24-26;q21-24) translocation, which was not specifically evident in the PDAC-1 sample. We also observed an increased frequency of interchromosomal contacts between the TGFβR2 and PBRM1 regions of chromosome 3 seen in the s3-WGS data, suggesting aberrant genomic compartmentalization of copy number gains toward chromosomes 2 and 4.

まとめると、ｓ３ワークフローは、ｓ３－ＡＴＡＣの場合、シグナル濃縮、又はｓ３－ＷＧＳのカバレッジ均一性を犠牲にすることなく、細胞ごとに得られた通過読み取り関して、従来のｓｃｉプラットフォームよりも著しい改善を表す。また、コンビナトリアルインデックス付けワークフローの別の異形、すなわちｓ３－ＧＣＣを導入して、ゲノムシークエンシング及びクロマチン立体配座の両方を取得し、ｓｃｉ－ＨｉＣと比較すると、細胞ごとに得られたクロマチン接点が改善される。本発明者らは、劇的なクロマチン不安定性を有する２つの患者由来の腫瘍細胞株を評価することによって、これらのアプローチの有用性を実証する。本発明者らは、疾患関連遺伝子の焦点増幅のパターンを明らかにし、標準的な核型分析で達成可能ではないスループットで広範囲の不均一性を明らかにする。更に、本発明者らは、クロマチン区画を破壊するコピー数異常の影響を明らかにするためのプロトコルの共同分析を強調する。更に、ｓ３ワークフローは、標準的な単一細胞コンビナトリアルインデックス付けの同じ固有のスループット可能性を有する。このプラットフォームは、ｓｃｉ－ＭＥＴを含む他のトランスポザーゼに基づく技術と互換性があることも期待される^１０。ｓ３プラットフォームの１つの可能な欠点は、一連の８の順方向及び１２の逆方向複合体（９６ウェルプレートの行及び列に対応する）を使用することとは対照的に、固有のトランスポソーム複合体のフルセットを使用する必要があり、ワークフローに必要なオリゴの数を大きくすることである。しかしながら、実験ごとに必要なオリゴが比例して少なくなるため、これらのコストは最終的には収支が合う。最後に、ｓｃｉワークフローとは異なり、ｓ３プラットフォームは、カスタムシークエンシングプライマー又はカスタムシークエンシングレシピを必要とせず、これらの技術を実行しながらラボが直面し得る主要なハードルのうちの１つを取り除く。 In summary, the s3 workflow represents a significant improvement over conventional sci platforms in terms of the number of pass-through reads obtained per cell, without sacrificing signal enrichment in the case of s3-ATAC or coverage uniformity in the case of s3-WGS. We also introduce another variant of the combinatorial indexing workflow, s3-GCC, to capture both genome sequencing and chromatin conformation, improving the number of chromatin contacts obtained per cell compared to sci-HiC. We demonstrate the utility of these approaches by evaluating two patient-derived tumor cell lines with dramatic chromatin instability. We uncover patterns of focal amplification of disease-associated genes, revealing widespread heterogeneity at a throughput not achievable with standard karyotyping. Furthermore, we highlight the collaborative analysis of protocols to reveal the effects of copy number aberrations that disrupt chromatin compartments. Furthermore, the s3 workflow possesses the same inherent throughput potential of standard single-cell combinatorial indexing. This platform is also expected to be compatible with other transposase-based technologies, including sci-MET. ¹⁰ One possible drawback of the s3 platform is the need to use a full set of unique transposome complexes, as opposed to using a series of 8 forward and 12 reverse complexes (corresponding to the rows and columns of a 96-well plate), increasing the number of oligos required for the workflow. However, these costs ultimately pay off as proportionally fewer oligos are required per experiment. Finally, unlike the sci-MET workflow, the s3 platform does not require custom sequencing primers or custom sequencing recipes, eliminating one of the major hurdles labs can face when implementing these technologies.

実施例２の引用
１．Ｃｕｓａｎｏｖｉｃｈ，Ｄ．Ａ．ｅｔａｌ．Ｍｕｌｔｉｐｌｅｘｓｉｎｇｌｅ－ｃｅｌｌｐｒｏｆｉｌｉｎｇｏｆｃｈｒｏｍａｔｉｎａｃｃｅｓｓｉｂｉｌｉｔｙｂｙｃｏｍｂｉｎａｔｏｒｉａｌｃｅｌｌｕｌａｒｉｎｄｅｘｉｎｇ．Ｓｃｉｅｎｃｅ（８０－．）。３４８，９１０－９１４（２０１５）。
２．Ａｄｅｙ，Ａ．ｅｔａｌ．Ｒａｐｉｄ，ｌｏｗ－ｉｎｐｕｔ，ｌｏｗ－ｂｉａｓｃｏｎｓｔｒｕｃｔｉｏｎｏｆｓｈｏｔｇｕｎｆｒａｇｍｅｎｔｌｉｂｒａｒｉｅｓｂｙｈｉｇｈ－ｄｅｎｓｉｔｙｉｎｖｉｔｒｏｔｒａｎｓｐｏｓｉｔｉｏｎ．ＧｅｎｏｍｅＢｉｏｌ．１１，Ｒ１１９（２０１０）。
３．Ｔａｎ，Ｌ．，Ｘｉｎｇ，Ｄ．，Ｃｈａｎｇ，Ｃ．Ｈ．，Ｌｉ，Ｈ．＆Ｘｉｅ，Ｘ．Ｓ．Ｔｈｒｅｅ－ｄｉｍｅｎｓｉｏｎａｌｇｅｎｏｍｅｓｔｒｕｃｔｕｒｅｓｏｆｓｉｎｇｌｅｄｉｐｌｏｉｄｈｕｍａｎｃｅｌｌｓ．Ｓｃｉｅｎｃｅ（８０－．）。３６１，９２４－９２８（２０１８）。
４．Ｓｏｓ，Ｂ．Ｃ．ｅｔａｌ．Ｃｈａｒａｃｔｅｒｉｚａｔｉｏｎｏｆｃｈｒｏｍａｔｉｎａｃｃｅｓｓｉｂｉｌｉｔｙｗｉｔｈａｔｒａｎｓｐｏｓｏｍｅｈｙｐｅｒｓｅｎｓｉｔｉｖｅｓｉｔｅｓｓｅｑｕｅｎｃｉｎｇ（ＴＨＳ－ｓｅｑ）ａｓｓａｙ．ＧｅｎｏｍｅＢｉｏｌ．１７，２０（２０１６）。
５．Ｙｉｎ，Ｙ．ｅｔａｌ．Ｈｉｇｈ－ＴｈｒｏｕｇｈｐｕｔＳｉｎｇｌｅ－ＣｅｌｌＳｅｑｕｅｎｃｉｎｇｗｉｔｈＬｉｎｅａｒＡｍｐｌｉｆｉｃａｔｉｏｎ．Ｍｏｌ．Ｃｅｌｌ７６，６７６－６９０．ｅ１０（２０１９）。
６．Ｃｈｅｎ，Ｃ．ｅｔａｌ．Ｓｉｎｇｌｅ－ｃｅｌｌｗｈｏｌｅ－ｇｅｎｏｍｅａｎａｌｙｓｅｓｂｙＬｉｎｅａｒＡｍｐｌｉｆｉｃａｔｉｏｎｖｉａＴｒａｎｓｐｏｓｏｎＩｎｓｅｒｔｉｏｎ（ＬＩＡＮＴＩ）．Ｓｃｉｅｎｃｅ（８０－．）。３５６，１８９－１９４（２０１７）。
７．Ａｄｅｙ，Ａ．＆Ｓｈｅｎｄｕｒｅ，Ｊ．Ｕｌｔｒａ－ｌｏｗ－ｉｎｐｕｔ，ｔａｇｍｅｎｔａｔｉｏｎ－ｂａｓｅｄｗｈｏｌｅ－ｇｅｎｏｍｅｂｉｓｕｌｆｉｔｅｓｅｑｕｅｎｃｉｎｇ．ＧｅｎｏｍｅＲｅｓ．２２，１１３９－１１４３（２０１２）。
８．Ｍｕｌｑｕｅｅｎ，Ｒ．Ｍ．ｅｔａｌ．ＨｉｇｈｌｙｓｃａｌａｂｌｅｇｅｎｅｒａｔｉｏｎｏｆＤＮＡｍｅｔｈｙｌａｔｉｏｎｐｒｏｆｉｌｅｓｉｎｓｉｎｇｌｅｃｅｌｌｓ．Ｎａｔ．Ｂｉｏｔｅｃｈｎｏｌ．３６，４２８－４３１（２０１８）。
９．Ｖｉｔａｋ，Ｓ．Ａ．ｅｔａｌ．Ｓｅｑｕｅｎｃｉｎｇｔｈｏｕｓａｎｄｓｏｆｓｉｎｇｌｅ－ｃｅｌｌｇｅｎｏｍｅｓｗｉｔｈｃｏｍｂｉｎａｔｏｒｉａｌｉｎｄｅｘｉｎｇ．Ｎａｔ．Ｍｅｔｈｏｄｓ１４，３０２－３０８（２０１７）。
１０．Ｍｕｌｑｕｅｅｎ，Ｒ．Ｍ．ｅｔａｌ．ＨｉｇｈｌｙｓｃａｌａｂｌｅｇｅｎｅｒａｔｉｏｎｏｆＤＮＡｍｅｔｈｙｌａｔｉｏｎｐｒｏｆｉｌｅｓｉｎｓｉｎｇｌｅｃｅｌｌｓ．Ｎａｔ．Ｂｉｏｔｅｃｈｎｏｌ．３６，４２８－４３１（２０１８）。
１１．Ｖｉｔａｋ，Ｓ．Ａ．ｅｔａｌ．Ｓｅｑｕｅｎｃｉｎｇｔｈｏｕｓａｎｄｓｏｆｓｉｎｇｌｅ－ｃｅｌｌｇｅｎｏｍｅｓｗｉｔｈｃｏｍｂｉｎａｔｏｒｉａｌｉｎｄｅｘｉｎｇ．Ｎａｔ．Ｍｅｔｈｏｄｓ１４，（２０１７）。
１２．Ｒａｍａｎｉ，Ｖ．ｅｔａｌ．Ｍａｓｓｉｖｅｌｙｍｕｌｔｉｐｌｅｘｓｉｎｇｌｅ－ｃｅｌｌＨｉ－Ｃ．Ｎａｔ．Ｍｅｔｈｏｄｓ１４，２６３－２６６（２０１７）。
１３．ＢｒａｖｏＧｏｎｚａｌｅｚ－Ｂｌａｓ，Ｃ．ｅｔａｌ．ｃｉｓＴｏｐｉｃ：ｃｉｓ－ｒｅｇｕｌａｔｏｒｙｔｏｐｉｃｍｏｄｅｌｉｎｇｏｎｓｉｎｇｌｅ－ｃｅｌｌＡＴＡＣ－ｓｅｑｄａｔａ．Ｎａｔ．Ｍｅｔｈｏｄｓ１６，３９７－４００（２０１９）。
１４．Ｂｅｃｈｔ，Ｅ．ｅｔａｌ．Ｄｉｍｅｎｓｉｏｎａｌｉｔｙｒｅｄｕｃｔｉｏｎｆｏｒｖｉｓｕａｌｉｚｉｎｇｓｉｎｇｌｅ－ｃｅｌｌｄａｔａｕｓｉｎｇＵＭＡＰ．Ｎａｔ．Ｂｉｏｔｅｃｈｎｏｌ．３７，３８－４４（２０１８）。
１５．Ｌｉｎｄｅｎｂｕｒｇｅｒ，Ｋ．ｅｔａｌ．ＡＢ０２４．Ｓ０２４．Ｄｒｕｇｒｅｓｐｏｎｓｅｓｏｆｐａｔｉｅｎｔ－ｄｅｒｉｖｅｄｃｅｌｌｌｉｎｅｓｉｎｖｉｔｒｏｔｈａｔｍａｔｃｈｄｒｕｇｒｅｓｐｏｎｓｅｓｏｆｐａｔｉｅｎｔＰＤＡｃｔｕｍｏｒｓｉｎｓｉｔｕ．Ａｎｎ．Ｐａｎｃｒｅａｔ．Ｃａｎｃｅｒ１，ＡＢ０２４－ＡＢ０２４（２０１８）。 Citation for Example 2 1. Cusanovich, D. A. et al. Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing. Science (80-.). 348, 910-914 (2015).
2. Adey, A. et al. Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition. Genome Biol. 11, R119 (2010).
3. Tan, L. , Xing, D. , Chang, C. H. , Li, H. & Xie, X. S. Three-dimensional genome structures of single diploid human cells. Science (80-.). 361, 924-928 (2018).
4. Sos, B. C. et al. Characterization of chromatin accessibility with a transposome hypersensitive sites sequencing (THS-seq) assay. Genome Biol. 17, 20 (2016).
5. Yin, Y. et al. High-Throughput Single-Cell Sequencing with Linear Amplification. Mol. Cell 76, 676-690. e10 (2019).
6. Chen, C. et al. Single-cell whole-genome analyzes by Linear Amplification via Transposon Insertion (LIANTI). Science (80-.). 356, 189-194 (2017).
7. Adey, A. & Shendure, J. Ultra-low-input, tagmentation-based whole-genome bisulfite sequencing. Genome Res. 22, 1139-1143 (2012).
8. Mulqueen, R. M. et al. Highly scalable generation of DNA methylation profiles in single cells. Nat. Biotechnol. 36, 428-431 (2018).
9. Vitak, S. A. et al. Sequencing thousands of single-cell genomes with combinatorial indexing. Nat. Methods 14, 302-308 (2017).
10. Mulqueen, R. M. et al. Highly scalable generation of DNA methylation profiles in single cells. Nat. Biotechnol. 36, 428-431 (2018).
11. Vitak, S. A. et al. Sequencing thousands of single-cell genomes with combinatorial indexing. Nat. Methods 14, (2017).
12. Ramani, V. et al. Massively multiplex single-cell Hi-C. Nat. Methods 14, 263-266 (2017).
13. Bravo Gonzalez-Blas, C. et al. cisTopic: cis-regulatory topic modeling on single-cell ATAC-seq data. Nat. Methods 16, 397-400 (2019).
14. Becht, E. et al. Dimensionality reduction for visualizing single-cell data using UMAP. Nat. Biotechnol. 37, 38-44 (2018).
15. Lindenburger, K. et al. AB024. S024. Drug responses of patient-derived cell lines in vitro that match drug responses of patient PDAc tumors in situ. Ann. Pancreat. Cancer 1, AB024-AB024 (2018).

方法
ｓ３－ＡＴＡＣライブラリーの生成
試料取り扱いの前に、複合トランスポザーゼをＩｌｌｕｍｉｎａＩｎｃ．から入手した。９６の一意のインデックス付きトランスポザーゼにそれぞれのアダプターのうちの１つをロードし、２．５ｕＭに希釈し、－２０℃で保存した。５０ｍＬの核単離緩衝液（ＮＩＢ－ＨＥＰＥＳ）を、最終濃度１０ｍＭのＨＥＰＥＳ－ＫＯＨ（それぞれ、ＦｉｓｈｅｒＳｃｉｅｎｔｉｆｉｃ、ＢＰ３１０－５００及びＳｉｇｍａＡｌｄｒｉｃｈ１０５０１２１０００）、ｐＨ７．２、１０ｍＭのＮａＣｌ（ＦｉｓｈｅｒＳｃｉｅｎｔｉｆｉｃＳ２７１－３）、３ｍＭのＭｇＣｌ２（ＦｉｓｈｅｒＳｃｉｅｎｔｉｆｉｃＡＣ２２３２１００１０）、０．１％（ｖ／ｖ）ＩＧＥＰＡＬＣＡ－６３０（ＳｉｇｍａＡｌｄｒｉｃｈＩ３０２１）、０．１％（ｖ／ｖ）Ｔｗｅｅｎ（Ｓｉｇｍａ－ＡｌｄｒｉｃｈＰ－７９４９）で新たに調製し、ＰＣＲグレードの超純蒸留水（ＴｈｅｒｍｏＦｉｓｈｅｒＳｃｉｅｎｔｉｆｉｃ１０９７７０１５）で希釈した。希釈後、Ｐｉｅｒｃｅ（商標）プロテアーゼ阻害剤ミニ錠剤、ＥＤＴＡ不含（ＴｈｅｒｍｏＦｉｓｈｅｒＡ３２９５５）２錠剤を溶解し、懸濁して、核単離中のプロテアーゼ分解を防止した。 Methods Generation of s3-ATAC Library Prior to sample handling, composite transposase was obtained from Illumina Inc. 96 unique indexed transposases were loaded with one of each adapter, diluted to 2.5 uM, and stored at -20°C. Fifty milliliters of nuclear isolation buffer (NIB-HEPES) was freshly prepared with a final concentration of 10 mM HEPES-KOH (Fisher Scientific BP310-500 and Sigma-Aldrich 1050121000, respectively), pH 7.2, 10 mM NaCl (Fisher Scientific S271-3), 3 mM MgCl2 (Fisher Scientific AC223210010), 0.1% (v/v) IGEPAL CA-630 (Sigma-Aldrich I3021), 0.1% (v/v) Tween (Sigma-Aldrich P-7949), and PCR-grade ultrapure distilled water (Thermo After dilution, two tablets of Pierce™ Protease Inhibitor Mini Tablets, EDTA-Free (Thermo Fisher A32955) were dissolved and suspended to prevent protease degradation during nuclei isolation.

ｓ３－ＡＴＡＣ組織取り扱いについて、Ｃ５７／Ｂ６マウス全脳及びヒト皮質の一次試料を抽出し、液体窒素浴中で急速凍結した後、－８０℃で保存した。ベンチでの切開段階は、核抽出の前に設定した。ペトリ皿をドライアイス上に置き、新しい滅菌したかみそりをドライアイス包埋によって予備冷却した。７ｍＬ容量のダウンス型ホモジナイザーに２ｍＬのＮＩＢ－ＨＥＰＥＳ緩衝液を充填し、ウェットアイス上に保持した。氷上で氷冷７０％（ｖ／ｖ）エタノール（ＤｅｃｏｎＬａｂｏｒａｔｏｒｉｅｓＩｎｃ２７０１）を１５ｍＬのチューブに保持することによって、ダウンス型ホモジナイザー乳棒を冷却する。使用直前に、乳棒を冷却した蒸留水ですすいだ。組織解離のために、マウス及びヒトの脳試料を同様に処理した。まだ凍結した組織ブロックを清潔な予備冷却されたペトリ皿上に置き、かみそりで粗く刻んだ。次いで、かみそりを使用して約１ｍｇに刻んだ組織を、ダウンス型ホモジナイザー内の冷却されたＮＩＢ－ＨＥＰＥＳ緩衝液に輸送した。懸濁した試料に５分間かけて、ダウンシング前の塩濃度の変化に平衡化した。次いで、ゆるい（Ａ）乳棒での５ストロークでホモジナイズし、更に５分間インキュベートし、次いできつい（Ｂ）乳棒での５～１０ストロークでホモジナイズした。次いで、試料を、１５ｍＬのコニカルチューブに移す間３５μｍのセルストレーナー（Ｃｏｒｎｉｎｇ３５２２３５）を通して濾過し、続行する準備が整うまで核を氷上に保持した。核を４℃で、４００ｒｃｆの遠心分離により１０分間ペレット化した。上清を除去し、ペレットを１ｍＬのＮＩＢ－ＨＥＰＥＳ緩衝液に再懸濁した。この工程を繰り返して２回目の洗浄を行い、続行する準備が整うまで、核を再び氷上に保持した。懸濁した核の１０ｕＬアリコートを９０ｕＬのＮＩＢ－ＨＥＰＥＳ（１：１０希釈）で希釈し、血球計又はＢｉｏＲａｄＴＣ－２０自動細胞カウンターのいずれかで製造業者の推奨プロトコルに従って定量化した。次いで、ストック核懸濁液を１４００核／ｕＬの濃度に希釈した。 For s3-ATAC tissue processing, primary samples of C57/B6 mouse whole brain and human cortex were extracted, flash-frozen in a liquid nitrogen bath, and then stored at -80°C. A bench dissection stage was set up prior to nuclear extraction. Petri dishes were placed on dry ice, and a new, sterilized razor blade was pre-cooled by embedding it in dry ice. A 7 mL Dounce homogenizer was filled with 2 mL of NIB-HEPES buffer and kept on wet ice. The Dounce homogenizer pestle was cooled by placing a 15 mL tube of ice-cold 70% (v/v) ethanol (Decon Laboratories Inc. 2701) on ice. Immediately before use, the pestle was rinsed with chilled distilled water. For tissue dissociation, mouse and human brain samples were treated similarly. The still-frozen tissue block was placed on a clean, pre-cooled Petri dish and roughly chopped with a razor blade. The tissue, minced to approximately 1 mg using a razor, was then transferred to chilled NIB-HEPES buffer in a Dounce homogenizer. The suspended sample was allowed 5 minutes to equilibrate to the change in salt concentration before Dounce. It was then homogenized with 5 strokes of a loose (A) pestle, incubated for an additional 5 minutes, and then homogenized with 5-10 strokes of a tight (B) pestle. The sample was then filtered through a 35 μm cell strainer (Corning 352235) while being transferred to a 15 mL conical tube, and the nuclei were kept on ice until ready to proceed. The nuclei were pelleted by centrifugation at 400 rcf for 10 minutes at 4°C. The supernatant was removed, and the pellet was resuspended in 1 mL of NIB-HEPES buffer. This process was repeated for a second wash, and the nuclei were again kept on ice until ready to proceed. A 10 μL aliquot of the suspended nuclei was diluted with 90 μL of NIB-HEPES (1:10 dilution) and quantified using either a hemocytometer or a BioRad TC-20 automated cell counter according to the manufacturer's recommended protocol. The stock nuclei suspension was then diluted to a concentration of 1400 nuclei/μL.

タグメンテーションプレートを、４２０ｕＬの１４００核／ｕＬ溶液と５４０ｕＬの２×ＴＤ緩衝液（ＮｅｘｔｅｒａＸＴキット、ＩｌｌｕｍｉｎａＩｎｃ．）との組み合わせによって、調製した。この混合物から、８ｕＬ（総計約５０００核）をウェルスキーマに応じて９６ウェルプレートの各ウェルにピペットで移した。次いで、２５ｕＭの一意のインデックス付きトランスポザーゼ１ｕＬを各ウェルにピペットで移した。タグメンテーションは、３００ｒｃｆエッペンドルフＴｈｅｒｍｏＭｉｘｅｒ上で、５５℃で１０分間行った。このインキュベーション後、プレート温度を氷上で短時間インキュベートして下げて、反応を停止させた。実験的スキーマに応じてタグ付き核のプールを混合して、５ｍｇ／ｍＬのＤＡＰＩ（ＴｈｅｒｍｏＦｉｓｈｅｒＳｃｉｅｎｔｉｆｉｃＤ１３０６）２ｕＬを添加した。 Tagmentation plates were prepared by combining 420 μL of the 1400 nuclei/μL solution with 540 μL of 2x TD buffer (Nextera XT kit, Illumina Inc.). From this mixture, 8 μL (approximately 5000 nuclei total) was pipetted into each well of a 96-well plate according to the well schema. Then, 1 μL of 25 μM uniquely indexed transposase was pipetted into each well. Tagmentation was performed at 55°C for 10 minutes on a 300 rcf Eppendorf ThermoMixer. After this incubation, the plate temperature was reduced by a brief incubation on ice to stop the reaction. The tagged nuclei pool was mixed according to the experimental schema, and 2 μL of 5 mg/mL DAPI (Thermo Fisher Scientific D1306) was added.

次いで、核をＳｏｎｙＳＨ８００を介して流動分類して破片を除去し、ＰＣＲの前に１ウェル当たりの正確な計数を得た。容器９６ウェルプレートを９ｕＬの１×ＴＤ緩衝液（超純水で希釈）で調製した。次いで４℃の保持された試料室に保持した。次いで、蛍光核を、単一核についてサイズ、内部の複雑さ、及びＤＡＰＩ蛍光によるゲーティングでフロー選別した。選別完了直後に、プレートを密閉し、５００ｒｃｆ、４℃で５分間遠心沈殿させて、核が緩衝液内にあることを確認した。 Nuclei were then flow-sorted through a Sony SH800 to remove debris and obtain accurate counts per well prior to PCR. A 96-well plate was prepared with 9uL of 1x TD buffer (diluted with ultrapure water) and then held in a sample chamber at 4°C. Fluorescent nuclei were then flow-sorted for single nuclei, gating by size, internal complexity, and DAPI fluorescence. Immediately after sorting was complete, the plate was sealed and spun down at 500 rcf for 5 minutes at 4°C to ensure the nuclei were within the buffer.

次いで、ヌクレオソーム及び残りのトランスポザーゼを、１ウェル当たり１ｕＬの０．１％ＳＤＳ（約０．０１％ｆ．ｃ．）を添加して変性させた。その後、１ウェル当たり４ｕＬのＮＰＭ（ＮｅｘｔｅｒａＸＴキット、ＩｌｌｕｍｉｎａＩｎｃ）を添加して、タグ付きゲノムＤＮＡ上でギャップ充填を行い、７２℃で１０分間インキュベートした。次いで、１ｕＭのＡ１４－ＬＮＡ－ＭＥオリゴ１．５ｕＬを添加して、アダプター切り替えのために鋳型を供給した。次いで、ポリメラーゼベースのアダプター切り替えを、以下の条件で行った：９８℃で３０秒間の初期変性、９８℃で１０秒間を１０サイクル、５９℃で２０秒間、及び７２℃で１０秒間。次いで、プレートを１０℃に保持した。アダプター切り替え後、超純Ｈ２Ｏ（Ｓｉｇｍａ９３４２６）で溶解した１％（ｖ／ｖ）Ｔｒｉｔｏｎ－Ｘ１００を添加して、ＳＤＳの持続をクエンチした。この時点で、いくつかのプレートを－２０℃で数週間保存し、他のプレートを直ちに処理した。 Nucleosomes and remaining transposase were then denatured by adding 1 μL of 0.1% SDS (approximately 0.01% f.c.) per well. Subsequently, 4 μL of NPM (Nextera XT kit, Illumina Inc.) was added per well to perform gap filling on the tagged genomic DNA and incubated at 72°C for 10 minutes. Next, 1.5 μL of 1 μM A14-LNA-ME oligo was added to provide the template for adapter switching. Polymerase-based adapter switching was then performed using the following conditions: initial denaturation at 98°C for 30 seconds, 10 cycles of 98°C for 10 seconds, 59°C for 20 seconds, and 72°C for 10 seconds. The plate was then held at 10°C. After adapter switching, 1% (v/v) Triton-X100 dissolved in ultrapure HO (Sigma 93426) was added to quench the persistence of SDS. At this point, some plates were stored at -20°C for several weeks, while others were processed immediately.

次いで、ＰＣＲのために１ウェル当たりの以下を混合した：１ウェル当たり５０ｕＬの反応のために１６．５ｕｌの試料、１０μＭのインデックス付きｉ７プライマー２．５ｕＬ、１０μＭのインデックス付きｉ５プライマー２．５ｕＬ、３ｕＬの超純水、及び２５ｕＬのＮＥＢＮｅｘｔＱ５Ｕ２Ｘマスターミックス（ＮｅｗＥｎｇｌａｎｄＢｉｏｌａｂｓＭ０５９７Ｓ）、並びに０．５ｕＬの１００×ＳＹＢＲＧｒｅｅｎＩ（ＴｈｅｒｍｏＳｃｉｅｎｔｉｆｉｃＳ７５６３）。リアルタイムＰＣＲは、以下の条件でＢｉｏＲａｄＣＦＸ上で行い、サイクルごとにＳＹＢＲ蛍光を測定する：９８℃で３０秒間、９８℃で１０秒間を１６～１８サイクル、５５℃で２０秒間、７２℃で３０秒間、蛍光読み取り、７２℃で１０秒間。蛍光が指数関数的成長を過ぎて、屈折し始めた後、試料を７２℃で更に３０秒間保持し、次いで４℃で保存した。 Next, the following was mixed per well for PCR: 16.5 μl of sample, 2.5 μL of 10 μM indexed i7 primer, 2.5 μL of 10 μM indexed i5 primer, 3 μL of ultrapure water, and 25 μL of NEBNext Q5U 2X Master Mix (New England Biolabs M0597S), and 0.5 μL of 100X SYBR Green I (Thermo Scientific S7563) for a 50 μL reaction per well. Real-time PCR was performed on a BioRad CFX with SYBR fluorescence measured after each cycle: 98°C for 30 seconds, 16-18 cycles of 98°C for 10 seconds, 55°C for 20 seconds, 72°C for 30 seconds, fluorescence read, 72°C for 10 seconds. After the fluorescence passed the exponential growth rate and began to refract, the samples were held at 72°C for an additional 30 seconds and then stored at 4°C.

次いで、増幅されたライブラリーを、１ウェル当たり２５ｕＬを１５ｍＬのコニカルチューブにプールすることによってクリーンアップし、製造元のプロトコル（Ｑｉａｇｅｎ２８１０６）に従ってＱｉａｑｕｉｃｋＰＣＲ精製カラムを介してクリーンアップした。プールした試料を１０ｍＭのＴｒｉｓ－ＨＣｌ５０ｕＬ、ｐＨ８．０（ＬｉｆｅｔｅｃｈｎｏｌｏｇｉｅｓＡＭ９８５５）中で溶出した。次いで、ライブラリー分子は、ＳＰＲＩ選択ビーズを介したサイズ選択（Ｍａｇ－Ｂｉｎｄ（登録商標）ＴｏｔａｌＰｕｒｅＮＧＳＯｍｅｇａＢｉｏｔｅｋＭ１３７８－０１）を受けた。室温でボルテックスし、完全に懸濁した５０ｕＬのＳＰＲＩビーズを５０ｕＬのライブラリー（１×クリーンアップ）と混合して、室温で５分間インキュベートした。次いで、反応物をマグネットラック上に置いて１度清澄させ、上清を除去した。残りのペレットを１００ｕＬの新しい８０％エタノールで２回すすいだ。エタノールをピペットで除去した後、チューブを遠心沈殿させ、マグネットラックに戻して、あらゆる残留エタノールを除去する。次いで、３１μＬの１０ｍＭトリス－ＨＣｌ、ｐＨ８．０を使用して、ビーズをマグネットラックから取り外して、再懸濁し、室温で５分間インキュベートした。チューブを再びマグネットラック上に置き、１度清澄させて、上清の全量をクリーンなチューブに移した。次いで、製造業者の説明書（ＴｈｅｒｍｏＦｉｓｈｅｒＱ３２８５１）に従ってＱｕｂｉｔｄｓＤＮＡ高感度アッセイによってＤＮＡを定量化した。次いで、ライブラリーを２ｎｇ／ｕＬに希釈し、ＡｇｉｌｅｎｔＴａｐｅｓｔａｔｉｏｎ４１５０Ｄ５０００テープ（Ａｇｉｌｅｎｔ５０６７－５５９２）上で実行した。次いで、１００～１０００ｂｐの範囲内のライブラリー分子濃度を、ライブラリーの最終希釈の１ｎＭまで使用した。次いで、希釈したライブラリーを、製造元（ＩｌｌｕｍｉｎａＩｎｃ．）の推奨に従ってＮｅｘｔｓｅｑ５００システム上で大容量又は中容量１５０ｂｐシークエンシングキットでシークエンシングした。 The amplified library was then cleaned up by pooling 25 μL per well into a 15 mL conical tube and cleaned up via a Qiaquick PCR purification column according to the manufacturer's protocol (Qiagen 28106). The pooled sample was eluted in 50 μL of 10 mM Tris-HCl, pH 8.0 (Life technologies AM9855). The library molecules then underwent size selection via SPRI selection beads (Mag-Bind® TotalPure NGS Omega Biotek M1378-01). After vortexing at room temperature, 50 μL of fully suspended SPRI beads were mixed with 50 μL of the library (1x cleanup) and incubated for 5 minutes at room temperature. The reaction was then placed on a magnetic rack to clear the mixture, and the supernatant was removed. The remaining pellet was rinsed twice with 100 μL of fresh 80% ethanol. After pipetting off the ethanol, the tube was spun down and returned to the magnet rack to remove any residual ethanol. The beads were then removed from the magnet rack, resuspended using 31 μL of 10 mM Tris-HCl, pH 8.0, and incubated at room temperature for 5 minutes. The tube was placed back on the magnet rack, clarified, and the entire supernatant was transferred to a clean tube. DNA was then quantified using the Qubit dsDNA High Sensitivity Assay according to the manufacturer's instructions (Thermo Fisher Q32851). The library was then diluted to 2 ng/μL and run on an Agilent Tapestation 4150 D5000 tape (Agilent 5067-5592). Library molecule concentrations ranging from 100 to 1000 bp were then used, leading to a final library dilution of 1 nM. The diluted libraries were then sequenced using either a high-capacity or medium-capacity 150 bp sequencing kit on a Nextseq 500 system according to the manufacturer's recommendations (Illumina Inc.).

ｓ３－ＷＧＳライブラリーの生成
処理の前に、以下の緩衝液を調製した：上記のように５０ｍＬのＮＩＢＨＥＰＥＳ緩衝液、並びに最終濃度の１０ｍＭのＴｒｉｓＨＣｌｐＨ７．４（ＬｉｆｅＴｅｃｈｎｏｌｏｇｉｅｓＡＭ９８５５）、１０ｍＭのＮａＣｌ、３ｍＭのＭｇＣｌ２、０．１％（ｖ／ｖ）のＩＧＥＰＡＬＣＡ－６３０、０．１％（ｖ／ｖ）のＴｗｅｅｎを含み、ＰＣＲグレードの超純蒸留水で希釈した５０ｍＬのＴｒｉｓベースのＮＩＢ（ＮＩＢＴｒｉｓ）バリアント。希釈後、Ｐｉｅｒｃｅ（商標）プロテアーゼ阻害剤ミニ錠剤（ＥＤＴＡ不含）２錠を溶解し、懸濁して、核単離中のプロテアーゼ分解を防止した。 Generation of s3-WGS Library Prior to processing, the following buffers were prepared: 50 mL of NIB HEPES buffer as described above, and 50 mL of a Tris-based NIB (NIB Tris) variant containing final concentrations of 10 mM Tris HCl pH 7.4 (Life Technologies AM9855), 10 mM NaCl, 3 mM MgCl2, 0.1% (v/v) IGEPAL CA-630, and 0.1% (v/v) Tween, diluted with PCR-grade ultrapure distilled water. After dilution, two Pierce™ protease inhibitor mini-tablets (EDTA-free) were dissolved and suspended to prevent protease degradation during nuclei isolation.

ｓ３－ＷＧＳライブラリーの調製は、以下のように細胞株で行った。患者由来のＣＲＣ細胞株の場合、処理の前に、Ｔ２５フラスコ上に１×１０６の密度で細胞を播種した。細胞を氷冷１×ＰＢＳ（ＶＷＲ７５８００－９８６）で２回洗浄し、次いで５ｍＬの１×ＴｒｙｐＬＥ（ＴｈｅｒｍｏＦｉｓｈｅｒ１２６０４０３９）で３７℃で１５分間トリプシン処理した。次いで、浮遊細胞を収集し、３００ｒｃｆで４℃で５分間ペレット化した。浮遊増殖細胞株（ＧＭ１２８７８）の場合、細胞を増殖培地からピペットで移し、３００ｒｃｆで４℃で５分間ペレット化した。 s3-WGS library preparation was performed for cell lines as follows. For patient-derived CRC cell lines, cells were seeded at a density of 1 x 106 onto a T25 flask prior to treatment. Cells were washed twice with ice-cold 1x PBS (VWR75800-986) and then trypsinized with 5 mL of 1x TrypLE (Thermo Fisher 12604039) at 37°C for 15 minutes. Suspension cells were then collected and pelleted at 300 reflow for 5 minutes at 4°C. For the suspension-growing cell line (GM12878), cells were pipetted from the growth medium and pelleted at 300 reflow for 5 minutes at 4°C.

初期ペレットに続いて、細胞を氷冷１ｍＬのＮＩＢＨＥＰＥＳで２回洗浄した。２回目の洗浄後、ペレットを３００ｕＬのＮＩＢＨＥＰＥＳに再懸濁した。核を等分し、上記のように定量化し、次いで、定量化に基づいて１００万個の核アリコートを生成した。アリコートを、４℃で５分間３００ｒｃｆの遠心分離によってペレット化し、５ｍＬのＮＩＢＨＥＰＥＳに再懸濁した。次いで、２４６ｕＬの１６％（ｗ／ｖ）ホルムアルデヒド（ＴｈｅｒｍｏＦｉｓｈｅｒ２８９０６）を核懸濁液に添加して（ｆ．ｃ．０．７５％ホルムアルデヒド）、核を軽く固定した。核を、５０ｒｐｍに設定されたオービタルシェーカー上のホルムアルデヒド溶液中で１０分間インキュベートすることによって固定した。次いで、懸濁液を４℃で４分間５００ｒｃｆでペレット化し、上清を吸引した。次いで、ペレットを１ｍＬのＮＩＢＴｒｉｓ緩衝液に再懸濁して、残りのホルムアルデヒドをクエンチした。核を、４℃で４分間５００ｒｃｆで再度ペレット化し、上清を吸引した。ペレットを５００ｕＬ１×ＮＥ緩衝液２．１（ＮＥＢＢ７２０２Ｓ）で１回洗浄し、次いで７６０ｕＬ１×ＮＥ緩衝液２．１で再懸濁した。４０ｕＬの１％ＳＤＳ（ｖ／ｖ）を添加し、試料を３００ｒｃｆで３７℃、２０分間に設定したＴｈｅｒｍｏＭｉｘｅｒ上でインキュベートした。次いで、ヌクレオソーム枯渇核を、５００ｒｃｆで、４℃で５分間ペレット化し、次いで５０ｕＬのＮＩＢトリスに再懸濁した。５ｕＬの核アリコートを採取し、ＮＩＢＴｒｉｓで１：１０に希釈し、次いで上記のように定量化した。Ｎｕｃｌｅｉを、定量化に基づいて、ＮＩＢＴｒｉｓトを添加して、５００核／ｕＬに希釈した。実験設定に応じて、５００核／ｕＬで４２０ｕＬの核を、５４０ｕＬ２×ＴＤ緩衝液と混合した。これに続いて、核をタグメント化し、染色して、フロー選別し、ゲノムＤＮＡをギャップ充填し、アダプターの切り替えを、ｓ３－ＡＴＡＣプロトコルについて説明したように行った。ライブラリー増幅は、上記のようにＰＣＲによって、ライブラリー当たりの初期の捕捉イベントが多い可能性のために少ない総サイクル数（１３～１５）で行った。次いで、前述のように、ライブラリーをクリーンアップし、サイズ選択して、シークエンシングした。 Following the initial pellet, cells were washed twice with 1 mL of ice-cold NIB HEPES. After the second wash, the pellet was resuspended in 300 μL of NIB HEPES. Nuclei were aliquoted and quantified as described above, and then 1 million nuclei aliquots were generated based on the quantification. Aliquots were pelleted by centrifugation at 300 rcf for 5 minutes at 4°C and resuspended in 5 mL of NIB HEPES. Nuclei were then gently fixed by adding 246 μL of 16% (w/v) formaldehyde (Thermo Fisher 28906) to the nuclei suspension (f.c. 0.75% formaldehyde). Nuclei were fixed by incubating in the formaldehyde solution for 10 minutes on an orbital shaker set at 50 rpm. The suspension was then pelleted at 500 rcf for 4 minutes at 4°C, and the supernatant was aspirated. The pellet was then resuspended in 1 mL of NIB Tris buffer to quench any remaining formaldehyde. The nuclei were again pelleted at 500 rcf for 4 minutes at 4°C, and the supernatant was aspirated. The pellet was washed once with 500 μL 1×NE Buffer 2.1 (NEB B7202S) and then resuspended in 760 μL 1×NE Buffer 2.1. 40 μL of 1% SDS (v/v) was added, and the sample was incubated on a ThermoMixer set at 300 rcf for 20 minutes at 37°C. The nucleosome-depleted nuclei were then pelleted at 500 rcf for 5 minutes at 4°C, and then resuspended in 50 μL of NIB Tris. A 5 μL nuclei aliquot was taken, diluted 1:10 with NIB Tris, and then quantified as described above. Based on the quantification, nuclei were diluted to 500 nuclei/μL by adding NIB Tris. Depending on the experimental setup, 420 μL of nuclei at 500 nuclei/μL were mixed with 540 μL 2x TD buffer. Following this, nuclei were tagmented, stained, flow-sorted, gap-filled genomic DNA, and adapter switching were performed as described for the s3-ATAC protocol. Library amplification was performed by PCR as described above, with a low total number of cycles (13-15) to allow for the possibility of a high number of initial capture events per library. Libraries were then cleaned up, size-selected, and sequenced as previously described.

ｓ３－ＧＣＣライブラリーの生成
同じ培養細胞株試料を、ｓ３－ＷＧＳライブラリーの生成について記載したようにサンプリングし、固定されたヌクレオソーム枯渇核の同じプールから処理した。核の定量化に続いて、残りの核懸濁液を全部（試料当たり約２００万～３００万の核）、試料のそれぞれにプールした。核を４℃で５分間５００ｒｃｆでペレット化し、９０ｕＬの１×Ｃｕｔｓｍａｒｔ緩衝液（ＮＥＢＢ７２０４Ｓ）に再懸濁した。１０Ｕ／ｕＬのＡｌｕＩ制限酵素（ＮＥＢＲ０１３７Ｓ）１０ｕＬを各試料に添加した。次いで、試料を、ＴｈｅｒｍｏＭｉｘｅｒ上で、３００ｒｐｍ、３７℃で２時間消化した。消化後、核断片は、近接ライゲーションを受けた。核を４℃で５分間５００ｒｃｆでペレット化し、１００ｕＬのライゲーション反応緩衝液に再懸濁した。ライゲーション緩衝液は、１×Ｔ４ＤＮＡリガーゼ緩衝液＋ＡＴＰ（ＮＥＢＭ０２０２Ｓ）、０．０１％ＴｒｉｔｏｎＸ－１００、０．５ｍＭのＤＴＴ（ＳｉｇｍａＤ０６３２）、２００ＵのＴ４ＤＮＡリガーゼを超純水で希釈した最終濃度を有する混合物である。ライゲーションは、１６℃で１４時間（一晩）行った。このインキュベーション後、核を４℃で５分間５００ｒｃｆでペレット化し、１００ｕＬのＮＩＢＨＥＰＥＳ緩衝液に再懸濁した。核のアリコートを前述のように定量化し、次いで希釈し、アリコートし、タグメント化、プールし、ＤＡＰＩ染色、フロー選別して、ゲノムＤＮＡをギャップ充填し、アダプター切り替えを、ｓ３－ＡＴＡＣプロトコルについて記載したように行った。ライブラリー増幅は、ｓ３－ＷＧＳライブラリーと同じ速度（１３～１５サイクル）で生じ、その後、ライブラリーをプールし、上記のようにクリーンアップし、シークエンシングした。 Generation of s3-GCC Libraries: The same cultured cell line samples were sampled and processed from the same pool of fixed, nucleosome-depleted nuclei as described for generation of the s3-WGS library. Following quantification of the nuclei, the entire remaining nuclear suspension (approximately 2-3 million nuclei per sample) was pooled into each of the samples. Nuclei were pelleted at 500 rcf for 5 minutes at 4°C and resuspended in 90 μL of 1× Cutsmart buffer (NEB B7204S). 10 μL of 10 U/μL AluI restriction enzyme (NEB R0137S) was added to each sample. Samples were then digested for 2 hours at 300 rpm and 37°C on a ThermoMixer. Following digestion, nuclear fragments underwent proximity ligation. Nuclei were pelleted at 500 rcf for 5 minutes at 4°C and resuspended in 100 μL of ligation reaction buffer. The ligation buffer consisted of 1x T4 DNA ligase buffer + ATP (NEB M0202S), 0.01% Triton X-100, 0.5 mM DTT (Sigma D0632), and 200 U of T4 DNA ligase diluted with ultrapure water. Ligation was carried out for 14 hours (overnight) at 16°C. After this incubation, nuclei were pelleted at 500 rcf for 5 minutes at 4°C and resuspended in 100 μL of NIB HEPES buffer. Aliquots of nuclei were quantified as described above, then diluted, aliquoted, tagmented, pooled, DAPI stained, flow sorted, and the genomic DNA gap-filled and adapter-switched as described for the s3-ATAC protocol. Library amplification occurred at the same rate as for s3-WGS libraries (13-15 cycles), after which the libraries were pooled, cleaned up, and sequenced as above.

実施例３
組み合わされたタグメンテーション及びインデックス付けによるライブラリーの調製
以下の実施例において、タグメンテーション及びインデックス付けの工程を使用して、核酸試料からデュアルインデックス付きペアエンドライブラリーを調製するための方法及びシステムを示す。第１のインデックス配列は、タグメンテーションを介して付加され、第２のインデックス配列は、ハイブリダイゼーション及び伸長を介して付加される。 Example 3
Library Preparation by Combined Tagmentation and Indexing In the following examples, methods and systems are presented for preparing dual-indexed paired-end libraries from nucleic acid samples using tagmentation and indexing steps: a first index sequence is added via tagmentation, and a second index sequence is added via hybridization and extension.

この実施例では、５’－プライマー－インデックス－アダプター－ウラシル－トランスポザーゼ認識ドメイン、例えば５’－Ｐ５－ｉ５－Ａ１４－Ｕ－ＭＥ－３’の第１の鎖、及びトランスポザーゼ認識配列の相補体である第２の鎖、例えば５’－ＭＥ’－３’とともにトランスポゾンを有する固定化トランスポソーム複合体を使用する。トランスポゾンの例示的な第１の鎖は、配列番号１であり、トランスポゾンの例示的な第２の鎖は、配列番号１のヌクレオチド５３～７１の相補体である。トランスポソーム複合体は、切断可能なリンカーを介して第１の鎖の５’末端に結合したビオチンを介してビーズ上に固定化される。第２のインデックス配列を含有するオリゴヌクレオチドは、５’－プライマー－インデックス－アダプター－トランスポザーゼ認識配列、例えば、５’－Ｐ７－ｉ７－Ｂ１５－ＭＥ－３’の配列を有する。第２のインデックス配列を含有する例示的なオリゴヌクレオチドは、配列番号２である。第２のインデックス配列は、任意選択的で、３’末端で、例えば、ジデオキシ又はロックド核酸を使用してブロックされる。Ｐ５、ｉ５、Ｐ７、ｉ７、ＭＥ、Ａ１４、及びＢ１５の例示的な配列は、それぞれ配列番号３～９である。 This example uses an immobilized transposome complex having a transposon with a first strand of 5'-primer-index-adapter-uracil-transposase recognition domain, e.g., 5'-P5-i5-A14-U-ME-3', and a second strand that is the complement of the transposase recognition sequence, e.g., 5'-ME'-3'. An exemplary first strand of the transposon is SEQ ID NO:1, and an exemplary second strand of the transposon is the complement of nucleotides 53-71 of SEQ ID NO:1. The transposome complex is immobilized on a bead via biotin attached to the 5' end of the first strand via a cleavable linker. An oligonucleotide containing the second index sequence has a sequence of 5'-primer-index-adapter-transposase recognition sequence, e.g., 5'-P7-i7-B15-ME-3'. An exemplary oligonucleotide containing the second index sequence is SEQ ID NO:2. The second index sequence is optionally blocked at the 3' end, for example, using a dideoxy or locked nucleic acid. Exemplary sequences for P5, i5, P7, i7, ME, A14, and B15 are SEQ ID NOs: 3-9, respectively.

溶液中の核酸を、所望の範囲で９６ウェルプレートの各ウェルに添加する。上記のトランスポゾン配列を有するビーズ結合トランスポソームの懸濁液を各ウェルに添加し、プレートをトランスポザーゼを断片化し、トランスポゾン配列を挿入するのに好適な条件下でインキュベートする。トランスポザーゼ酵素は、例えば、ＳＤＳを添加する又は加熱することによって除去される。ウラシル不耐性ポリメラーゼ（例えば、プルーフリーディングポリメラーゼ又はＰｈｕｓｉｏｎ）を添加して、核酸断片末端と第２のトランスポゾン配列との間のギャップを充填する。ウラシル不耐性ポリメラーゼは、挿入された第１のトランスポゾン配列のＡ１４とＭＥ配列との間に挿入されたウラシルに達すると停止する。タグメント化された核酸は、変性され、第２のインデックス付けオリゴヌクレオチドは、酵素的伸長によってタグメント化された核酸にハイブリダイズする。二重インデックス付き核酸は、増幅を含むがこれらに限定されないシークエンシングのために使用され得るか、又は更に処理され得る。 Nucleic acid in solution is added to each well of a 96-well plate in the desired range. A suspension of bead-bound transposomes carrying the transposon sequence described above is added to each well, and the plate is incubated under conditions suitable for fragmenting the transposase and inserting the transposon sequence. The transposase enzyme is removed, for example, by adding SDS or by heating. A uracil-intolerant polymerase (e.g., proofreading polymerase or Phusion) is added to fill the gap between the ends of the nucleic acid fragment and the second transposon sequence. The uracil-intolerant polymerase stops upon reaching the uracil inserted between A14 of the inserted first transposon sequence and the ME sequence. The tagmented nucleic acid is denatured, and a second indexing oligonucleotide hybridizes to the tagmented nucleic acid by enzymatic extension. The dual-indexed nucleic acid can be used for sequencing, including but not limited to amplification, or further processed.

全ての特許、特許出願、及び刊行物、並びに本明細書で引用した電子的に利用可能な資料の完全な開示（例えば、ＧｅｎＢａｎｋ及びＲｅｆＳｅｑでのヌクレオチド配列の提出、ＳｗｉｓｓＰｒｏｔ、ＰＩＲ、ＰＲＦ、ＰＤＢでのアミノ酸配列の提出、並びにＧｅｎＢａｎｋ及びＲｅｆＳｅｑにおける注釈付きコード領域からの翻訳）は、参照によりその全体が組み込まれる。刊行物で参照されている補足資料（補足表、補足図、補足資料及び方法、並びに／又は補足実験データなど）も同様に、参照によりその全体が組み込まれる。本出願の開示と、参照により本明細書に組み込まれる文書の開示との間に矛盾が存在する場合、本出願の開示が優先するものとする。前述の詳細な説明及び実施例は、理解を明確にするためにのみ提供されている。それから不必要な制限を理解する必要はない。当業者に明らかな変形は、特許請求の範囲によって定義される開示に含まれるため、本開示は、図示及び記載された正確な詳細に限定されない。 The complete disclosures of all patents, patent applications, and publications, as well as electronically available materials cited herein (e.g., nucleotide sequence submissions in GenBank and RefSeq, amino acid sequence submissions in SwissProt, PIR, PRF, and PDB, and translations from annotated coding regions in GenBank and RefSeq), are incorporated by reference in their entirety. Supplementary materials referenced in publications (such as supplementary tables, figures, materials and methods, and/or experimental data) are likewise incorporated by reference in their entirety. In the event of a conflict between the disclosure of this application and the disclosure of a document incorporated by reference herein, the disclosure of this application shall control. The foregoing detailed description and examples are provided for clarity of understanding only. No unnecessary limitations should be understood therefrom. The disclosure is not limited to the exact details shown and described, since variations obvious to those skilled in the art are included in the disclosure defined by the claims.

別途記載のない限り、本明細書及び特許請求の範囲で使用される成分、分子量などの量を表す全ての数は、全ての場合において、用語「約」によって修飾されるものとして理解されるべきである。したがって、別途記載のない限り、本明細書及び特許請求の範囲に記載される数値パラメータは、本開示によって得られることが求められる所望の特性に応じて変化し得る近似値である。少なくとも、かつ均等論を特許請求の範囲に限定する試みとしてではなく、各数値パラメータは、少なくとも、報告された有効桁数に照らして、通常の四捨五入法を適用することによって解釈されるべきである。 Unless otherwise indicated, all numbers expressing quantities of ingredients, molecular weights, and the like used in the specification and claims should be understood to be modified in all instances by the term "about." Accordingly, unless otherwise indicated, the numerical parameters set forth in the specification and claims are approximations that may vary depending upon the desired properties sought to be obtained by the present disclosure. At the very least, and not as an attempt to limit the scope of the claims to the doctrine of equivalents, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.

本開示の広い範囲を示す数値範囲及びパラメータは近似値であることに関わらず、特定の実施例に記載される数値は、可能な限り正確に報告される。しかしながら、全ての数値は、それぞれの試験測定値に見出される標準偏差から必然的に生じる範囲を本質的に含む。 Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the present disclosure are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. However, all numerical values inherently contain ranges necessarily resulting from the standard deviations found in their respective testing measurements.

全ての見出しは読者の便宜のためのものであり、特に明記されていない限り、見出しに続くテキストの意味を制限するために使用されるべきではない。
本発明は、例えば、以下の項目を提供する。
（項目１）
シークエンシングライブラリーを生成するための方法であって、
各末端に第１のアダプター配列を含む複数の対称修飾標的核酸を提供することであって、前記第１のアダプター配列が、ＤＮＡ損傷を含む、提供することと、
前記修飾標的核酸を損傷不耐性ポリメラーゼで伸長して、各鎖の５’末端に前記第１のアダプター配列と、各鎖の３’末端に前記第１のアダプターの一部の相補体とを含む複数の非対称修飾標的核酸を生成することと、を含む、方法。
（項目２）
前記複数の対称修飾標的核酸が、二本鎖であり、各鎖が、５’から３’に前記ＤＮＡ損傷を含む前記第１のアダプター配列と、前記標的核酸と、少なくとも１つのヌクレオチドを含むギャップと、前記ＤＮＡ損傷を含まない前記第１のアダプター配列の一部の相補体と、を含む、項目１に記載の方法。
（項目３）
前記伸長が、前記ギャップで開始する、項目１に記載の方法。
（項目４）
プライマーを前記複数の非対称修飾標的核酸にアニーリングすることであって、前記プライマーが、５’から３’に第２のアダプター配列及びアニーリングドメインを含み、前記アニーリングドメインが、前記複数の非対称修飾標的核酸の前記第１のアダプターの前記一部の前記相補体にアニーリングするヌクレオチド配列を含む、アニーリングすることと、
アニーリングされた前記非対称修飾標的核酸の３’末端を、損傷不耐性ポリメラーゼで伸長することと、を更に含み、
前記伸長が、５’から３’に（ｉ）前記第１のアダプター、（ｉｉ）前記標的核酸、（ｉｉｉ）前記第１のアダプターの前記一部の前記相補体、及び（ｉｖ）前記第２のアダプターの相補体を含む、複数の非対称修飾標的核酸をもたらす、項目２に記載の方法。
（項目５）
前記アニーリングされた非対称修飾標的核酸の３’末端の前記伸長が、少なくとも３回繰り返される、項目４に記載の方法。
（項目６）
前記ＤＮＡ損傷が、脱塩基部位、修飾塩基、ミスマッチ、一本鎖切断、又は架橋ヌクレオチドのうちの少なくとも１つを含む、項目１に記載の方法。
（項目７）
前記ＤＮＡ損傷が、少なくとも１つのウラシルを含む、項目１に記載の方法。
（項目８）
前記プライマーの前記アニーリングドメインが、対応する天然ＤＮＡヌクレオチドと比較して、融解温度を増加させる少なくとも１つの改変ヌクレオチドを含む、項目４に記載の方法。
（項目９）
前記改変ヌクレオチドが、ロックド核酸、ＰＮＡ、又はＲＮＡを含む、項目８に記載の方法。
（項目１０）
前記プライマーの３’末端がブロックされている、項目４に記載の方法。
（項目１１）
前記第１のアダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のユニバーサル分子識別子、又はそれらの組み合わせを含む、項目１に記載の方法。
（項目１２）
前記１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子のうちの少なくとも１つが、前記ＤＮＡ損傷と前記標的核酸の遠位の
前記アダプターの末端との間の前記アダプターに位置する、項目１１に記載の方法。
（項目１３）
前記第２のアダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のユニバーサル分子識別子、又はそれらの組み合わせを含む、項目４に記載の方法。
（項目１４）
前記第１のアダプターの前記１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子が、前記第２のアダプターの前記１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子と比較して固有である、項目１３に記載の方法。
（項目１５）
前記第１のアダプターの前記１つ以上のインデックス配列が、区画特異的である、項目１１に記載の方法。
（項目１６）
前記第２のアダプターの前記１つ以上のインデックス配列が、区画特異的である、項目１３に記載の方法。
（項目１７）
前記第１のアダプターが、トランスポザーゼ認識部位を含む、項目１に記載の方法。
（項目１８）
前記標的核酸が、単一細胞に由来する核酸由来である、項目１に記載の方法。
（項目１９）
前記標的核酸が、複数の細胞に由来する核酸由来である、項目１に記載の方法。
（項目２０）
単一細胞又は複数の細胞に由来する前記標的核酸が、ＲＮＡを含む、項目１８又は１９に記載の方法。
（項目２１）
前記ＲＮＡが、ｍＲＮＡを含む、項目２０に記載の方法。
（項目２２）
単一細胞又は複数の細胞に由来する前記標的核酸が、ＤＮＡを含む、項目１８又は１９に記載の方法。
（項目２３）
前記ＤＮＡが、全細胞ゲノムＤＮＡを含む、項目２２に記載の方法。
（項目２４）
前記全細胞ゲノムＤＮＡが、ヌクレオソームを含む、項目２３に記載の方法。
（項目２５）
前記標的核酸が、無細胞ＤＮＡに由来する核酸由来である、項目１に記載の方法。
（項目２６）
前記方法が、コンビナトリアルインデックス付けを含む、項目１１～１６のいずれか一項に記載の方法。
（項目２７）
前記非対称修飾標的核酸を増幅することを更に含み、
前記増幅が、第２のプライマー及び損傷耐性ポリメラーゼを含み、
前記第２のプライマーが、前記第１のアダプター配列又は前記その相補体にアニーリングするヌクレオチド配列を含む、項目１に記載の方法。
（項目２８）
前記第２のプライマーが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のユニバーサル分子識別子、又はそれらの組み合わせを更に含む、項目２７に記載の方法。
（項目２９）
前記第２のプライマーの前記１つ以上のユニバーサル配列、１つ以上のインデックス配
列、及び１つ以上のユニバーサル分子識別子が、前記第１のアダプター及び前記第２のアダプターの前記１つ以上のユニバーサル配列、１つ以上のインデックス配列、及び１つ以上のユニバーサル分子識別子と比較して固有である、項目２８に記載の方法。
（項目３０）
前記複数の非対称修飾標的核酸のサブセットが、複数の区画内に存在し、（ｉ）前記第１のアダプターが、第１の区画特異的インデックスを含むか、（ｉｉ）前記第２のアダプターが、第２の区画特異的インデックスを含むか、又は（ｉ）及び（ｉｉ）の両方のいずれかである、項目１に記載の方法。
（項目３１）
異なる区画からの前記非対称修飾標的核酸を組み合わせて、プールされたインデックス付き非対称修飾標的核酸を生成することを更に含む、項目３０に記載の方法。
（項目３２）
前記プールされたインデックス付き非対称修飾標的核酸のサブセットを第２の複数の区画に分配し、前記インデックス付き非対称修飾標的核酸を修飾することを更に含み、
前記修飾が、各サブセット中に存在する前記インデックス付き非対称修飾標的核酸に追加の区画特異的インデックス配列を付加して、インデックス付きＤＮＡ核酸をもたらすことを含み、
前記修飾が、ライゲーション又は伸長を含む、項目３１に記載の方法。
（項目３３）
前記区画が、ウェル又は液滴を含む、項目３０～３２のいずれか一項に記載の方法。
（項目３４）
前記提供が、複数のＤＮＡ断片を前記第１のアダプターと、前記ＤＮＡ断片の両端に前記第１のアダプターをライゲートする条件下で、接触させることを含む、項目１に記載の方法。
（項目３５）
前記ＤＮＡ断片が、二本鎖及び平滑末端である、項目３４に記載の方法。
（項目３６）
前記第１のアダプターが、二本鎖ＤＮＡオリゴヌクレオチドである、項目３４又は３５に記載の方法。
（項目３７）
前記第１のアダプターの一方の３’末端が、ブロックされている、項目３４又は３５に記載の方法。
（項目３８）
前記ＤＮＡ断片が、二本鎖であり、一方又は両方の３’末端に一本鎖領域を含む、項目３４に記載の方法。
（項目３９）
前記第１のアダプターが、一方の末端に一本鎖領域を含む二本鎖ＤＮＡオリゴヌクレオチドであり、前記一本鎖領域が、前記ＤＮＡ断片上に存在する前記一本鎖領域にアニーリングすることができる、項目３４又は３８に記載の方法。
（項目４０）
前記アダプターが、フォーク型アダプターである、項目３４、３５、又は３８に記載の方法。
（項目４１）
前記提供が、ＤＮＡをトランスポソーム複合体と接触させることを含み、前記トランスポソーム複合体が、トランスポザーゼと、前記第１のアダプターと、を含み、前記接触が、前記第１のアダプターの前記ＤＮＡへのライゲーションに好適な条件下で生じて、前記対称修飾標的核酸を生成する、項目１に記載の方法。
（項目４２）
生成された前記対称修飾標的核酸が、ライゲートされた前記第１のアダプターと前記標的核酸との間で１つの鎖中に少なくとも１つのヌクレオチドのギャップを含む、項目４
１に記載の方法。
（項目４３）
前記ＤＮＡが、複数の区画内に存在し、各区画内の前記第１のアダプターが、区画特異的インデックスを含む、項目４１又は４２に記載の方法。
（項目４４）
異なる区画からの一本鎖修飾標的核酸を組み合わせて、プールされた対称修飾標的核酸を生成することと、前記対称修飾標的核酸を第２の複数の区画に分配することと、を更に含む、項目４２に記載の方法。
（項目４５）
前記方法が、前記全細胞ゲノムＤＮＡの断片化を更に含む、項目２３に記載の方法。
（項目４６）
前記断片化が、制限エンドヌクレアーゼを用いた前記全細胞ゲノムＤＮＡの消化を含む、項目４５に記載の方法。
（項目４７）
断片化された前記ＤＮＡが、キメラ標的核酸を結合するための近接ライゲーションに供される、項目４５又は４６に記載の方法。
（項目４８）
アダプターのシトシン残基が、５－メチルシトシンで置き換えられる、項目１又は４に記載の方法。
（項目４９）
前記対称又は非対称標的核酸が、化学的又は酵素的メチル化変換に供される、項目４８に記載の方法。
（項目５０）
前記提供が、単離された核を固定することと、前記単離された核を、ゲノムＤＮＡからヌクレオソームを解離させる条件に供することと、前記ゲノムＤＮＡを断片化することと、キメラ標的核酸を結合する近接ライゲーションに前記断片を供することと、ライゲートされた前記断片をトランスポソーム複合体に接触させることと、を含み、前記トランスポソーム複合体が、トランスポザーゼと、前記第１のアダプターと、を含み、前記接触が、前記第１のアダプターの前記ＤＮＡへのライゲーションに好適な条件下で生じて、前記対称修飾標的核酸を生成する、項目１に記載の方法。
（項目５１）
前記断片化が、制限エンドヌクレアーゼによる消化を含む、項目５０に記載の方法。
（項目５２）
複数の増幅部位を含む表面を提供することであって、
前記増幅部位が、遊離３’末端を有する結合した一本鎖捕捉オリゴヌクレオチドの少なくとも２つの集団を含む、提供することと、
個々の非対称修飾標的核酸からのアンプリコンのクローン集団を各々含む複数の増幅部位を生成するのに好適な条件下で、前記増幅部位を含む表面を、前記複数の非対称修飾標的核酸と接触させることと、を更に含む、項目４に記載の方法。
（項目５３）
トランスポソーム複合体及びＤＮＡポリメラーゼを含む組成物であって、
前記トランスポソームが、トランスポゾン配列に結合したトランスポザーゼを含み、
前記トランスポゾン配列が、アダプター及びＤＮＡ損傷を含み、
前記ＤＮＡポリメラーゼが、損傷不耐性ポリメラーゼである、組成物。
（項目５４）
前記アダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含む、項目５３に記載の組成物。
（項目５５）
損傷耐性ＤＮＡポリメラーゼを更に含む、項目５３又は５４に記載の組成物。
（項目５６）
組成物であって、
５’から３’にＤＮＡ損傷を含む第１のアダプター、標的核酸、及び前記第１のアダプターの相補体を含む複数の修飾標的核酸と、
５’から３’に第２のアダプターと、アニーリングドメインと、を含む、プライマーであって、前記アニーリングドメインが、前記第１のアダプターの前記相補体にアニーリングするヌクレオチド配列を含む、プライマーと、
損傷不耐性ＤＮＡポリメラーゼと、を含む、組成物。
（項目５７）
前記プライマーが、対応する天然ＤＮＡヌクレオチドと比較して、融解温度を増加させる少なくとも１つの改変ヌクレオチドを含む、項目５６に記載の組成物。
（項目５８）
前記プライマーが、標的核酸にアニーリングされる、項目５６又は５７に記載の組成物。
（項目５９）
前記プライマーの３’末端が、ブロックされている、項目５６に記載の組成物。
（項目６０）
前記第１のアダプターが、トランスポザーゼ認識部位を含む、項目５６に記載の組成物。
（項目６１）
別々の容器中のトランスポソーム複合体及びＤＮＡポリメラーゼであって、
トランスポソームが、トランスポゾン配列に結合したトランスポザーゼを含み、
前記トランスポゾン配列が、第１のアダプター及びＤＮＡ損傷を含み、
前記ＤＮＡポリメラーゼが、損傷不耐性ポリメラーゼである、トランスポソーム複合体及びＤＮＡポリメラーゼと、
使用説明書と、を含む、キット。
（項目６２）
別々の容器中に第２のＤＮＡポリメラーゼを更に含み、
前記第２のＤＮＡポリメラーゼが、損傷耐性ポリメラーゼである、項目６１に記載のキット。
（項目６３）
プライマーを更に含み、
前記プライマーが、５’から３’に第２のアダプター及びアニーリングドメインを含み、前記アニーリングドメインが、前記第１のアダプターの相補体にアニーリングするヌクレオチド配列を含む、項目６１又は６２に記載のキット。
（項目６４）
前記プライマーの３’末端が、ブロックされている、項目６３に記載のキット。
（項目６５）
前記第１のアダプターが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを含む、項目６１に記載のキット。
（項目６６）
前記第２のアダプタープライマーが、１つ以上のユニバーサル配列、１つ以上のインデックス配列、１つ以上のＵＭＩ、又はそれらの組み合わせを更に含む、項目６３に記載のキット。
（項目６７）
トランスポソーム複合体であって、
トランスポザーゼと、
第１の鎖上に、５’から３’に、少なくとも１つのユニバーサル配列、少なくとも１つのインデックス配列、少なくとも１つのＵＭＩ、又はそれらの組み合わせ、ＤＮＡ損傷、及びトランスポザーゼ認識配列を含み、第２の鎖上に、前記トランスポザーゼ認識配列の少なくとも一部に相補的なヌクレオチドを含むアダプターを含む核酸を含むトランスポゾ
ンと、を含む、トランスポソーム複合体。
（項目６８）
前記第１の鎖が、前記第１の鎖の５’末端に捕捉剤を更に含む、項目６７に記載のトランスポソーム複合体。
（項目６９）
前記第１の鎖が、捕捉剤と前記５’末端との間に位置する切断可能なリンカーを更に含む、項目６８に記載のトランスポソーム複合体。
（項目７０）
前記第２の鎖が、前記第２の鎖の３’末端に捕捉剤を更に含む、項目６７に記載のトランスポソーム複合体。
（項目７１）
前記第２の鎖が、捕捉剤と前記３’末端との間に位置する切断可能なリンカーを更に含む、項目７０に記載のトランスポソーム複合体。 All headings are for the convenience of the reader and should not be used to limit the meaning of the text that follows the heading, unless specifically stated.
The present invention provides, for example, the following items.
(Item 1)
1. A method for generating a sequencing library, comprising:
providing a plurality of symmetrically modified target nucleic acids comprising a first adaptor sequence at each end, wherein the first adaptor sequence comprises a DNA lesion;
extending the modified target nucleic acid with a damage-intolerant polymerase to generate a plurality of asymmetrically modified target nucleic acids comprising the first adapter sequence at the 5' end of each strand and the complement of a portion of the first adapter at the 3' end of each strand.
(Item 2)
2. The method of claim 1, wherein the plurality of symmetrically modified target nucleic acids are double-stranded, each strand comprising the first adapter sequence comprising the DNA lesion 5' to 3', the target nucleic acid, a gap comprising at least one nucleotide, and the complement of a portion of the first adapter sequence that does not comprise the DNA lesion.
(Item 3)
2. The method of claim 1, wherein the extension begins at the gap.
(Item 4)
annealing a primer to the plurality of asymmetrically modified target nucleic acids, the primer comprising a second adapter sequence and an annealing domain from 5' to 3', the annealing domain comprising a nucleotide sequence that anneals to the complement of the portion of the first adapter of the plurality of asymmetrically modified target nucleic acids;
extending the 3' end of the annealed asymmetrically modified target nucleic acid with a damage-intolerant polymerase;
3. The method of claim 2, wherein the extension results in a plurality of asymmetrically modified target nucleic acids comprising, from 5' to 3', (i) the first adaptor, (ii) the target nucleic acid, (iii) the complement of the portion of the first adaptor, and (iv) the complement of the second adaptor.
(Item 5)
5. The method of claim 4, wherein the extension of the 3' end of the annealed asymmetrically modified target nucleic acid is repeated at least three times.
(Item 6)
2. The method of claim 1, wherein the DNA damage comprises at least one of an abasic site, a modified base, a mismatch, a single-strand break, or a crosslinked nucleotide.
(Item 7)
2. The method of claim 1, wherein the DNA damage comprises at least one uracil.
(Item 8)
5. The method of claim 4, wherein the annealing domain of the primer comprises at least one modified nucleotide that increases the melting temperature compared to the corresponding natural DNA nucleotide.
(Item 9)
9. The method of claim 8, wherein the modified nucleotide comprises a locked nucleic acid, PNA, or RNA.
(Item 10)
5. The method of claim 4, wherein the 3' end of the primer is blocked.
(Item 11)
2. The method of claim 1, wherein the first adapter comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.
(Item 12)
At least one of the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers is located distal to the DNA lesion and the target nucleic acid.
12. The method of claim 11, wherein the adaptor is located between the ends of the adaptor.
(Item 13)
5. The method of claim 4, wherein the second adapter comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.
(Item 14)
14. The method of claim 13, wherein the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the first adaptor are unique compared to the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the second adaptor.
(Item 15)
12. The method of claim 11, wherein the one or more index sequences of the first adaptor are compartment-specific.
(Item 16)
14. The method of claim 13, wherein the one or more index sequences of the second adaptor are compartment-specific.
(Item 17)
2. The method of claim 1, wherein the first adapter comprises a transposase recognition site.
(Item 18)
2. The method of claim 1, wherein the target nucleic acid is derived from nucleic acid derived from a single cell.
(Item 19)
2. The method of claim 1, wherein the target nucleic acid is derived from nucleic acids derived from a plurality of cells.
(Item 20)
20. The method of claim 18 or 19, wherein the target nucleic acid derived from a single cell or multiple cells comprises RNA.
(Item 21)
21. The method of claim 20, wherein the RNA comprises mRNA.
(Item 22)
20. The method of claim 18 or 19, wherein the target nucleic acid derived from a single cell or multiple cells comprises DNA.
(Item 23)
23. The method of claim 22, wherein the DNA comprises whole cell genomic DNA.
(Item 24)
24. The method of claim 23, wherein the whole cell genomic DNA comprises nucleosomes.
(Item 25)
2. The method of claim 1, wherein the target nucleic acid is derived from a nucleic acid derived from cell-free DNA.
(Item 26)
17. The method of any one of items 11 to 16, wherein the method comprises combinatorial indexing.
(Item 27)
further comprising amplifying the asymmetrically modified target nucleic acid;
the amplification comprises a second primer and a damage-tolerant polymerase;
2. The method of claim 1, wherein the second primer comprises a nucleotide sequence that anneals to the first adapter sequence or its complement.
(Item 28)
28. The method of claim 27, wherein the second primer further comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.
(Item 29)
The one or more universal sequences, one or more index sequences of the second primer
29. The method of claim 28, wherein the sequence, and the one or more universal molecular identifiers are unique compared to the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the first adaptor and the second adaptor.
(Item 30)
2. The method of claim 1, wherein a subset of the plurality of asymmetrically modified target nucleic acids is present within a plurality of compartments, and wherein (i) the first adapter comprises a first compartment-specific index, (ii) the second adapter comprises a second compartment-specific index, or both (i) and (ii).
(Item 31)
31. The method of claim 30, further comprising combining the asymmetrically modified target nucleic acids from different compartments to generate pooled indexed asymmetrically modified target nucleic acids.
(Item 32)
further comprising distributing a subset of the pooled indexed asymmetrically modified target nucleic acids into a second plurality of compartments and modifying the indexed asymmetrically modified target nucleic acids;
the modification comprises adding an additional compartment-specific index sequence to the indexed asymmetrically modified target nucleic acids present in each subset to provide indexed DNA nucleic acids;
32. The method of claim 31, wherein the modification comprises ligation or extension.
(Item 33)
33. The method according to any one of items 30 to 32, wherein the compartment comprises a well or a droplet.
(Item 34)
2. The method of claim 1, wherein the providing comprises contacting a plurality of DNA fragments with the first adaptors under conditions that ligate the first adaptors to both ends of the DNA fragments.
(Item 35)
35. The method of claim 34, wherein the DNA fragment is double-stranded and blunt-ended.
(Item 36)
36. The method of claim 34, wherein the first adaptor is a double-stranded DNA oligonucleotide.
(Item 37)
36. The method according to item 34 or 35, wherein one 3' end of the first adaptor is blocked.
(Item 38)
35. The method of claim 34, wherein the DNA fragment is double-stranded and contains a single-stranded region at one or both 3' ends.
(Item 39)
39. The method of claim 34 or 38, wherein the first adaptor is a double-stranded DNA oligonucleotide comprising a single-stranded region at one end, wherein the single-stranded region is capable of annealing to the single-stranded region present on the DNA fragment.
(Item 40)
39. The method of claim 34, 35, or 38, wherein the adapter is a forked adapter.
(Item 41)
2. The method of claim 1, wherein the providing comprises contacting DNA with a transposome complex, the transposome complex comprising a transposase and the first adapter, and the contacting occurs under conditions suitable for ligation of the first adapter to the DNA to generate the symmetrically modified target nucleic acid.
(Item 42)
Item 4. The symmetrically modified target nucleic acid produced comprises a gap of at least one nucleotide in one strand between the ligated first adapter and the target nucleic acid.
1. The method according to claim 1.
(Item 43)
43. The method of claim 41 or 42, wherein the DNA is present in multiple compartments, and the first adapter in each compartment comprises a compartment-specific index.
(Item 44)
43. The method of claim 42, further comprising combining single-stranded modified target nucleic acids from different compartments to generate pooled symmetrically modified target nucleic acids, and distributing the symmetrically modified target nucleic acids into a second plurality of compartments.
(Item 45)
24. The method of claim 23, wherein the method further comprises fragmenting the whole cell genomic DNA.
(Item 46)
46. The method of claim 45, wherein the fragmenting comprises digestion of the total cellular genomic DNA with a restriction endonuclease.
(Item 47)
47. The method of claim 45 or 46, wherein the fragmented DNA is subjected to proximity ligation to attach chimeric target nucleic acids.
(Item 48)
5. The method of claim 1 or 4, wherein cytosine residues of the adapter are replaced with 5-methylcytosine.
(Item 49)
49. The method of claim 48, wherein the symmetric or asymmetric target nucleic acid is subjected to a chemical or enzymatic methylation conversion.
(Item 50)
2. The method of claim 1, wherein said providing comprises fixing isolated nuclei, subjecting said isolated nuclei to conditions that dissociate nucleosomes from genomic DNA, fragmenting said genomic DNA, subjecting said fragments to proximity ligation that joins a chimeric target nucleic acid, and contacting said ligated fragments with a transposome complex, wherein said transposome complex comprises a transposase and said first adapter, and said contacting occurs under conditions suitable for ligation of said first adapter to said DNA to produce said symmetrically modified target nucleic acid.
(Item 51)
51. The method of claim 50, wherein the fragmentation comprises digestion with a restriction endonuclease.
(Item 52)
providing a surface comprising a plurality of amplification sites,
providing the amplification site comprising at least two populations of linked single-stranded capture oligonucleotides having free 3'ends;
5. The method of claim 4, further comprising contacting a surface comprising the amplification sites with the plurality of asymmetrically modified target nucleic acids under conditions suitable to generate a plurality of amplification sites each comprising a clonal population of amplicons from an individual asymmetrically modified target nucleic acid.
(Item 53)
A composition comprising a transposome complex and a DNA polymerase,
the transposome comprises a transposase bound to a transposon sequence;
the transposon sequence comprises an adapter and a DNA lesion;
The composition, wherein the DNA polymerase is a damage-intolerant polymerase.
(Item 54)
54. The composition of claim 53, wherein the adaptor comprises one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof.
(Item 55)
55. The composition of claim 53 or 54, further comprising a damage-tolerant DNA polymerase.
(Item 56)
1. A composition comprising:
a plurality of modified target nucleic acids comprising a first adaptor comprising a 5' to 3' DNA lesion, a target nucleic acid, and a complement of the first adaptor;
a primer comprising a second adaptor from 5' to 3' and an annealing domain, wherein the annealing domain comprises a nucleotide sequence that anneals to the complement of the first adaptor;
A composition comprising: a damage-intolerant DNA polymerase.
(Item 57)
57. The composition of claim 56, wherein the primer comprises at least one modified nucleotide that increases the melting temperature compared to the corresponding natural DNA nucleotide.
(Item 58)
58. The composition of claim 56 or 57, wherein the primer is annealed to a target nucleic acid.
(Item 59)
57. The composition of claim 56, wherein the 3' end of the primer is blocked.
(Item 60)
57. The composition of claim 56, wherein the first adapter comprises a transposase recognition site.
(Item 61)
a transposome complex and a DNA polymerase in separate containers,
the transposome comprises a transposase bound to a transposon sequence;
the transposon sequence comprises a first adapter and a DNA lesion;
a transposome complex and a DNA polymerase, wherein the DNA polymerase is a damage-intolerant polymerase;
A kit including instructions for use.
(Item 62)
further comprising a second DNA polymerase in a separate container;
62. The kit of claim 61, wherein the second DNA polymerase is a damage-tolerant polymerase.
(Item 63)
further comprising a primer,
63. The kit of claim 61 or 62, wherein the primer comprises a second adapter and an annealing domain from 5' to 3', the annealing domain comprising a nucleotide sequence that anneals to the complement of the first adapter.
(Item 64)
64. The kit according to claim 63, wherein the 3' end of the primer is blocked.
(Item 65)
62. The kit of claim 61, wherein the first adapter comprises one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof.
(Item 66)
64. The kit of claim 63, wherein the second adapter primer further comprises one or more universal sequences, one or more index sequences, one or more UMIs, or a combination thereof.
(Item 67)
A transposome complex comprising:
Transposase and
a transposon comprising a nucleic acid comprising, on a first strand, from 5' to 3', at least one universal sequence, at least one index sequence, at least one UMI, or a combination thereof, DNA damage, and a transposase recognition sequence; and, on a second strand, an adapter comprising a nucleotide complementary to at least a portion of the transposase recognition sequence.
and a transposome complex comprising:
(Item 68)
68. The transposome complex of claim 67, wherein the first strand further comprises a capture agent at the 5' end of the first strand.
(Item 69)
69. The transposome complex of claim 68, wherein the first strand further comprises a cleavable linker positioned between the capture agent and the 5′ end.
(Item 70)
68. The transposome complex of claim 67, wherein the second strand further comprises a capture agent at the 3' end of the second strand.
(Item 71)
71. The transposome complex of claim 70, wherein the second strand further comprises a cleavable linker positioned between the capture agent and the 3′ end.

Claims

1. A method for generating a sequencing library , the method comprising:
providing a plurality of double-stranded symmetrically modified target nucleic acids comprising a first adapter sequence at each end;
wherein the first adapter sequence comprises a DNA lesion;
wherein each strand of the symmetrically modified target nucleic acid comprises, from 5' to 3', the first adapter sequence comprising the DNA lesion, the target nucleic acid, a gap comprising at least one nucleotide, and the complement of a portion of the first adapter sequence that does not comprise the DNA lesion .
To provide and
extending the modified target nucleic acid with a damage-intolerant polymerase to generate a plurality of asymmetrically modified target nucleic acids comprising the first adapter sequence at the 5'-end of each strand and the complement of a portion of the first adapter at the 3'-end of each strand;
annealing a primer to the plurality of asymmetrically modified target nucleic acids, the primer comprising a second adapter sequence and an annealing domain from 5' to 3', the annealing domain comprising a nucleotide sequence that anneals to the complement of the portion of the first adapter of the plurality of asymmetrically modified target nucleic acids;
wherein the annealing domain comprises at least one modified nucleotide that increases the melting temperature compared to the corresponding natural DNA nucleotide.
Annealing, and
extending the 3' end of the annealed asymmetrically modified target nucleic acid with a damage-intolerant polymerase using the primer as a template;
wherein the extending results in a plurality of asymmetrically modified target nucleic acids comprising, from 5' to 3': (i) the first adaptor, (ii) the target nucleic acid, (iii) the complement of the portion of the first adaptor, and (iv) the complement of the second adaptor .

The method of claim 1, wherein the extension begins at the gap.

The method of claim 1 , wherein the extension of the 3′ end of the annealed asymmetrically modified target nucleic acid is repeated at least three times.

The method of claim 1, wherein the DNA damage comprises at least one of an abasic site, a modified base, a mismatch, a single-strand break, or a crosslinked nucleotide.

The method of claim 1, wherein the DNA damage includes at least one uracil.

10. The method of claim 1 , wherein the modified nucleotide comprises a locked nucleic acid, a PNA, or an RNA.

The method of claim 1 , wherein the 3′ end of the primer is blocked.

The method of claim 1, wherein the first adapter comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.

9. The method of claim 8, wherein at least one of the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers is located in the adaptor between the DNA lesion and the end of the adaptor distal to the target nucleic acid .

10. The method of claim 1 , wherein the second adaptor comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.

11. The method of claim 10, wherein the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the first adaptor are unique compared to the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the second adaptor.

9. The method of claim 8 , wherein the one or more index sequences of the first adaptor are compartment-specific.

The method of claim 10 , wherein the one or more index sequences of the second adaptor are compartment-specific.

The method of claim 1, wherein the first adapter comprises a transposase recognition site.

10. The method of claim 1, wherein the target nucleic acid is derived from nucleic acid derived from a single cell, and the nucleic acid comprises RNA or DNA .

10. The method of claim 1, wherein the target nucleic acid is derived from nucleic acids derived from a plurality of cells , and the nucleic acids comprise RNA or DNA .

17. The method of claim 15 or 16 , wherein the RNA comprises mRNA.

17. The method of claim 15 or 16 , wherein the DNA comprises whole cell genomic DNA.

19. The method of claim 18 , wherein the whole cell genomic DNA comprises nucleosomes.

The method of claim 1, wherein the target nucleic acid is derived from nucleic acid derived from cell-free DNA.

The method of any one of claims 8 to 13 , wherein the method comprises combinatorial indexing.

further comprising amplifying the asymmetrically modified target nucleic acid;
the amplification comprises a second primer and a damage-tolerant polymerase;
The method of claim 1 , wherein the second primer comprises a nucleotide sequence that anneals to the first adapter sequence or its complement.

23. The method of claim 22 , wherein the second primer further comprises one or more universal sequences, one or more index sequences, one or more universal molecular identifiers, or a combination thereof.

23. The method of claim 22, wherein the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the second primer are unique compared to the one or more universal sequences, one or more index sequences, and one or more universal molecular identifiers of the first adaptor and the second adaptor.

The method of claim 1, wherein a subset of the plurality of asymmetrically modified target nucleic acids is present within a plurality of compartments, and (i) the first adapter comprises a first compartment-specific index, (ii) the second adapter comprises a second compartment-specific index, or both (i) and (ii).

26. The method of claim 25 , further comprising combining the asymmetrically modified target nucleic acids from different compartments to generate pooled indexed asymmetrically modified target nucleic acids.

further comprising distributing a subset of the pooled indexed asymmetrically modified target nucleic acids into a second plurality of compartments and modifying the indexed asymmetrically modified target nucleic acids;
the modification comprises adding an additional compartment-specific index sequence to the indexed asymmetrically modified target nucleic acids present in each subset to provide indexed DNA nucleic acids;
27. The method of claim 26 , wherein the modification comprises ligation or extension.

28. The method of claim 25 , wherein subsets of the plurality of asymmetrically modified target nucleic acids are present in a plurality of compartments, the compartments comprising wells or droplets.

The method of claim 1, wherein the providing comprises contacting a plurality of DNA fragments with the first adaptors under conditions that ligate the first adaptors to both ends of the DNA fragments.

30. The method of claim 29 , wherein the DNA fragment is double-stranded and blunt-ended.

31. The method of claim 29 or 30 , wherein the first adaptor is a double-stranded DNA oligonucleotide.

31. The method of claim 29 or 30 , wherein one 3' end of the first adaptor is blocked.

30. The method of claim 29 , wherein the DNA fragment is double-stranded and contains a single-stranded region at one or both 3' ends.

29. The method of claim 33, wherein the first adaptor is a double-stranded DNA oligonucleotide comprising a single-stranded region at one end, the single-stranded region being capable of annealing to the single -stranded region present on the DNA fragment .

29. The method of claim 28 , wherein the adapter is a forked adapter.

The method of claim 1, wherein the providing comprises contacting DNA with a transposome complex, the transposome complex comprising a transposase and the first adapter, and the contacting occurs under conditions suitable for ligation of the first adapter to the DNA to produce the symmetrically modified target nucleic acid.

37. The method of Claim 36 , wherein the symmetrically modified target nucleic acid produced comprises a gap of at least one nucleotide in one strand between the ligated first adaptor and the target nucleic acid.

38. The method of claim 36 or 37 , wherein the DNA is present in multiple compartments, and the first adapter in each compartment comprises a compartment-specific index.

38. The method of claim 37, further comprising combining single-stranded modified target nucleic acids from different compartments to generate pooled symmetrically modified target nucleic acids, and distributing the symmetrically modified target nucleic acids into a second plurality of compartments.

20. The method of claim 18 , wherein the method further comprises fragmenting the whole cell genomic DNA.

41. The method of claim 40 , wherein said fragmenting comprises digestion of said total cellular genomic DNA with a restriction endonuclease.

4. The method of claim 40 or 41 , wherein the fragmented DNA is subjected to proximity ligation to attach a chimeric target nucleic acid.

2. The method of claim 1 , wherein the cytosine residues of the adapter are replaced with 5-methylcytosine.

The method of claim 43 , wherein the symmetric or asymmetric target nucleic acid is subjected to chemical or enzymatic methylation conversion.

The method of claim 1, wherein the providing comprises fixing isolated nuclei, subjecting the isolated nuclei to conditions that dissociate nucleosomes from genomic DNA, fragmenting the genomic DNA, subjecting the fragments to proximity ligation that joins a chimeric target nucleic acid, and contacting the ligated fragments with a transposome complex, the transposome complex comprising a transposase and the first adapter, and the contacting occurs under conditions suitable for ligation of the first adapter to the DNA to produce the symmetrically modified target nucleic acid.

46. The method of claim 45 , wherein said fragmenting comprises digestion with a restriction endonuclease.

providing a surface comprising a plurality of amplification sites,
providing the amplification site comprising at least two populations of linked single-stranded capture oligonucleotides having free 3'ends;
2. The method of claim 1, further comprising contacting a surface comprising the amplification sites with the plurality of asymmetrically modified target nucleic acids under conditions suitable to generate a plurality of amplification sites each comprising a clonal population of amplicons from an individual asymmetrically modified target nucleic acid.

1. A composition comprising:
a plurality of modified target nucleic acids comprising, from 5' to 3' , a first adaptor comprising a DNA lesion, a target nucleic acid, a gap comprising at least one nucleotide, and a complement of a portion of the first adaptor sequence that does not comprise the DNA lesion ;
a primer comprising, from 5' to 3' , a second adapter and an annealing domain, wherein the annealing domain comprises a nucleotide sequence that anneals to the complement of the portion of the first adapter, and wherein the annealing domain comprises at least one modified nucleotide that increases the melting temperature compared to a corresponding natural DNA nucleotide ;
A composition comprising: a damage-intolerant DNA polymerase ;

49. The composition of claim 48 , wherein the primer is annealed to a target nucleic acid.

49. The composition of claim 48 , wherein the 3' end of the primer is blocked.

49. The composition of claim 48 , wherein the first adapter comprises a transposase recognition site.

The composition of claim 7 , wherein the 3′ end of the primer comprises a dideoxynucleotide.