Home Journals Progress in Chemistry
Progress in Chemistry

Abbreviation (ISO4): Prog Chem      Editor in chief: Jincai ZHAO

About  /  Aim & scope  /  Editorial board  /  Indexed  /  Contact  / 
Review

Intracellular Single Strand DNA and High-Throughput Analysis Techniques

  • Ruiqi Li 1, 2 ,
  • Weiyi Lai , 1, * ,
  • Hailin Wang 1, 2
Expand
  • 1 Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
  • 2 Hangzhou Institute for Advance Study, University of Chinese Academy of Sciences, Hangzhou 310024, China

Received date: 2024-03-04

  Revised date: 2024-03-30

  Online published: 2024-04-16

Supported by

National Natural Science Foundation of China(22234008)

National Natural Science Foundation of China(21927807)

National Natural Science Foundation of China(22021003)

National Natural Science Foundation of China(22274166)

Abstract

During many life processes such as replication,transcription,double-strand breaks repair and so on,double-stranded DNA will temporarily unwind and form single strand DNA(ssDNA).ssDNA may affect genomic stability and may also participate in the formation of non-B DNA structure,which in turn regulates and influences many key cellular processes.This review briefly describes the causes of the formation of single-stranded DNA,the structures containing single-stranded DNA and their possible functions in cells,and summarizes some high-throughput analysis techniques of single-stranded DNA,which provides the method inspiration for the subsequent ssDNA research and promotes the further development of ssDNA analysis techniques and methods。

Contents

1 Overview of ssDNA

2 Formation and function of ssDNA

3 ssDNA sequencing methods

3.1 ssDNA-seq

3.2 KAS-seq

3.3 DRIP-seq

3.4 R-ChIP

3.5 SMRF-seq

3.6 MapR

3.7 G4 ChIP-seq

3.8 G4 CUT&Tag

4 Conclusion and outlook

Cite this article

Ruiqi Li , Weiyi Lai , Hailin Wang . Intracellular Single Strand DNA and High-Throughput Analysis Techniques[J]. Progress in Chemistry, 2024 , 36(9) : 1283 -1290 . DOI: 10.7536/PC240304

1 Overview of single-stranded DNA

DNA in eukaryotes usually exists in a double helix structure,and two deoxynucleotide chains are connected by hydrogen bonds according to the principle of complementary base pairing,forming a stable B-type double helix structure(two complementary double helix chains are antiparallel,with the same central longitudinal axis,spiraling from the right to the top).However,during replication,transcription,and DNA repair,double-stranded DNA may be temporarily unwound into two Single-stranded DNA(ssDNA).single-stranded DNA is related to some non-B-type structures,such as R-loop,G-quadruplex,H-DNA,Slipped-strand,hairpin structure,cruciform,etc[1][2][3][4][5][6]。 These single-stranded DNAs and non-B-type structures containing single-stranded DNAs in cells regulate many important cellular processes.Therefore,high-throughput sequencing of intracellular ssDNA is an important means to understand the location of ssDNA in the genome and to carry out functional studies。

2 Formation and function of ssDNA

ssDNA can arise from normal biological processes.In the process of DNA replication,transcription and repair,dsDNA is partially melted.If the temporarily formed ssDNA has a special sequence,it may form a non-B-type structure and produce a relatively stable ssDNA structure.For example,the R-loop is formed by the combination of the new RNA and the template DNA strand during transcription,thus replacing the non-template DNA strand.the local three-stranded structure is composed of the replaced single-stranded DNA and an RNA-DNA hybrid(Fig.1A).the RNA-DNA hybrid structure is often more stable than the double-stranded DNA,so the R-loop can exist stably[7,8]。 G-quadruplex is usually folded by DNA sequences rich in tandem repeats.G-quartet is the basic structural unit of G-quadruplex,which is a square planar structure composed of four guanines,and each guanine is connected with two adjacent guanines through Hoogsteen hydrogen bond interaction(Fig.1B).Hoogsteen hydrogen bond is a special type of hydrogen bond,which is different from the common Watson-Crick hydrogen bond in the relative position,orientation and special pairing between bases.It connects the bases in a non-classical way and helps to maintain the stability of DNA with specific structure[9]。 A stack of multiple G-tetrad planes forms a G-quadruplex,and the sequences between them are squeezed into a single-stranded loop[10]; At that same time,when the G-quadruplex is compose of one strand of dsDNA,the complementary strand cannot pair with it to form a single strand(Fig.1B).H-DNA is an intramolecular triplex,with the third strand folded inward by the rotation of the mirror repeat in one strand of the double-stranded DNA molecule,resulting in a triplex structure,and the other strand unpaired,single-stranded DNA(fig.1C)[11]。 in the triplex structure of H-DNA,purines In the double strand can form Hoogsteen hydrogen bonds with the third strand bases through rotation around the glycosidic bond and base flipping,thus maintaining the stability of the structure[12]。 A slip-strand structure is formed when the direct repeat base pairs with the complementary strand in a misaligned manner(Fig.1D),and after DNA unwinding,a hairpin or exocyclic base may be generated[13]。 the inverted repeats can form hairpin structures(Fig.1E)and cruciform structures(Fig.1F).In this structure,sequences equidistant from the center of symmetry are complementary to each other,thus forming an intrachain hairpin,with the nonrepetitive sequence at the center of the structure protruding to form a single-stranded loop.the hairpin structure is formed when the single-stranded DNA inverted repeat sequence folds back on itself to form an intrastrand pair[14]。 Similarly,inverted repeats in double-stranded DNA may trigger two intrastrand hairpins,forming a cruciform structure[15]
图1 细胞内含单链的DNA结构示意图:(A)R环;(B)G-四链体;(C)H-DNA;(D)滑链;(E)发卡结构;(F)十字型结构

Fig. 1 Schematic representation of the structure of DNA containing a single strand. (A) R-loop; (B) G-quadruplex; (C) H-DNA; (D) Slipped-strand ;(E) Hairpin structure; (F) Cruciform

single-stranded DNA is produced in many cellular processes,and the structure containing Single-stranded DNA will in turn affect some important cellular physiological processes,including the transcription of genetic information,DNA replication,homologous recombination repair,meiosis and so on[16~18]
R-loops can play a role in transcriptional activation and termination in mammals.R-loops of different lengths formed under different conditions help recruit specific chromatin remodeling factors to regulate transcription.Many ncRNAs may also assist transcriptional control by changing chromatin status through R-loops[19]。 When R-loops accumulate during transcription,SOSS-INTAC exerts RNA endonuclease action to induce termination of transcription near the promoter,possibly through SSB1-mediated single-stranded DNA recognition,thereby preventing genomic instability induced by R-loop accumulation[20]。 In addition,G-quadruplex structures may be formed during transcription,which may either promote or inhibit transcription at or near the promoter region.If the G-quadruplex motif is located on the template strand,it blocks the transcription mechanism and thus inhibits transcription[21]。 If the G-quadruplex motif is located on the non-template strand,it may prevent annealing of the transcription template to the complementary strand to maintain the single-stranded conformation of the transcribed strand to facilitate transcription[10]。 Negative supercoiling generated by the transcription process may induce the formation of H-DNA,and in some cases,nascent purine-rich RNA binding to single-stranded DNA in the H-DNA structure can further stabilize the formation of H-DNA,which may inhibit or block the transcription process when it is formed[11]。 Hairpin or cruciform structures affect the helical state of DNA and may promote or prevent DNA-protein interactions,thereby affecting the transcription process.cruciform overhang reduces local supercoiling of DNA,and local changes in supercoiling density can affect promoter activity,so cruciform overhang of promoter regions may reduce its activity[22]
For DNA replication,in the initial stage of replication,RNA will act as a primer to participate in complementary pairing with template DNA as an origin of replication to form DNA:RNA hybrids,but the length of DNA:RNA hybrids produced by DNA replication is shorter,which is different from the more stable R-loop[23]。 G-quadruplexes are easily produced in the lagging strand of DNA replication,and when replication is slowed down,G-quadruplexes are more easily formed,and replication will not continue after the formation of G-quadruplex,and it is likely that helicases are needed to unwind the G-quadruplex structure[10]。 in addition,simultaneous DNA replication and gene transcription in eukaryotic and prokaryotic cells may occur"conflict",that is,when DNA replication and gene transcription occur in opposite directions or in the same direction,R-loops formed during transcription may affect DNA replication[24]。 Because R-loops block DNA replication forks,they may cause DNA damage,such as DNA replication fork breaks,DNA double-strand breaks and single-strand DNA gaps(ssDNA gaps)[25,26]。 When these DNA damages occur,repair mechanisms such as homologous recombination are generally activated to repair DNA damage in time to avoid adverse effects on genome stability such as gene mutation[17,27]。 R-loops can trigger homologous recombination,and the occurrence of this type of homologous recombination associated with gene transcription(Transcription associated recombination,TAR)may prevent cells from Transcription-induced genomic instability[24,28]。 in studies of S.cerevisiae and C.elegans,it was found that R-loops can form in meiosis and negatively affect meiotic replication and genome stability,while THO and THSC/TREX-2,which can prevent the formation of R-loops in meiosis,play an important role in the reproductive inheritance of organisms[29]
In addition to affecting cellular physiological processes,single-stranded DNA is vulnerable to nucleases,chemical agents,and inappropriate protein binding,affecting genome stability[18]。 single-stranded DNA may also be a target for the action of specific mutagens or enzymes,such as AID/APOBEC cytosine deaminase,AID(activation-induced deaminase)expressed predominantly in B cells,Its main function is to convert cytosine(C)in Single-stranded DNA to uracil(U)by deamination,and to mediate somatic hypermutation and immunoglobulin class recombination switch with APOBEC(polypeptide-like apolipoprotein B that catalyzes mRNA editing)[30,31]。 ssDNA is more susceptible to mutagenic DNA damage and mutation than dsDNA,and therefore,single-stranded DNA regions need to be adequately protected to avoid loss of genetic information[32]。 When ssDNA is generated,it is usually rapidly wrapped by single-stranded DNA-binding proteins such as Replication protein A(RPA)complex.This not only protects ssDNA from nuclease degradation,but also coordinates the activation of DNA damage checkpoint response and DNA repair,while RPA also prevents the formation of DNA secondary structure[33][34][35]。 However,because RPA binding to ssDNA is dynamic,RPA is unable to fully protect the bases of ssDNA from chemical damage[36]。 Nucleases such as Yen1/GEN1 and SLX1/SLX4 can recognize and cleave cruciform structures,and are also recruited to cut fragile sites containing repetitive sequences,resulting in double-strand breaks and genomic instability[37]。 the slip-strand structure may produce mismatch repair in The process of DNA damage repair,leading to genome instability[38]
Single-stranded DNA is also associated with many human diseases and cancers.the H-DNA formed in the human c-MYC gene promoter coincides with a breakpoint associated with c-MYC-induced leukemia,and the H-DNA in the human PKD1 gene may increase the mutation rate of the gene and lead to the genetic disease Autosomal dominant polycystic kidney disease(ADPKD)[39,40][41,42]。 Transcription of repeats such as CTG and GAA may lead to the formation of R-loops and promote the instability of the repeats themselves,resulting in their expansion or contraction[43][44,45]。 nervous system diseases such as Friedreich ataxia(FRDA)and fragile X syndrome are closely related to the genetic instability of trinucleotide repeat expansion,so R-loop may be a potential factor inducing these Nervous system diseases[43][46][47]。 the clearance of intracellular R-loops depends on RNase H,which specifically recognizes the DNA-RNA hybrid in the R-loop and excises the RNA strand in it.When the activity of RNase H1 is reduced or RNase H1 and RNase H2 are knocked out,the instability of CAG and other repetitive sequences will increase.in addition to RNase H,R-loops can also be unwound by DNA-RNA helicases such as AQR and SETX,which are mutated in two neurodegenerative diseases and are closely related to the accumulation of R-loops[44][48,49]
genomic instability and replication stress are symptomatic hallmarks of tumor cells,and R-loop accumulation can lead to Genomic instability and replication stress,so R-loops are potential drivers of cancer[50]。 R-loops are linked to cancer through the DNA Double strand break(DSB)repair genes BRCA1 and BRCA2.When cells knock out BRCA1 or BRCA2,R-loop accumulation and DSB increase,which is partially reduced by overexpression of RNase H1[51,52]。 Upon R-loop dysregulation,a portion of the nuclear R-loop that cannot be resolved is processed by XPG,resulting in cytoplasmic accumulation of DNA-RNA hybrids that are subsequently recognized by cytoplasmic cGAS and TLR3,activating IRF3-mediated immune signaling and apoptosis[53]。 Burkitt lymphoma is caused by the translocation between the MYC proto-oncogene and the immunoglobulin S region.in mice,AID is important for the generation of DNA double-strand breaks In the GC-rich S region and the transcribed Myc region,and this process may form R-loops,so R-loops may be the cause of Burkitt lymphoma[19]
G-quadruplexes have been implicated in human leukemia.In some leukemia patients,the expression of Aven protein is increased.MLL1 and MLL4 are two important genes related to leukemia.In the coding region of mRNA,Aven protein can bind to the G-quadruplex structure of RGG/RG region,promote the expression of MLL1 and MLL4,and induce the transcription of leukemia genes.Loss of Aven protein results in decreased synthesis of MLL1 and MLL4 proteins,leading to decreased proliferation of leukemic cells[54]。 G-quadruplex is also closely related to neurodegenerative diseases such as amyotrophic lateral sclerosis and frontotemporal dementia.the formation of G-quadruplex structure will block the movement of RNA polymerase II during transcription and stop transcription.This not only leads to the loss of normal protein products,but also to incomplete transcript fragments,which can fold into G-quadruplex structures by themselves,which may hinder RNA-binding proteins and lead to nucleolar stress response to damage cells.It may also escape from the nucleus and bind to ribosomal complexes to be translated into dipeptide repeat proteins to destroy cells[55]

3 ssDNA structure sequencing method

Among the existing single-stranded DNA sequencing technologies,only some methods can be used to sequence ssDNA structures,such as ssDNA-seq,KAS-seq,etc.,and most of the methods are used to sequence and locate specific non-B-type structures(mainly R-loops and G-quadruplexes)containing single-stranded DNA,such as DRIP-seq,R-ChIP,MapR,SMRF-seq[56]

3.1 ssDNA-seq

ssDNA-seq uses potassium permanganate Mung Bean Nuclease to treat living cells.The KMnO4prefers to oxidize the thymine C5-C6 double bond of ssDNA,and the oxidized products can not re-pair with the complementary strand,thus maintaining the stability of the intracellular single-stranded structure.Therefore,ssDNA-seq is easily digested by Mung Bean Nuclease(Mung Bean Nuclease),a single-stranded specific Nuclease,resulting in DNA fragmentation.Subsequently,the broken DNA ends were labeled with Biotin using terminal deoxynucleotidyl transferase(TdT).Genomic DNA was sonicated,and the labeled fragments were enriched using biotin-streptavidin interaction,followed by high-throughput sequencing(Figure 2 )[57]。 the disadvantage of this method is that the use of potassium permanganate oxidation can cause damage to the genome of living cells and may affect the activity of other enzymes[57]
图2 ssDNA-seq原理示意图

Fig. 2 Schematic diagram of ssDNA-seq

3.2 KAS-seq

KAS-seq is a single-stranded DNA high-throughput sequencing method based on the rapid and specific reaction of azido-ethoxybutanone(N3-kethoxal)a nd guanine in single-stranded DNA(Fig.3).N3-kethoxal can specifically label guanine on single-stranded DNA,which can specifically react with guanine in single-stranded DNA in living cells within 5 min at 37℃,and then Biotinylation is carried out after modification of N3-Kethoxal,and then the labeled single-stranded DNA is enriched by biotin-streptavidin interaction.The N3-kethoxal modification can be removed for a short time at 95°C without affecting PCR amplification,after which high-throughput sequencing is performed by library construction[58]。 KAS-seq can maintain high sensitivity in small cell samples and is suitable for studying very small cell samples due to the characteristics of N3-kethoxal that can react with guanine and the high affinity between biotin and streptavidin[58]
图3 KAS-seq原理示意图

Fig. 3 Schematic diagram of KAS-seq

3.3 DRIP-seq

DRIP-seq(DNA:RNA hybrid immunoprecipitation and sequencing)is a co-immunoprecipitation and high-throughput sequencing analysis technology based on the specific recognition of DNA:RNA hybrid strand by S9.6 monoclonal antibody.DRIP-seq is the most widely used genome-wide R-loop capture technology,which uses S9.6 monoclonal antibody to specifically recognize DNA:RNA hybrids,capture chromatin fragments by co-immunoprecipitation,and then sequence the recovered R-loop fragments containing DNA:RNA hybrids[59,60]。 Methodological improvements to improve resolution,specificity,or sensitivity have also been made:bisDRIP-seq recognizes R-loop-associated single-stranded DNA by combining DRIP and bisulfite footprinting[61]; S1-DRIP-seq utilizes nuclease S1 to remove non-template single-stranded DNA in the R-loop,preventing it from re-annealing into template DNA during co-immunoprecipitation[62]。 RDIP-seq is pretreated with RNase I and then co-immunoprecipitated to enrich R-loops.RNase I can remove single-stranded RNA that does not form secondary structure and prevent it from forming double-stranded RNA or annealing with genomic DNA to form DNA:RNA hybrids that bind to S9.6 antibody[63]。 After co-immunoprecipitation,the RNA was released by DNase I treatment and then reverse transcribed into cDNA for high-throughput sequencing(DRIPc-seq)[64]
the S9.6 antibody recognizes DNA:RNA hybrids in a sequence-independent manner and also binds nonspecifically to double-stranded RNA,resulting in false positive signals,so there are limitations to The S9.6 antibody-based approach[65][66,67]

3.4 R-ChIP

R-ChIP is a method to detect R-loop in vivo based on RNase H with chromatin immunoprecipitation(ChIP).RNase H can recognize DNA:RNA hybrids and degrade the RNA strands therein,while limiting or preventing R-loop accumulation within the cell[68]。 A V5-tagged RNase H1 mutant(still capable of recognizing DNA:RNA hybrids but without RNase activity)expression vector was constructed by R-ChIP and transferred into cells,and cells stably expressing the mutant protein were obtained by drug selection.Cell-expressed RNase H1 mutants bind to DNA:RNA hybrids,and nuclei are subsequently dissected out and fragmented by sonication.Chromatin immunoprecipitation(ChIP)was performed using the V5 tag,and the DNA:RNA hybrids were purified and recovered,and finally the library was established for sequencing[69,70]。 Because RNase H mutants can specifically recognize DNA:RNA hybrids,R-ChIP has a high specificity for the recognition and capture of R-loops.However,it takes a long time to obtain cells stably expressing RNase H mutants,and the expression level of RNase H1 mutants needs to be controlled,which may affect the normal physiological process of cells.If it is too low,the binding of non-mutated normal endogenous RNase H1 to DNA:RNA hybrid may affect the efficiency of R-ChIP,which is not suitable for animal tissues or primary cells,so it is limited in the scope of use[69][66]

3.5 SMRF-seq

single-Molecule R loop Footprinting and sequencing(SMRF-seq)relies on DNA to convert the unpaired cytosine C of Single-stranded DNA in the R-loop structure to uracil U under non-denaturing conditions with bisulfite.After site-specific PCR amplification and library construction,the DNA sequence amplified from the labeled Single strand contained a cytosine to thymine(T)transition site,while the DNA sequences on either side of it were not converted.These sites represent R-loop"footprints".Using SMRT-seq on PCR products allows sequencing analysis of a collection of individual R-loop footprints at high coverage[71]。 SMRF-seq processing is highly efficient and enables the mapping and sequencing of R-loops with single-molecule resolution and high coverage on strands of thousands of bases in length.However,the SMRF-seq method requires DNA extraction,and some short and unstable R-loops may be destroyed during DNA fragmentation in vitro,resulting in low R-loop content[72]

3.6 MapR

MapR is a method based on RNase H recognition,Micrococcal nuclease(MNase)digestion,and CUT&RUN technology to detect R-loops in the whole genome.Cells are first fixed and permeabilized,and the catalytically inactive RNase H(RH RH∆)-MNase fusion protein(RH RH∆-MNase)diffuses into the nucleus in the absence of calcium ions,which are divalent metal ions necessary for the activation of the Nuclease MNase,which is inactive and does not cleave DNA.RH RH∆can bind to chromatin fragments containing R loops.After the addition of calcium ions,MNase cleaves the nucleic acid at both ends of the RH RH∆binding site to release the fragments bound to RH RH∆.After the reaction,EGTA,a calcium chelator,is added to stop the reaction.Finally,the released nucleic acid fragments were recovered,purified,and subjected to high-throughput sequencing[73]。 MapR is an antibody-independent,efficient and convenient R-loop sequencing method for various cell types,and can also effectively detect R-loops in a small number of cell samples.This method cannot distinguish whether the DNA in the DNA:RNA hybrid is a positive strand or a negative strand,and cannot provide information about a single strand[73]

3.7 G4 ChIP-seq

G4 ChIP-seq is a method to detect intracellular G-quadruplexes based on the G-quadruplex-specific antibody BG4 with chromatin immunoprecipitation(ChIP).Nuclei were first isolated and sonicated to fragment chromatin,which was then treated with RNase A to remove RNA G-quadruplexes and DNA-RNA hybrid G-quadruplexes.Subsequently,G-quadruplex specific antibody BG4 was used for co-immunoprecipitation,and the immunoprecipitated DNA was eluted and purified to recover the labeled DNA fragment by reverse cross-linking,followed by PCR amplification and library sequencing[74]。 Various experimental procedures(such as chromatin fragmentation and immunoprecipitation)may lead to the disruption of G-quadruplex integrity[75]

3.8 G4 CUT&Tag

G4 CUT&Tag is a technique for locating and sequencing intracellular G-quadruplexes under mild conditions using CUT&Tag technology.FLAG-tagged BG4 antibody was used to bind to the intracellular G-quadruplex,followed by the addition of anti-FLAG antibody and secondary antibody,and the addition of Protein A-fused Tn5 transposase for incubation to immobilize the enzyme at the antibody-bound site to complete the labeling of the G-quadruplex.After that,Mg2+was added to activate Tn5 transposase to fragment the target DNA and insert the sequencing adapter,followed by high-throughput sequencing.Like ChIP,this method requires antibodies to recognize and bind to the target,while this method uses Tn5 transposase and Protein A to direct the enzyme to the antibody bound to the target chromatin.The method has the advantages of less background signal,high resolution and high repeatability[75,76]

4 Conclusion and prospect

ssDNA exists widely in organisms and plays an important role in maintaining the normal function of cells and the transmission of genetic information.with the deepening of research,its importance has become increasingly prominent.in this paper,the types of intracellular ssDNA,the causes of its formation,its biological functions,its relationship With diseases,and its high-throughput analysis techniques are described.However,due to the limitation of ssDNA high-throughput analysis technology,the distribution and function of ssDNA in the genome still have problems and challenges.Accurate analysis of the distribution of ssDNA across the genome is critical to understanding highly dynamic transcriptional events,DNA replication and repair,and other processes。
in terms of methodology,the following aspects can be improved and attempted in future studies:establishing a new mild ssDNA labeling method;Reduce the damage of ssDNA structure caused by DNA treatment;Improve method resolution.Methodological advances will also further reveal the biological functions of the above ssDNA and help people understand the role of ssDNA in disease。
[1]
Shibata T, Iwasaki W, Hirota K. Comput. Struct. Biotechnol. J., 2020, 18: 3350.

[2]
Mathad R I, Yang D Z. Telomeres and Telomerase: Methods and Protocols, Zhou S Y, (Ed.), 2011. 77.

[3]
Wang G L, Vasquez K M. Proc. Natl. Acad. Sci. U. S. A., 2004, 101(37): 13448.

[4]
Guiblet W M, Cremona M A, Harris R S, Chen D, Eckert K A, Chiaromonte F, Huang Y F, Makova K D. Nucleic Acids Res., 2021, 49(3): 1497.

[5]
Bacolla A, Tainer J A, Vasquez K M, Cooper D N. Nucleic Acids Res., 2016, 44(12): 5673.

[6]
Boyer A S, Grgurevic S, Cazaux C, Hoffmann J S. J. Mol. Biol., 2013, 425(23): 4767.

[7]
Petermann E, Lan L, Zou L. Nat. Rev. Mol. Cell Bio., 2022, 23: 521.

[8]
García-Muse T, Aguilera A. Cell, 2019, 179(3): 604.

[9]
Nikolova E N, Kim E, Wise A A, O’Brien P J, Andricioaei I, Al-Hashimi H M. Nature, 2011, 470(7335): 498.

[10]
Bochman M L, Paeschke K, Zakian V A. Nat. Rev. Genet., 2012, 13(11): 770.

[11]
Jain A, Wang G L, Vasquez K M. Biochimie, 2008, 90(8): 1117.

[12]
Hoogsteen K. Acta Crystallogr., 1959, 12(10): 822.

[13]
Sinden R R, Pytlos-Sinden M J, Potaman V N. Front. Biosci., 2007, 12(12): 4788.

[14]
Smith G R. Genes Dev., 2008, 22(19): 2612.

[15]
Brázda V, Laister R C, Jagelská E B, Arrowsmith C. BMC Mol. Biol., 2011, 12(1): 33.

[16]
Zhao J H, Bacolla A, Wang G L, Vasquez K M. Cell. Mol. Life Sci., 2010, 67(1): 43.

[17]
Gottipati P, Helleday T. Mutagenesis, 2009, 24(3): 203.

[18]
Richard D J, Bolderson E, Khanna K K. Crit. Rev. Biochem. Mol. Biol., 2009, 44(2-3): 98.

[19]
Santos-Pereira J M, Aguilera A. Nat. Rev. Genet., 2015, 16(10): 583.

[20]
Xu C L, Li C Y, Chen J W, Xiong Y, Qiao Z B, Fan P Y, Li C H, Ma S Y, Liu J, Song A X, Tao B L, Xu T, Xu W, Chi Y Y, Xue J Y, Wang P, Ye D, Gu H Z, Zhang P, Wang Q, Xiao R J, Cheng J D, Zheng H, Yu X L, Zhang Z, Wu J, Liang K W, Liu Y J, Lu H S, Chen F X. Nature, 2023, 621(7979): 610.

[21]
Duquette M L, Handa P, Vincent J A, Taylor A F, Maizels N. Genes Dev., 2004, 18(13): 1618.

[22]
Bikard D, Loot C, Baharoglu Z, Mazel D. Microbiol. Mol. Biol. Rev., 2010, 74(4): 570.

[23]
Aguilera A, García-Muse T. Mol. Cell, 2012, 46(2): 115.

[24]
Pan X, Jiang N, Chen X, Zhou X, Ding L, Duan F. Hereditas, 2014, 36(12): 1185.

[25]
Helmrich A, Ballarino M, Tora L. Mol. Cell, 2011, 44(6): 966.

[26]
Paulsen R D, Soni D V, Wollman R, Hahn A T, Yee M C, Guan A N, Hesley J A, Miller S C, Cromwell E F, Solow-Cordero D E, Meyer T, Cimprich K A. Mol. Cell, 2009, 35(2): 228.

[27]
Mortusewicz O, Herr P, Helleday T. EMBO J., 2013, 32(4): 493.

[28]
Bermejo R, Lai M, Foiani M. Mol. Cell, 2012, 45(6): 710.

[29]
Castellano-Pozo M, García-Muse T, Aguilera A. EMBO Rep., 2012, 13(10): 923.

[30]
Chaudhuri J, Alt F W. Nat. Rev. Immunol., 2004, 4(7): 541.

[31]
Pecori R, Di Giorgio S, Paulo Lorenzo J, Nina Papavasiliou F. Nat. Rev. Genet., 2022, 23(8): 505.

[32]
Aguilera A. EMBO J., 2002, 21(3): 195.

[33]
Wold M S. Annu. Rev. Biochem., 1997, 66: 61.

[34]
Zou L E, Elledge S J. Science, 2003, 300(5625): 1542.

[35]
Chen H, Lisby M, Symington L. Mol. Cell, 2013, 50(4): 589.

[36]
Anindya R. DNA Repair, 2020, 87: 102804.

[37]
Minocherhomji S, Hickson I D. Trends Cell Biol., 2014, 24(5): 321.

[38]
Nakanishi K, Shima A, Fukuda M, Fujita S. Mech. Ageing Dev., 1979, 10(3-4): 273.

[39]
Kinniburgh A J. Nucleic Acids Res., 1989, 17(19): 7771.

[40]
Saglio G, Borrello M G, Guerrasio A, Sozzi G, Serra A, Di Celle P F, Foa R, Ferrarini M, Roncella S, Pignatti C B, Marradi P, Izzo P, Soler J, Pierotti M. Genes Chromosom. Cancer, 1993, 8(1): 1.

[41]
Blaszak R T, Potaman V, Sinden R R, Bissler J J. Nucleic Acids Res., 1999, 27(13): 2610.

[42]
Watnick T J, Piontek K B, Cordal T M, Weber H, Gandolph M A, Qian F, Lens X M, Neumann H P H, Germino G G. Hum. Mol. Genet., 1997, 6(9): 1473.

[43]
Grabczyk E, Mancuso M, Sammarco M C. Nucleic Acids Res., 2007, 35(16): 5351.

[44]
Lin Y F, Dent S Y R, Wilson J H, Wells R D, Napierala M. Proc. Natl. Acad. Sci. U. S. A., 2010, 107(2): 692.

[45]
Reddy K, Tam M, Bowater R P, Barber M, Tomlinson M, Nichol Edamura K, Wang Y H, Pearson C E. Nucleic Acids Res., 2011, 39(5): 1749.

[46]
Loomis E W, Sanz L A, Chédin F, Hagerman P J. PLoS Genet., 2014, 10(4): e1004294.

[47]
McIvor E I, Polak U, Napierala M. RNA Biol., 2010, 7(5): 551.

[48]
Hasanova Z, Klapstova V, Porrua O, Stefl R, Sebesta M. Nucleic Acids Res., 2023, 51(6): 2818.

[49]
Gatti V, De Domenico S, Melino G, Peschiaroli A. Cell Death Discov., 2023, 9: 145.

[50]
Negrini S, Gorgoulis V G, Halazonetis T D. Nat. Rev. Mol. Cell Biol., 2010, 11(3): 220.

[51]
Belotserkovskii B P, Mirkin S M, Hanawalt P C. Chem. Rev., 2013, 113(11): 8620.

[52]
Hill S J, Rolland T, Adelmant G, Xia X F, Owen M S, Dricot A, Zack T I, Sahni N, Jacob Y, Hao T, McKinney K M, Clark A P, Reyon D, Tsai S Q, Joung J K, Beroukhim R, Marto J A, Vidal M, Gaudet S, Hill D E, Livingston D M. Genes Dev., 2014, 28(17): 1957.

[53]
Crossley M P, Song C L, Bocek M J, Choi J H, Kousouros J N, Sathirachinda A, Lin C, Brickner J R, Bai G S, Lans H, Vermeulen W, Abu-Remaileh M, Cimprich K A. Nature, 2023, 613(7942): 187.

[54]
Thandapani P, Song J W, Gandin V, Cai Y T, Rouleau S G, Garant J M, Boisvert F M, Yu Z B, Perreault J P, Topisirovic I, Richard S. eLife, 2015, 4: 06234.

[55]
Haeusler A R, Donnelly C J, Periz G, Simko E A J, Shaw P G, Kim M S, Maragakis N J, Troncoso J C, Pandey A, Sattler R, Rothstein J D, Wang J O. Nature, 2014, 507(7491): 195.

[56]
Chambers V S, Marsico G, Boutell J M, Di Antonio M, Smith G P, Balasubramanian S. Nat. Biotechnol., 2015, 33(8): 877.

[57]
Kouzine F, Wojtowicz D, Yamane A, Resch W, Kieffer-Kwon K R, Bandle R, Nelson S, Nakahashi H, Awasthi P, Feigenbaum L, Menoni H, Hoeijmakers J, Vermeulen W, Ge H, Przytycka T, Levens D, Casellas R. Cell, 2013, 153(5): 988.

[58]
Wu T, Lyu R T, You Q C, He C. Nat. Meth., 2020, 17(5): 515.

[59]
Boguslawski S J, Smith D E, Michalak M A, Mickelson K E, Yehle C O, Patterson W L, Carrico R J. J. Immunol. Meth., 1986, 89(1): 123.

[60]
Ginno P, Lott P, Christensen H, Korf I, Chédin F. Mol. Cell, 2012, 45(6): 814.

[61]
Dumelie J G, Jaffrey S R. eLife, 2017, 6: 28306.

[62]
Wahba L, Costantino L, Tan F J, Zimmer A, Koshland D. Genes Dev., 2016, 30(11): 1327.

[63]
Nadel J, Athanasiadou R, Lemetre C, Wijetunga N A, Ó Broin P, Sato H, Zhang Z D, Jeddeloh J, Montagna C, Golden A, Seoighe C, Greally J M. Epigenet. Chromatin, 2015, 8(1): 46.

[64]
Sanz L A, Chédin F. Nat. Protoc., 2019, 14(6): 1734.

[65]
Phillips D D, Garboczi D N, Singh K, Hu Z L, Leppla S H, Leysath C E. J. Mol. Recognit., 2013, 26(8): 376.

[66]
Vanoosthuyse V. Non Coding RNA, 2018, 4(2): 9.

[67]
Hartono S R, Malapert A, Legros P, Bernard P, Chédin F, Vanoosthuyse V. J. Mol. Biol., 2018, 430(3): 272.

[68]
Cerritelli S M, Crouch R J. FEBS J., 2009, 276(6): 1494.

[69]
Chen J Y, Zhang X, Fu X D, Chen L. Nat. Protoc., 2019, 14(5): 1661.

[70]
Chen L, Chen J Y, Zhang X, Gu Y, Xiao R, Shao C W, Tang P, Qian H, Luo D J, Li H R, Zhou Y, Zhang D E, Fu X D. Mol. Cell, 2017, 68(4): 745.

[71]
Malig M, Chedin F. RNA-Chromatin Interactions: Methods and Protocols, Orom U A V(Ed.), 2020. 209.

[72]
Malig M, Hartono S R, Giafaglione J M, Sanz L A, Chedin F. J. Mol. Biol., 2020, 432(7): 2271.

[73]
Yan Q Q, Shields E J, Bonasio R, Sarma K. Cell Rep., 2019, 29(5): 1369.

[74]
Hänsel-Hertsch R, Spiegel J, Marsico G, Tannahill D, Balasubramanian S. Nat. Protoc., 2018, 13(3): 551.

[75]
Lyu J, Shao R, Kwong Yung P Y, Elsässer S J. Nucleic Acids Res., 2022, 50(3): e13.

[76]
Kaya-Okur H S, Wu S J, Codomo C A, Pledger E S, Bryson T D, Henikoff J G, Ahmad K, Henikoff S. Nat. Commun., 2019, 10: 1930.

Outlines

/