[内容简介]
Tag-based approaches were originally designed to increase the throughput of capillary sequencing, where concatemers of short sequences were first used in expression profiling. New Next Generation Sequencing methods largely extended the use of tag-based approaches as the tag lengths perfectly match with the short read length of highly parallel sequencing reactions. Tag-based approaches will maintain their important role in life and biomedical science, because longer read lengths are often not required to obtain meaningful data for many applications. Whereas genome re-sequencing and de novo sequencing will benefit from ever more powerful sequencing methods, analytical applications can be performed by tag-based approaches, where the focus shifts from 'sequencing power' to better means of data analysis and visualization for common users. Today Next Generation Sequence data require powerful bioinformatics expertise that has to be converted into easy-to-use data analysis tools. The book's intention is to give an overview on recently developed tag-based approaches along with means of their data analysis together with introductions to Next-Generation Sequencing Methods, protocols and user guides to be an entry for scientists to tag-based approaches for Next Generation Sequencing.
[目录]
Preface XIX
List of Contributors XXI
Part One Tag-Based Nucleic Acid Analysis 1
1 DeepSuperSAGE: High-Throughput Transcriptome Sequencing with Now- and Next-Generation Sequencing Technologies 3
Hideo Matsumura, Carlos Molina, Detlev H. Kruger, Ryohei Terauchi, and Gunter Kahl
1.1 Introduction 3
1.2 Overview of the Protocols 5
1.3 Methods and Protocols 9
1.4 Applications 14
1.5 Perspectives 19
References 20
2 DeepCAGE: Genome-Wide Mapping of Transcription Start Sites 23
Matthias Harbers, Mitchell S. Dushay, and Piero Carninci
2.1 Introduction 23
2.2 What is CAGE? 24
2.3 Why CAGE? 26
2.4 Methods and Protocols 28
2.5 Applications 43
2.6 Perspectives 44
References 45
3 Definition of Promotome–Transcriptome Architecture Using CAGEscan 47
Nicolas Bertin, Charles Plessy, Piero Carninci, and Matthias Harbers
3.1 Introduction 47
3.2 What is CAGEscan? 48
3.3 Why CAGEscan? 50
3.4 Methods and Protocols 51
3.5 Applications and Perspectives 59
References 61
4 RACE: New Applications of an Old Method to Connect Exons 63
Charles Plessy
4.1 Introduction 63
4.2 Deep-RACE 65
4.3 Methods Outline 67
4.4 Perspectives 70
References 71
5 RNA-PET: Full-Length Transcript Analysis Using 50- and 30-Paired-End Tag Next-Generation Sequencing 73
Xiaoan Ruan and Yijun Ruan
5.1 Introduction 73
5.2 Methods and Protocols 75
5.3 Applications 88
5.4 Perspectives 90
References 90
6 Stranded RNA-Seq: Strand-Specific Shotgun Sequencing of RNA 91
Alistair R.R. Forrest
6.1 Introduction 91
6.2 Methods and Protocols 93
6.3 Bioinformatic Considerations 103
6.4 Applications 104
6.5 Perspectives 105
References 107
7 Differential RNA Sequencing (dRNA-Seq): Deep-Sequencing-Based Analysis of Primary Transcriptomes 109
Anne Borries, Jorg Vogel, and Cynthia M. Sharma
7.1 Introduction 109
7.2 What is dRNA-Seq? 111
7.3 Why dRNA-Seq? 112
7.4 Methods and Protocols 115
7.5 Applications 119
7.6 Perspectives 120
References 121
8 Identification and Expression Profiling of Small RNA Populations Using High-Throughput Sequencing 123
Javier Armisen, W. Robert Shaw, and Eric A. Miska
8.1 Introduction 123
8.2 HTS/NGS 127
8.3 Methods and Protocols 128
8.4 Troubleshooting 134
8.5 Applications 134
8.6 Perspectives 136
References 138
9 Genome-Wide Mapping of Protein–DNA Interactions by ChIP-Seq 139
Joshua W.K. Ho, Artyom A. Alekseyenko, Mitzi I. Kuroda, and Peter J. Park
9.1 Introduction 139
9.2 Methods and Protocols 141
9.3 Applications 147
9.4 Perspectives 150
References 151
10 Analysis of Protein–RNA Interactions with Single-Nucleotide Resolution Using iCLIP and Next-Generation Sequencing 153
Julian Konig, Nicholas J. McGlincy, and Jernej Ule
10.1 Introduction 153
10.2 Procedure Overview 154
10.3 Antibody and Library Preparation Quality Controls 155
10.4 Oligonucleotide Design 156
10.5 Recent Modifications of the iCLIP Protocol 158
10.6 Troubleshooting 158
10.7 Methods and Protocols 159
References 169
11 Massively Parallel Tag Sequencing Unveils the Complexity of Marine Protistan Communities in Oxygen-Depleted Habitats 171
Virginia Edgcomb and Thorsten Stoeck
11.1 Introduction 171
11.2 Cariaco Basin 173
11.3 Framvaren Fjord 176
11.4 Comparison of Cariaco Basin to Framvaren Fjord 177
11.5 Perspectives on Interpretation of Microbial Eukaryote 454 Data 179
References 182
12 Chromatin Interaction Analysis Using Paired-End Tag Sequencing (ChIA-PET) 185
Xiaoan Ruan and Yijun Ruan
12.1 Introduction 185
12.2 Methods and Protocols 192
12.3 Timeline 206
12.4 Anticipated Results 207
12.5 Perspectives 209
References 209
13 Tag-Seq: Next-Generation Tag Sequencing for Gene Expression Profiling 211
Sorana Morrissy, Yongjun Zhao, Allen Delaney, Jennifer Asano, Noreen Dhalla, Irene Li, Helen McDonald, Pawan Pandoh, Anna-Liisa Prabhu, Angela Tam, Martin Hirst, and Marco Marra
13.1 Introduction 211
13.2 Protocol Details 212
13.3 Protocol Overview and Timeline 213
13.4 Critical Parameters and Troubleshooting 214
13.5 Methods and Protocols 215
13.6 Applications 239
13.7 Perspectives 240
References 241
14 Isolation of Active Regulatory Elements from Eukaryotic Chromatin Using FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) 243
Paul G. Giresi and Jason D. Lieb
14.1 Introduction 243
14.2 Methods and Protocols 245
14.3 Applications 254
14.4 Perspectives 254
References 255
15 Identification of Nucleotide Variation in Genomes Using Next-Generation Sequencing 257
Hendrik-Jan Megens and Martien A.M. Groenen
15.1 Introduction 257
15.2 Methods 261
15.3 Notes 275
References 275
16 DGS (Ditag Genome Scanning) – A Restriction-Based Paired-End Sequencing Approach for Genome Structural Analysis 277
Jun Chen, Yeong C. Kim, and San Ming Wang
16.1 Introduction 277
16.2 Methods and Protocols 278
16.3 Applications 283
16.4 Perspectives 284
References 285
17 Next-Generation Sequencing of Bacterial Artificial Chromosome Clones for Next-Generation Physical Mapping 287
Robert Bogden, Keith Stormo, Jason Dobry, Amy Mraz, Quanzhou Tao, Michiel van Eijk, Jan van Oeveren, Marcel Prins, Jon Wittendorp, and Mark van Haaren
17.1 History of the Bacterial Artificial Chromosome Vector Systems 287
17.2 History of Physical Mapping 288
17.3 What is WGP? 289
17.4 Flow of a WGP Project 289
17.5 BAC Pooling Strategies 290
17.6 Methods and Protocols 291
17.7 Applications 294
17.8 Perspectives 296
References 297
18 HELP-Tagging: Tag-Based Genome-Wide Cytosine Methylation Profiling 299
Masako Suzuki and John M. Greally
18.1 Introduction 299
18.2 Genome-Wide DNA Methylation Analysis 299
18.3 What is HELP-Tagging? 300
18.4 Methods and Protocols 301
18.5 Applications 308
18.6 Perspectives 308
References 309
19 Second-Generation Sequencing Library Preparation: In Vitro Tagmentation via Transposome Insertion311
Fraz Syed
19.1 Introduction 311
19.2 Methods and Protocols 313
19.3 Perspectives 321
References 321
Part Two Next-Generation Tag-Based Sequencing 323
20 Moving Towards Third-Generation Sequencing Technologies 325
Karolina Janitz and Michal Janitz
20.1 Introduction 325
20.2 Differences Between NGS and Sanger Sequencing 326
20.3 Preparation of Templates for Sequencing 326
20.4 Real-Time Sequencing 327
20.5 Nanopore Sequencing 328
20.6 Ion Torrent Electronic Sequencing 329
20.7 Genome Enrichment 331
20.8 Advantages of NGS 331
20.9 Problem of Short Reads 333
20.10 Perspectives 335
References 335
21 Beyond Tags to Full-Length Transcripts 337
Mohammed Mohiuddin, Stephen Hutchison, and Thomas Jarvie
21.1 Introduction 337
21.2 Generation of Full-Length Transcriptomes 338
21.3 Methods 342
21.4 Applications 344
21.5 Perspectives 350
References 351
22 Helicos Single-Molecule Sequencing for Accurate Tag-Based RNA Quantitation 353
John F. Thompson, Tal Raz, and Patrice M. Milos
22.1 Introduction 353
22.2 Methods and Protocols 355
22.3 Applications 362
22.4 Perspectives 364
References 365
23 Total RNA-seq: Complete Analysis of the Transcriptome Using Illumina Sequencing-by-Synthesis Sequencing 367
Shujun Luo, Geoffrey P. Smith, Irina Khrebtukova, and Gary P. Schroth
23.1 Introduction 367
23.2 Total RNA-Seq 368
23.3 Methods and Protocols 369
23.4 Total RNA-Seq Data Collection and Interpretation 378
23.5 Applications 380
References 381
Part Three Bioinformatics for Tag-Based Technologies 383
24 Computational Infrastructure and Basic Data Analysis for Next-Generation Sequencing 385
David Sexton
24.1 Introduction 385
24.2 Background 386
24.3 Getting Started with the Next-Generation Manufacturers 387
24.4 Infrastructure and Data Analysis 388
24.5 Applications 392
24.6 Perspectives 392
25 CLC Bio Integrated Platform for Handling and Analysis of Tag Sequencing Data 393
Roald Forsberg, Søren Mønsted, and Anne-Mette Hein
25.1 Introduction 393
25.2 Main Components and Features 394
25.3 Applications 396
25.4 Perspectives 404
References 405
26 Multidimensional Context of Sequence Tags: Biological Data Integration 407
Korbinian Grote and Thomas Werner
26.1 Introduction 407
26.2 Methods and Strategies 408
26.3 Perspectives 414
References 415
27 Experimental Design and Quality Control of Next-Generation Sequencing Experiments 417
Peter A.C. 't Hoen, Matthew S. Hestand, Judith M. Boer, Yuching Lai, Maarten van Iterson, Michiel van Galen, Henk P. Buermans, and Johan T. den Dunnen
27.1 Introduction 417
27.2 Choice of Platform 417
27.3 Sequencing Depth 420
27.4 Replicates, Randomization, and Statistical Testing 422
27.5 Experimental Controls 425
27.6 General Quality Assessment 427
27.7 Platform-Specific Quality Scores 428
27.8 Quality Checks After Alignment 430
27.9 What Can Go Wrong 431
27.10 Perspectives 432
References 432
28 UTGB Toolkit for Personalized Genome Browsers 435
Taro L. Saito, Jun Yoshimura, Budrul Ahsan, Atsushi Sasaki, Reginaldo Kurosh, and Shinichi Morishita
28.1 Introduction 435
28.2 Overview of the UTGB Toolkit 436
28.3 Methods 438
28.4 Applications 444
28.5 Perspectives 447
References 447
29 Beyond the Pipelines: Cloud Computing Facilitates Management, Distribution, Security, and Analysis of High-Speed Sequencer Data 449
Boris Umylny and Richard S.J. Weisburd
29.1 Introduction 449
29.2 Data Management 450
29.3 Distribution 454
29.4 Analysis 456
29.5 Security 462
29.6 Healthcare Data and Privacy Issues 464
29.7 Sample Evaluation of a Vendor Solution 465
29.8 Perspectives 465
References 467
30 Computational Methods for the Identification of MicroRNAs from Small RNA Sequencing Data 469
Eugene Berezikov
30.1 Introduction 469
30.2 Implementing the miR-Intess Pipeline 470
30.3 Applications 474
References 474
Glossary 477
Link Collection for Next-Generation Sequencing 565
Index 575