【专题讨论】RNAi片段siRNA设计原则----搜索了一下好像还没有!
丁香园论坛
1144
RNAi片段siRNA设计原则
RNAi target selection rules:
1. Targeted regions on the cDNA sequence of a targeted gene should be located 50-100 nt downstream of the start codon (ATG).
2. Search for sequence motif AA(N19)TT or NA(N21), or NAR(N17)YNN, where N is any nucleotide, R is purine (A, G) and Y is pyrimidine (C, U).
3. Avoid targeting introns, since RNAi only works in the cytoplasm and not within the nucleus.
4. Avoid sequences with > 50% G+C content.
5. Avoid stretches of 4 or more nucleotide repeats.
6. Avoid 5URT and 3UTR, although siRNAs targeting UTRs have successfully induced gene inhibition.
7. Avoid sequences that share a certain degree of homology with other related or unrelated genes.
How to obtain a cDNA sequence for target selection
Before finding a RNAi target on the gene of your interest, first you have to get its mRNA sequence or sequence accession number as some siRNA design tools can take accession number as input. It is recommended to use the gene's RefSeq from NCBI, since the RefSeq represents non-redundant, curated and validated sequences. RefSeq mRNA sequences have unique accession numbers which start with NM or XM, followed by 6 digits. For example, NM_123456 (curated mRNA sequence) or XM_0123456 (model mRNA sequence predicted by genome sequence analysis). There are several ways of querying RefSeq.
1. Search LocusLink by gene name or symbol at http://www.ncbi.nlm.nih.gov/LocusLink/. Once the locus of your gene is found, scroll down to the "NCBI Reference Sequence (RefSeq)" section and look for mRNA.
2. Search Entrez Gene at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene, and select the right gene of desired organism. Once the page for the gene is shown, scroll down to the "NCBI Reference Sequence (RefSeq)" and look for mRNA.
3. Search Nucleotide database using Entrez query tool at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Nucleotide and use Entrez Limits settings to restrict your query to the RefSeq database only
o select "RefSeq" from the "Only from" menu, this restricts the query to the RefSeq collection
o select "mRNA" from the "Molecule" menu, this restricts the query to mRNA RefSeq records
Homology search
The RNAi targeted region on the mRNA sequence of a gene should not share significant homology with other genes or sequences in the genome, therefore, homology search is essential to minimize off-target effects. Although most siRNA design tools provide BLAST option, some simply use NCBI BLAST tools which sometimes are quite slow. Here are some BLAST tools for homology search.
• NCBI Blast tool: Nucleotide-nucleotide BLAST (blastn) or Search for short, nearly exact matches
• Blat tool on UCSC Genome Website http://genome.ucsc.edu/cgi-bin/hgBlat
• Ensembl Blast http://www.ensembl.org/Multi/blastview
References
1. Elbashir SM, Harborth J, Lendeckel W, Yalcin A, Weber K, Tuschl T. Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature. 2001 May 24;411(6836):494-8.
2. Elbashir SM, Lendeckel W, Tuschl T. RNA interference is mediated by 21- and 22-nucleotide RNAs. Genes Dev. 2001 Jan 15;15(2):188-200.
更多信息:
• siRNA Design Guidelines (Ribobio) - Using siRNA for gene silencing is a rapidly evolving tool in molecular biology. There are several methods for preparing siRNA, such as chemical synthesis, in vitro transcription, siRNA expression vectors, and PCR expression cassettes. Irrespective of which method one uses, the first step in designing a siRNA is to choose the siRNA target site. This guidelines for choosing siRNA target sites are based on both the current literature, and on empirical observations by scientists at Ribobio. Using these guidelines, approximately half of all siRNAs yield>50% reduction in target mRNA levels. You can design siRNA at : http://www.sirna.cn/design.aspx
• siRNA Design Rules (Protocol Online) - General siRNA design rules and rational siRNA design.
General Guidelines
1. siRNA targeted sequence is usually 21 nt in length.
2. Avoid regions within 50-100 bp of the start codon and the termination codon
3. Avoid intron regions
4. Avoid stretches of 4 or more bases such as AAAA, CCCC
5. Avoid regions with GC content <30% or > 60%.
6. Avoid repeats and low complex sequence
7. Avoid single nucleotide polymorphism (SNP) sites
8. Perform BLAST homology search to avoid off-target effects on other genes or sequences
9. Always design negative controls by scrambling targeted siRNA sequence. The control RNA should have the same length and nucleotide composition as the siRNA but have at least 4-5 bases mismatched to the siRNA. Make sure the scrambling will not create new homology to other genes.
Tom Tuschl's rules
1. Select targeted region from a given cDNA sequence beginning 50-100 nt downstream of start condon
2. First search for 23-nt sequence motif AA(N19). If no suitable sequence is found, then,
3. Search for 23-nt sequence motif NA(N21) and convert the 3' end of the sense siRNA to TT
4. Or search for NAR(N17)YNN
5. Target sequence should have a GC content of around 50%
A = Adenine; T = Thymine; R = Adenine or Guanine (Purines); Y = Thymine or Cytosine (Pyrimidines); N = Any.
Rational siRNA design
By experimentally analyzing the silencing efficiency of 180 siRNAs targeting the mRNA of two genes and correlating it with various sequence features of individual siRNAs, Reynolds et al at Dharmacon, Inc identified eight characteristics associated with siRNA functionality. These characteristics are used by rational siRNA design algorithm to evaluate potential targeted sequences and assign scores to them. Sequences with higher scores will have higher chance of success in RNAi. The table below lists the 8 criteria and the methods of score assignment.
Criteria Description Score
Yes No
1 Moderate to low (30%-52%) GC Content 1 point
2 At least 3 A/Us at positions 15-19 (sense) 1 point /per A or U
3 Lack of internal repeats (Tundefined<20¡ãC) 1 point
4 A at position 19 (sense) 1 point
5 A at position 3 (sense) 1 point
6 U at position 10 (sense) 1 point
7 No G/C at position 19 (sense) -1 point
8 No G at position 13 (sense) -1 point
A sum score of 6 defines the cutoff for selecting siRNAs. All siRNAs scoring higher than 6 are acceptable candidates.
~undefinedTm = 79.8 + 18.undefinedlog10([Na+]) + (58.4 * GC%/100) + (11.8 * (GC%/100)2) - (820/Length)
For example, the Tm can be calculated as follows for the siRNA UUCUCCAGCUUCUAAAAUA
Tm = 79.8 + 18.undefinedlog10(0.05) + (58.4 * 31.6/100) + (11.8 * (31.6/100)2) - (820/19)
Tm = 32.19
There are two siRNA design tools which implement this siRNA design algorithm: one is offered by Dharmacon, Inc; the other is a downloadable Excel template, written by Maurice Ho at http://boz094.ust.hk/RNAi/siRNA.
References
1. Elbashir SM et al. (2001) Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature. 411:494-498.
2. Elbahir SM et al. (2001). Functional anatomy of siRNAs for mediating efficient RNAi in Drosophila melanogaster embryo lysate. EMBO J. 20:6877-6888.
3. Elbashir SM et al. (2002). Analysis of gene function in somatic mammalian cells using small interfering RNAs. Methods. 26:199-213.
4. Reynolds A, Leake D, Boese Q, Scaringe S, Marshall WS, Khvorova A. Rational siRNA design for RNA interference. Nat Biotechnol. 2004 Mar;22(3):326-30.
RNAi target selection rules:
1. Targeted regions on the cDNA sequence of a targeted gene should be located 50-100 nt downstream of the start codon (ATG).
2. Search for sequence motif AA(N19)TT or NA(N21), or NAR(N17)YNN, where N is any nucleotide, R is purine (A, G) and Y is pyrimidine (C, U).
3. Avoid targeting introns, since RNAi only works in the cytoplasm and not within the nucleus.
4. Avoid sequences with > 50% G+C content.
5. Avoid stretches of 4 or more nucleotide repeats.
6. Avoid 5URT and 3UTR, although siRNAs targeting UTRs have successfully induced gene inhibition.
7. Avoid sequences that share a certain degree of homology with other related or unrelated genes.
How to obtain a cDNA sequence for target selection
Before finding a RNAi target on the gene of your interest, first you have to get its mRNA sequence or sequence accession number as some siRNA design tools can take accession number as input. It is recommended to use the gene's RefSeq from NCBI, since the RefSeq represents non-redundant, curated and validated sequences. RefSeq mRNA sequences have unique accession numbers which start with NM or XM, followed by 6 digits. For example, NM_123456 (curated mRNA sequence) or XM_0123456 (model mRNA sequence predicted by genome sequence analysis). There are several ways of querying RefSeq.
1. Search LocusLink by gene name or symbol at http://www.ncbi.nlm.nih.gov/LocusLink/. Once the locus of your gene is found, scroll down to the "NCBI Reference Sequence (RefSeq)" section and look for mRNA.
2. Search Entrez Gene at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene, and select the right gene of desired organism. Once the page for the gene is shown, scroll down to the "NCBI Reference Sequence (RefSeq)" and look for mRNA.
3. Search Nucleotide database using Entrez query tool at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Nucleotide and use Entrez Limits settings to restrict your query to the RefSeq database only
o select "RefSeq" from the "Only from" menu, this restricts the query to the RefSeq collection
o select "mRNA" from the "Molecule" menu, this restricts the query to mRNA RefSeq records
Homology search
The RNAi targeted region on the mRNA sequence of a gene should not share significant homology with other genes or sequences in the genome, therefore, homology search is essential to minimize off-target effects. Although most siRNA design tools provide BLAST option, some simply use NCBI BLAST tools which sometimes are quite slow. Here are some BLAST tools for homology search.
• NCBI Blast tool: Nucleotide-nucleotide BLAST (blastn) or Search for short, nearly exact matches
• Blat tool on UCSC Genome Website http://genome.ucsc.edu/cgi-bin/hgBlat
• Ensembl Blast http://www.ensembl.org/Multi/blastview
References
1. Elbashir SM, Harborth J, Lendeckel W, Yalcin A, Weber K, Tuschl T. Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature. 2001 May 24;411(6836):494-8.
2. Elbashir SM, Lendeckel W, Tuschl T. RNA interference is mediated by 21- and 22-nucleotide RNAs. Genes Dev. 2001 Jan 15;15(2):188-200.
更多信息:
• siRNA Design Guidelines (Ribobio) - Using siRNA for gene silencing is a rapidly evolving tool in molecular biology. There are several methods for preparing siRNA, such as chemical synthesis, in vitro transcription, siRNA expression vectors, and PCR expression cassettes. Irrespective of which method one uses, the first step in designing a siRNA is to choose the siRNA target site. This guidelines for choosing siRNA target sites are based on both the current literature, and on empirical observations by scientists at Ribobio. Using these guidelines, approximately half of all siRNAs yield>50% reduction in target mRNA levels. You can design siRNA at : http://www.sirna.cn/design.aspx
• siRNA Design Rules (Protocol Online) - General siRNA design rules and rational siRNA design.
General Guidelines
1. siRNA targeted sequence is usually 21 nt in length.
2. Avoid regions within 50-100 bp of the start codon and the termination codon
3. Avoid intron regions
4. Avoid stretches of 4 or more bases such as AAAA, CCCC
5. Avoid regions with GC content <30% or > 60%.
6. Avoid repeats and low complex sequence
7. Avoid single nucleotide polymorphism (SNP) sites
8. Perform BLAST homology search to avoid off-target effects on other genes or sequences
9. Always design negative controls by scrambling targeted siRNA sequence. The control RNA should have the same length and nucleotide composition as the siRNA but have at least 4-5 bases mismatched to the siRNA. Make sure the scrambling will not create new homology to other genes.
Tom Tuschl's rules
1. Select targeted region from a given cDNA sequence beginning 50-100 nt downstream of start condon
2. First search for 23-nt sequence motif AA(N19). If no suitable sequence is found, then,
3. Search for 23-nt sequence motif NA(N21) and convert the 3' end of the sense siRNA to TT
4. Or search for NAR(N17)YNN
5. Target sequence should have a GC content of around 50%
A = Adenine; T = Thymine; R = Adenine or Guanine (Purines); Y = Thymine or Cytosine (Pyrimidines); N = Any.
Rational siRNA design
By experimentally analyzing the silencing efficiency of 180 siRNAs targeting the mRNA of two genes and correlating it with various sequence features of individual siRNAs, Reynolds et al at Dharmacon, Inc identified eight characteristics associated with siRNA functionality. These characteristics are used by rational siRNA design algorithm to evaluate potential targeted sequences and assign scores to them. Sequences with higher scores will have higher chance of success in RNAi. The table below lists the 8 criteria and the methods of score assignment.
Criteria Description Score
Yes No
1 Moderate to low (30%-52%) GC Content 1 point
2 At least 3 A/Us at positions 15-19 (sense) 1 point /per A or U
3 Lack of internal repeats (Tundefined<20¡ãC) 1 point
4 A at position 19 (sense) 1 point
5 A at position 3 (sense) 1 point
6 U at position 10 (sense) 1 point
7 No G/C at position 19 (sense) -1 point
8 No G at position 13 (sense) -1 point
A sum score of 6 defines the cutoff for selecting siRNAs. All siRNAs scoring higher than 6 are acceptable candidates.
~undefinedTm = 79.8 + 18.undefinedlog10([Na+]) + (58.4 * GC%/100) + (11.8 * (GC%/100)2) - (820/Length)
For example, the Tm can be calculated as follows for the siRNA UUCUCCAGCUUCUAAAAUA
Tm = 79.8 + 18.undefinedlog10(0.05) + (58.4 * 31.6/100) + (11.8 * (31.6/100)2) - (820/19)
Tm = 32.19
There are two siRNA design tools which implement this siRNA design algorithm: one is offered by Dharmacon, Inc; the other is a downloadable Excel template, written by Maurice Ho at http://boz094.ust.hk/RNAi/siRNA.
References
1. Elbashir SM et al. (2001) Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature. 411:494-498.
2. Elbahir SM et al. (2001). Functional anatomy of siRNAs for mediating efficient RNAi in Drosophila melanogaster embryo lysate. EMBO J. 20:6877-6888.
3. Elbashir SM et al. (2002). Analysis of gene function in somatic mammalian cells using small interfering RNAs. Methods. 26:199-213.
4. Reynolds A, Leake D, Boese Q, Scaringe S, Marshall WS, Khvorova A. Rational siRNA design for RNA interference. Nat Biotechnol. 2004 Mar;22(3):326-30.