고 반복 모티프를 포함하는 DNA 서열에 대한 효율적이고 특이적인 표적화를 위한 희소-절단 엔도뉴클레아제의 설계 |
|||||||||||||||||||||||||||||||
申请号 | KR1020167013791 | 申请日 | 2014-10-24 | 公开(公告)号 | KR1020160068971A | 公开(公告)日 | 2016-06-15 | ||||||||||||||||||||||||
申请人 | 셀렉티스; | 发明人 | 뒤차티우,필립; 쥐이라,알렉산드르; | ||||||||||||||||||||||||||||
摘要 | 본발명은유전공학편집도구및 유전공학적방법에관한분야에속한다. 본발명은특정유전질병, 특히헌팅턴병과같은삼중반복질병의기원이되는염색체내에서고 반복모티프를수축하도록설계된희소-절단엔도뉴클레아제의재조합설계에관한것이다. 본발명은반복모티프를수축시키기위한방법, 반복장애와관련된유전자내의반복모티프를수축시키기위하여사용되는희소-절단엔도뉴클레아제, 이를인코딩하는폴리뉴클레오티드, 및벡터, 그리고이에따른약제학적조성물을포함한다. | ||||||||||||||||||||||||||||||
权利要求 | 반복장애와 관련된 유전자 내의 폴리뉴클레오티드 반복을 수축시키기 위하여 세포 내에서 사용되고, 반복서열을 특이적으로 절단하도록 재조합설계되며, 상기 반복서열에 인접한 영역을 포함하는 표적서열을 인식하는 것을 특징으로 하는 희소-절단 엔도뉴클레아제. 청구항 1에 있어서, 상기 반복서열에 인접한 적어도 10개의 핵산 염기 및 상기 반복서열 내의 적어도 5개의 핵산 염기에 걸친 표적서열을 인식하는 희소-절단 엔도뉴클레아제. 청구항 1에 있어서, 상기 희소-절단 엔도뉴클레아제는 상기 반복서열에 인접한 영역을 포함하는 표적서열을 인식하는 결합 도메인 및 상기 반복서열 내부를 절단하는 엔도뉴클레아제 도메인을 포함하는 키메릭 엔도뉴클레아제인 희소-절단 엔도뉴클레아제. 청구항 3에 있어서, 상기 결합 도메인은 재조합설계된 TALE, MBBBD 또는 ZF 결합 도메인인 희소-절단 엔도뉴클레아제. 청구항 4에 있어서, 상기 엔도뉴클레아제 도메인은 I-TevI, NucA, ColE7 또는 Fok-1로 이루어진 군으로부터 선택되는 희소-절단 엔도뉴클레아제. 청구항 1에 있어서, 상기 희소-절단 엔도뉴클레아제는 Cas9이며, 상기 표적 서열의 인식은 상기 표적서열과 혼성화 결합할 수 있는 가이드RNA에 의하여 이루어지는 희소-절단 엔도뉴클레아제. 청구항 1에 있어서, 상기 표적서열은 서열번호 1 내지 서열번호 3으로 이루어진 군으로부터 선택되는 서열 내에 있는 희소-절단 엔도뉴클레아제. 청구항 7에 있어서, 상기 희소-절단 엔도뉴클레아제는 서열번호 8, 10 및 15와 적어도 80%, 바람직하게는 85%, 90%, 95% 동일한 아미노산 서열을 갖는 희소-절단 엔도뉴클레아제. 청구항 1 내지 8의 어느 한 항에 있어서, 반복질병을 치료 또는 예방하기 위한 희소-절단 엔도뉴클레아제. 청구항 1 내지 8의 어느 한 항에 있어서, 헌팅턴병을 치료 또는 예방하기 위한 희소-절단 엔도뉴클레아제. 청구항 1 내지 10의 어느 한 항에 따른 상기 희소-절단 엔도뉴클레아제를 인코딩하는 폴리뉴클레오티드. 청구항 11의 폴리뉴클레오티드를 포함하는 벡터. 청구항 1 내지 11의 어느 한 항에 따른 적어도 하나의 희소-절단 엔도뉴클레아제 또는 청구항 11 또는 12에 따른 폴리뉴클레오티드를 포함하는 약제학적 조성물. (a) 반복서열에 인접한 영역을 포함하는 표적서열을 선택하는 단계; (b) 상기 표적서열에 결합하고 상기 반복서열을 절단할 수 있는 희소-절단 엔도뉴클레아제를 제공하는 단계; (c) DNA 결합 뉴클레아제가 반복서열 내에서 절단을 유도하고, 상기 반복서열을 수축시키는 것을 수행하는 회복과정을 유도하도록 상기 희소-절단 엔도뉴클레아제를 상기 세포 내로 도입하는 단계;를 포함하는 살아있는 세포 내에서 반복장애와 관련된 유전자서열 내의 반복서열을 수축시키는 방법. 청구항 14에 있어서, 상기 반복서열을 수축시키는 것을 수행하는 회복과정은 SSA (단일 가닥 어닐링)인 세포의 유전자서열 내의 반복서열을 수축시키는 방법. 청구항 15에 있어서, 상기 희소-절단 엔도뉴클레아제는 청구항 1 내지 10의 어느 한 항에 따른 것인 세포의 유전자서열 내의 반복서열을 수축시키는 방법. 청구항 9 내지 12의 어느 한 항에 따른 적어도 하나의 희소-절단 엔도뉴클레아제 또는 청구항 13의 폴리뉴클레오티드를 포함하는 분리된 세포. 청구항 17에 있어서, 포유동물 세포인 분리된 세포. |
||||||||||||||||||||||||||||||
说明书全文 |
|
이름 | 서열 |
TiFLAN | TCTCAAGATTTCGCTGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCA |
TiFLAN2_T01.1 | TGTGATCCCCCCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCA |
표 1: TALEN 쌍에 의하여 표적화되는 서열 목록. 반복서열을 플랭킹하는 TALEN™ (위치 T0가 생략됨)에 의하여 표적화되는 16개의 염기쌍 서열이 밀줄로 표시되어 있다.
37℃에서 | pCLS9984/pCLS9996 | pCLS16715/pCLS9996 |
TiFLAN | +++ | - |
TiFLAN2_T01.1 | + | +++ |
30℃에서 | pCLS9984/pCLS9996 | pCLS16715/pCLS9996 |
TiFLAN | +++ | - |
TiFLAN2_T01.1 | - | ++ |
표 2: 37℃ 및 30℃에서 전술된 (국제 PCT 출원 WO 2004/067736 및 (Epinat, Arnould et al. 2003; Chames, Epinat et al. 2005; Arnould, Chames et al. 2006; Smith, Grizot et al. 2006; Smith, Grizot et al. 2006) 효모 SSA 분석에서의 TALEN™의 활성. -는 검출가능한 활성이 없음을 나타내며, +는 약한 활성을 나타내고, ++는 높은 활성을 나타낸다. na는 사용가능한 데이터가 없음을 나타낸다.
참고문헌
Arnould, S., P. Chames, et al. (2006). "Engineering of large numbers of highly specific homing endonucleases that induce recombination on novel DNA targets." J Mol Biol 355(3): 443-58.
Atkins, JF, NM Wills, et al. (2007). "A case for "StopGo": reprogramming translation to augment codon meaning of GGN by promoting unconventional termination (Stop) after addition of glycine and then allowing continued translation (Go)." Rna 13(6): 803-10.
Boch, J., H. Scholze, et al. (2009). "Breaking the code of DNA binding specificity of TAL-type III effectors." Science 326(5959): 1509-12.
Caplen, NJ, JP Taylor, et al. (2002). "Rescue of polyglutamine-mediated cytotoxicity by double-stranded RNA-mediated RNA interference." Hum Mol Genet 11(2): 175-84.
Chames, P., JC Epinat, et al. (2005). "In vivo selection of engineered homing endonucleases using double-strand break induced homologous recombination." Nucleic Acids Res 33(20): e178.
Christian, M., T. Cermak, et al. (2010). "Targeting DNA double-strand breaks with TAL effector nucleases." Genetics 186(2): 757-61.
Cong, L., FA Ran, et al. (2013). "Multiplex genome engineering using CRISPR/Cas systems." Science 339(6121): 819-23.
DeJesus-Hernandez, M., IR Mackenzie, et al. (2011). "Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS." Neuron 72(2): 245-56.
Deltcheva, E., K. Chylinski, et al. (2011). "CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III." Nature 471(7340): 602-7.
DiFiglia, M., M. Sena-Esteves, et al. (2007). "Therapeutic silencing of mutant huntingtin with siRNA attenuates striatal and cortical neuropathology and behavioral deficits." Proc Natl Acad Sci USA 104(43): 17204-9.
Donnelly, M. and G. Elliott (2001). "Nuclear localization and shuttling of herpes simplex virus tegument protein VP13/14." J Virol 75(6): 2566-74.
Donnelly, ML, G. Luke, et al. (2001). "Analysis of the aphthovirus 2A/2B polyprotein 'cleavage' mechanism indicates not a proteolytic reaction, but a novel translational effect: a putative ribosomal 'skip'." J Gen Virol 82(Pt 5): 1013-25.
Doronina, VA, C. Wu, et al. (2008). "Site-specific release of nascent chains from ribosomes at a sense codon." Mol Cell Biol 28(13): 4227-39.
Dragatsis, I., MS Levine, et al. (2000). "Inactivation of Hdh in the brain and testis results in progressive neurodegeneration and sterility in mice." Nat Genet 26(3): 300-6.
Duyao, MP, AB Auerbach, et al. (1995). "Inactivation of the mouse Huntington's disease gene homolog Hdh." Science 269(5222): 407-10.
Epinat, JC, S. Arnould, et al. (2003). "A novel engineered meganuclease induces homologous recombination in yeast and mammalian cells." Nucleic Acids Res 31(11): 2952-62.
Garneau, JE, ME Dupuis, et al. (2010). "The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA." Nature 468(7320): 67-71.
Garriga-Canut, M., C. Agustin-Pavon, et al. "Synthetic zinc finger repressors reduce mutant huntingtin expression in the brain of R6/2 mice." Proc Natl Acad Sci USA 109(45): E3136-45.
Gasiunas, G., R. Barrangou, et al. (2012). "Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria." Proc Natl Acad Sci USA 109(39): E2579-86.
Jinek, M., K. Chylinski, et al. (2012). "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity." Science 337(6096): 816-21.
Lackner, G., N. Moebius, et al. (2011). "Complete genome sequence of Burkholderia rhizoxinica, an Endosymbiont of Rhizopus microsporus." J Bacteriol 193(3): 783-4.
Machida, Y., T. Okada, et al. (2006). "rAAV-mediated shRNA ameliorated neuropathology in Huntington disease model mouse." Biochem Biophys Res Commun 343(1): 190-7.
Mali, P., L. Yang, et al. (2013). "RNA-guided human genome engineering via Cas9." Science 339(6121): 823-6.
Mirkin, SM (2007). "Expandable DNA repeats and human disease." Nature 447(7147): 932-40.
Moscou, MJ and AJ Bogdanove (2009). "A simple cipher governs DNA recognition by TAL effectors." Science 326(5959): 1501.
Nelson, DL, HT Orr, et al. (2013). "The unstable repeats--three evolving faces of neurological disease." Neuron 77(5): 825-43.
Orr, HT and HY Zoghbi (2007). "trinucleotide repeat disorders." Annu Rev Neurosci 30: 575-621.
Renton, AE, E. Majounie, et al. (2011). "A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD." Neuron 72(2): 257-68.
Richard, GF, B. Dujon, et al. (1999). "Double-strand break repair can lead to high frequencies of deletions within short CAG/CTG trinucleotide repeats." Mol Gen Genet 261(4-5): 871-82.
Smith, J., S. Grizot, et al. (2006). "A combinatorial approach to create artificial homing endonucleases cleaving chosen sequences." Nucleic Acids Res 34(22): e149.
Smith, J., S. Grizot, et al. (2006). "A combinatorial approach to create artificial homing endonucleases cleaving chosen sequences." Nucleic Acids Res.
Sorek, R., CM Lawrence, et al. (2013). "CRISPR-mediated Adaptive Immune Systems in Bacteria and Archaea." Annu Rev Biochem.
Stoddard, BL (2005). "Homing endonuclease structure and function." Q Rev Biophys 38(1): 49-95.
Wang, YL, W. Liu, et al. (2005). "Clinico-pathological rescue of a model mouse of Huntington's disease by siRNA." Neurosci Res 53(3): 241-9.
SEQUENCE LISTING <110> Cellectis <120> DESIGN OF RARE-CUTTING ENDONUCLEASES FOR EFFICIENT AND SPECIFIC TARGETING DNA SEQUENCES COMPRISING HIGHLY REPETITIVE MOTIVES <130> P81313675PCT00 <160> 16 <170> patentIn version 3.5 <210> 1 <211> 13481 <212> DNA <213> Homo sapiens <220> <223> huntingtin (HTT) mRNA <400> 1 gctgccggga cgggtccaag atggacggcc gctcaggttc tgcttttacc tgcggcccag 60 agccccattc attgccccgg tgctgagcgg cgccgcgagt cggcccgagg cctccgggga 120 ctgccgtgcc gggcgggaga ccgccatggc gaccctggaa aagctgatga aggccttcga 180 gtccctcaag tccttccagc agcagcagca gcagcagcag cagcagcagc agcagcagca 240 gcagcagcag cagcagcagc aacagccgcc accgccgccg ccgccgccgc cgcctcctca 300 gcttcctcag ccgccgccgc aggcacagcc gctgctgcct cagccgcagc cgcccccgcc 360 gccgcccccg ccgccacccg gcccggctgt ggctgaggag ccgctgcacc gaccaaagaa 420 agaactttca gctaccaaga aagaccgtgt gaatcattgt ctgacaatat gtgaaaacat 480 agtggcacag tctgtcagaa attctccaga atttcagaaa cttctgggca tcgctatgga 540 actttttctg ctgtgcagtg atgacgcaga gtcagatgtc aggatggtgg ctgacgaatg 600 cctcaacaaa gttatcaaag ctttgatgga ttctaatctt ccaaggttac agctcgagct 660 ctataaggaa attaaaaaga atggtgcccc tcggagtttg cgtgctgccc tgtggaggtt 720 tgctgagctg gctcacctgg ttcggcctca gaaatgcagg ccttacctgg tgaaccttct 780 gccgtgcctg actcgaacaa gcaagagacc cgaagaatca gtccaggaga ccttggctgc 840 agctgttccc aaaattatgg cttcttttgg caattttgca aatgacaatg aaattaaggt 900 tttgttaaag gccttcatag cgaacctgaa gtcaagctcc cccaccattc ggcggacagc 960 ggctggatca gcagtgagca tctgccagca ctcaagaagg acacaatatt tctatagttg 1020 gctactaaat gtgctcttag gcttactcgt tcctgtcgag gatgaacact ccactctgct 1080 gattcttggc gtgctgctca ccctgaggta tttggtgccc ttgctgcagc agcaggtcaa 1140 ggacacaagc ctgaaaggca gcttcggagt gacaaggaaa gaaatggaag tctctccttc 1200 tgcagagcag cttgtccagg tttatgaact gacgttacat catacacagc accaagacca 1260 caatgttgtg accggagccc tggagctgtt gcagcagctc ttcagaacgc ctccacccga 1320 gcttctgcaa accctgaccg cagtcggggg cattgggcag ctcaccgctg ctaaggagga 1380 gtctggtggc cgaagccgta gtgggagtat tgtggaactt atagctggag ggggttcctc 1440 atgcagccct gtcctttcaa gaaaacaaaa aggcaaagtg ctcttaggag aagaagaagc 1500 cttggaggat gactctgaat cgagatcgga tgtcagcagc tctgccttaa cagcctcagt 1560 gaaggatgag atcagtggag agctggctgc ttcttcaggg gtttccactc cagggtcagc 1620 aggtcatgac atcatcacag aacagccacg gtcacagcac acactgcagg cggactcagt 1680 ggatctggcc agctgtgact tgacaagctc tgccactgat ggggatgagg aggatatctt 1740 gagccacagc tccagccagg tcagcgccgt cccatctgac cctgccatgg acctgaatga 1800 tgggacccag gcctcgtcgc ccatcagcga cagctcccag accaccaccg aagggcctga 1860 ttcagctgtt accccttcag acagttctga aattgtgtta gacggtaccg acaaccagta 1920 tttgggcctg cagattggac agccccagga tgaagatgag gaagccacag gtattcttcc 1980 tgatgaagcc tcggaggcct tcaggaactc ttccatggcc cttcaacagg cacatttatt 2040 gaaaaacatg agtcactgca ggcagccttc tgacagcagt gttgataaat ttgtgttgag 2100 agatgaagct actgaaccgg gtgatcaaga aaacaagcct tgccgcatca aaggtgacat 2160 tggacagtcc actgatgatg actctgcacc tcttgtccat tgtgtccgcc ttttatctgc 2220 ttcgtttttg ctaacagggg gaaaaaatgt gctggttccg gacagggatg tgagggtcag 2280 cgtgaaggcc ctggccctca gctgtgtggg agcagctgtg gccctccacc cggaatcttt 2340 cttcagcaaa ctctataaag ttcctcttga caccacggaa taccctgagg aacagtatgt 2400 ctcagacatc ttgaactaca tcgatcatgg agacccacag gttcgaggag ccactgccat 2460 tctctgtggg accctcatct gctccatcct cagcaggtcc cgcttccacg tgggagattg 2520 gatgggcacc attagaaccc tcacaggaaa tacattttct ttggcggatt gcattccttt 2580 gctgcggaaa acactgaagg atgagtcttc tgttacttgc aagttagctt gtacagctgt 2640 gaggaactgt gtcatgagtc tctgcagcag cagctacagt gagttaggac tgcagctgat 2700 catcgatgtg ctgactctga ggaacagttc ctattggctg gtgaggacag agcttctgga 2760 aacccttgca gagattgact tcaggctggt gagctttttg gaggcaaaag cagaaaactt 2820 acacagaggg gctcatcatt atacagggct tttaaaactg caagaacgag tgctcaataa 2880 tgttgtcatc catttgcttg gagatgaaga ccccagggtg cgacatgttg ccgcagcatc 2940 actaattagg cttgtcccaa agctgtttta taaatgtgac caaggacaag ctgatccagt 3000 agtggccgtg gcaagagatc aaagcagtgt ttacctgaaa cttctcatgc atgagacgca 3060 gcctccatct catttctccg tcagcacaat aaccagaata tatagaggct ataacctact 3120 accaagcata acagacgtca ctatggaaaa taacctttca agagttattg cagcagtttc 3180 tcatgaacta atcacatcaa ccaccagagc actcacattt ggatgctgtg aagctttgtg 3240 tcttctttcc actgccttcc cagtttgcat ttggagttta ggttggcact gtggagtgcc 3300 tccactgagt gcctcagatg agtctaggaa gagctgtacc gttgggatgg ccacaatgat 3360 tctgaccctg ctctcgtcag cttggttccc attggatctc tcagcccatc aagatgcttt 3420 gattttggcc ggaaacttgc ttgcagccag tgctcccaaa tctctgagaa gttcatgggc 3480 ctctgaagaa gaagccaacc cagcagccac caagcaagag gaggtctggc cagccctggg 3540 ggaccgggcc ctggtgccca tggtggagca gctcttctct cacctgctga aggtgattaa 3600 catttgtgcc cacgtcctgg atgacgtggc tcctggaccc gcaataaagg cagccttgcc 3660 ttctctaaca aacccccctt ctctaagtcc catccgacga aaggggaagg agaaagaacc 3720 aggagaacaa gcatctgtac cgttgagtcc caagaaaggc agtgaggcca gtgcagcttc 3780 tagacaatct gatacctcag gtcctgttac aacaagtaaa tcctcatcac tggggagttt 3840 ctatcatctt ccttcatacc tcaaactgca tgatgtcctg aaagctacac acgctaacta 3900 caaggtcacg ctggatcttc agaacagcac ggaaaagttt ggagggtttc tccgctcagc 3960 cttggatgtt ctttctcaga tactagagct ggccacactg caggacattg ggaagtgtgt 4020 tgaagagatc ctaggatacc tgaaatcctg ctttagtcga gaaccaatga tggcaactgt 4080 ttgtgttcaa caattgttga agactctctt tggcacaaac ttggcctccc agtttgatgg 4140 cttatcttcc aaccccagca agtcacaagg ccgagcacag cgccttggct cctccagtgt 4200 gaggccaggc ttgtaccact actgcttcat ggccccgtac acccacttca cccaggccct 4260 cgctgacgcc agcctgagga acatggtgca ggcggagcag gagaacgaca cctcgggatg 4320 gtttgatgtc ctccagaaag tgtctaccca gttgaagaca aacctcacga gtgtcacaaa 4380 gaaccgtgca gataagaatg ctattcataa tcacattcgt ttgtttgaac ctcttgttat 4440 aaaagcttta aaacagtaca cgactacaac atgtgtgcag ttacagaagc aggttttaga 4500 tttgctggcg cagctggttc agttacgggt taattactgt cttctggatt cagatcaggt 4560 gtttattggc tttgtattga aacagtttga atacattgaa gtgggccagt tcagggaatc 4620 agaggcaatc attccaaaca tctttttctt cttggtatta ctatcttatg aacgctatca 4680 ttcaaaacag atcattggaa ttcctaaaat cattcagctc tgtgatggca tcatggccag 4740 tggaaggaag gctgtgacac atgccatacc ggctctgcag cccatagtcc acgacctctt 4800 tgtattaaga ggaacaaata aagctgatgc aggaaaagag cttgaaaccc aaaaagaggt 4860 ggtggtgtca atgttactga gactcatcca gtaccatcag gtgttggaga tgttcattct 4920 tgtcctgcag cagtgccaca aggagaatga agacaagtgg aagcgactgt ctcgacagat 4980 agctgacatc atcctcccaa tgttagccaa acagcagatg cacattgact ctcatgaagc 5040 ccttggagtg ttaaatacat tatttgagat tttggcccct tcctccctcc gtccggtaga 5100 catgctttta cggagtatgt tcgtcactcc aaacacaatg gcgtccgtga gcactgttca 5160 actgtggata tcgggaattc tggccatttt gagggttctg atttcccagt caactgaaga 5220 tattgttctt tctcgtattc aggagctctc cttctctccg tatttaatct cctgtacagt 5280 aattaatagg ttaagagatg gggacagtac ttcaacgcta gaagaacaca gtgaagggaa 5340 acaaataaag aatttgccag aagaaacatt ttcaaggttt ctattacaac tggttggtat 5400 tcttttagaa gacattgtta caaaacagct gaaggtggaa atgagtgagc agcaacatac 5460 tttctattgc caggaactag gcacactgct aatgtgtctg atccacatct tcaagtctgg 5520 aatgttccgg agaatcacag cagctgccac taggctgttc cgcagtgatg gctgtggcgg 5580 cagtttctac accctggaca gcttgaactt gcgggctcgt tccatgatca ccacccaccc 5640 ggccctggtg ctgctctggt gtcagatact gctgcttgtc aaccacaccg actaccgctg 5700 gtgggcagaa gtgcagcaga ccccgaaaag acacagtctg tccagcacaa agttacttag 5760 tccccagatg tctggagaag aggaggattc tgacttggca gccaaacttg gaatgtgcaa 5820 tagagaaata gtacgaagag gggctctcat tctcttctgt gattatgtct gtcagaacct 5880 ccatgactcc gagcacttaa cgtggctcat tgtaaatcac attcaagatc tgatcagcct 5940 ttcccacgag cctccagtac aggacttcat cagtgccgtt catcggaact ctgctgccag 6000 cggcctgttc atccaggcaa ttcagtctcg ttgtgaaaac ctttcaactc caaccatgct 6060 gaagaaaact cttcagtgct tggaggggat ccatctcagc cagtcgggag ctgtgctcac 6120 gctgtatgtg gacaggcttc tgtgcacccc tttccgtgtg ctggctcgca tggtcgacat 6180 ccttgcttgt cgccgggtag aaatgcttct ggctgcaaat ttacagagca gcatggccca 6240 gttgccaatg gaagaactca acagaatcca ggaatacctt cagagcagcg ggctcgctca 6300 gagacaccaa aggctctatt ccctgctgga caggtttcgt ctctccacca tgcaagactc 6360 acttagtccc tctcctccag tctcttccca cccgctggac ggggatgggc acgtgtcact 6420 ggaaacagtg agtccggaca aagactggta cgttcatctt gtcaaatccc agtgttggac 6480 caggtcagat tctgcactgc tggaaggtgc agagctggtg aatcggattc ctgctgaaga 6540 tatgaatgcc ttcatgatga actcggagtt caacctaagc ctgctagctc catgcttaag 6600 cctagggatg agtgaaattt ctggtggcca gaagagtgcc ctttttgaag cagcccgtga 6660 ggtgactctg gcccgtgtga gcggcaccgt gcagcagctc cctgctgtcc atcatgtctt 6720 ccagcccgag ctgcctgcag agccggcggc ctactggagc aagttgaatg atctgtttgg 6780 ggatgctgca ctgtatcagt ccctgcccac tctggcccgg gccctggcac agtacctggt 6840 ggtggtctcc aaactgccca gtcatttgca ccttcctcct gagaaagaga aggacattgt 6900 gaaattcgtg gtggcaaccc ttgaggccct gtcctggcat ttgatccatg agcagatccc 6960 gctgagtctg gatctccagg cagggctgga ctgctgctgc ctggccctgc agctgcctgg 7020 cctctggagc gtggtctcct ccacagagtt tgtgacccac gcctgctccc tcatctactg 7080 tgtgcacttc atcctggagg ccgttgcagt gcagcctgga gagcagcttc ttagtccaga 7140 aagaaggaca aataccccaa aagccatcag cgaggaggag gaggaagtag atccaaacac 7200 acagaatcct aagtatatca ctgcagcctg tgagatggtg gcagaaatgg tggagtctct 7260 gcagtcggtg ttggccttgg gtcataaaag gaatagcggc gtgccggcgt ttctcacgcc 7320 attgctaagg aacatcatca tcagcctggc ccgcctgccc cttgtcaaca gctacacacg 7380 tgtgccccca ctggtgtgga agcttggatg gtcacccaaa ccgggagggg attttggcac 7440 agcattccct gagatccccg tggagttcct ccaggaaaag gaagtcttta aggagttcat 7500 ctaccgcatc aacacactag gctggaccag tcgtactcag tttgaagaaa cttgggccac 7560 cctccttggt gtcctggtga cgcagcccct cgtgatggag caggaggaga gcccaccaga 7620 agaagacaca gagaggaccc agatcaacgt cctggccgtg caggccatca cctcactggt 7680 gctcagtgca atgactgtgc ctgtggccgg caacccagct gtaagctgct tggagcagca 7740 gccccggaac aagcctctga aagctctcga caccaggttt gggaggaagc tgagcattat 7800 cagagggatt gtggagcaag agattcaagc aatggtttca aagagagaga atattgccac 7860 ccatcattta tatcaggcat gggatcctgt cccttctctg tctccggcta ctacaggtgc 7920 cctcatcagc cacgagaagc tgctgctaca gatcaacccc gagcgggagc tggggagcat 7980 gagctacaaa ctcggccagg tgtccataca ctccgtgtgg ctggggaaca gcatcacacc 8040 cctgagggag gaggaatggg acgaggaaga ggaggaggag gccgacgccc ctgcaccttc 8100 gtcaccaccc acgtctccag tcaactccag gaaacaccgg gctggagttg acatccactc 8160 ctgttcgcag tttttgcttg agttgtacag ccgctggatc ctgccgtcca gctcagccag 8220 gaggaccccg gccatcctga tcagtgaggt ggtcagatcc cttctagtgg tctcagactt 8280 gttcaccgag cgcaaccagt ttgagctgat gtatgtgacg ctgacagaac tgcgaagggt 8340 gcacccttca gaagacgaga tcctcgctca gtacctggtg cctgccacct gcaaggcagc 8400 tgccgtcctt gggatggaca aggccgtggc ggagcctgtc agccgcctgc tggagagcac 8460 gctcaggagc agccacctgc ccagcagggt tggagccctg cacggcgtcc tctatgtgct 8520 ggagtgcgac ctgctggacg acactgccaa gcagctcatc ccggtcatca gcgactatct 8580 cctctccaac ctgaaaggga tcgcccactg cgtgaacatt cacagccagc agcacgtact 8640 ggtcatgtgt gccactgcgt tttacctcat tgagaactat cctctggacg tagggccgga 8700 attttcagca tcaataatac agatgtgtgg ggtgatgctg tctggaagtg aggagtccac 8760 cccctccatc atttaccact gtgccctcag aggcctggag cgcctcctgc tctctgagca 8820 gctctcccgc ctggatgcag aatcgctggt caagctgagt gtggacagag tgaacgtgca 8880 cagcccgcac cgggccatgg cggctctggg cctgatgctc acctgcatgt acacaggaaa 8940 ggagaaagtc agtccgggta gaacttcaga ccctaatcct gcagcccccg acagcgagtc 9000 agtgattgtt gctatggagc gggtatctgt tctttttgat aggatcagga aaggctttcc 9060 ttgtgaagcc agagtggtgg ccaggatcct gccccagttt ctagacgact tcttcccacc 9120 ccaggacatc atgaacaaag tcatcggaga gtttctgtcc aaccagcagc cataccccca 9180 gttcatggcc accgtggtgt ataaggtgtt tcagactctg cacagcaccg ggcagtcgtc 9240 catggtccgg gactgggtca tgctgtccct ctccaacttc acgcagaggg ccccggtcgc 9300 catggccacg tggagcctct cctgcttctt tgtcagcgcg tccaccagcc cgtgggtcgc 9360 ggcgatcctc ccacatgtca tcagcaggat gggcaagctg gagcaggtgg acgtgaacct 9420 tttctgcctg gtcgccacag acttctacag acaccagata gaggaggagc tcgaccgcag 9480 ggccttccag tctgtgcttg aggtggttgc agccccagga agcccatatc accggctgct 9540 gacttgttta cgaaatgtcc acaaggtcac cacctgctga gcgccatggt gggagagact 9600 gtgaggcggc agctggggcc ggagcctttg gaagtctgcg cccttgtgcc ctgcctccac 9660 cgagccagct tggtccctat gggcttccgc acatgccgcg ggcggccagg caacgtgcgt 9720 gtctctgcca tgtggcagaa gtgctctttg tggcagtggc caggcaggga gtgtctgcag 9780 tcctggtggg gctgagcctg aggccttcca gaaagcagga gcagctgtgc tgcaccccat 9840 gtgggtgacc aggtcctttc tcctgatagt cacctgctgg ttgttgccag gttgcagctg 9900 ctcttgcatc tgggccagaa gtcctccctc ctgcaggctg gctgttggcc cctctgctgt 9960 cctgcagtag aaggtgccgt gagcaggctt tgggaacact ggcctgggtc tccctggtgg 10020 ggtgtgcatg ccacgccccg tgtctggatg cacagatgcc atggcctgtg ctgggccagt 10080 ggctgggggt gctagacacc cggcaccatt ctcccttctc tcttttcttc tcaggattta 10140 aaatttaatt atatcagtaa agagattaat tttaacgtaa ctctttctat gcccgtgtaa 10200 agtatgtgaa tcgcaaggcc tgtgctgcat gcgacagcgt ccggggtggt ggacagggcc 10260 cccggccacg ctccctctcc tgtagccact ggcatagccc tcctgagcac ccgctgacat 10320 ttccgttgta catgttcctg tttatgcatt cacaaggtga ctgggatgta gagaggcgtt 10380 agtgggcagg tggccacagc aggactgagg acaggccccc attatcctag gggtgcgctc 10440 acctgcagcc cctcctcctc gggcacagac gactgtcgtt ctccacccac cagtcaggga 10500 cagcagcctc cctgtcactc agctgagaag gccagccctc cctggctgtg agcagcctcc 10560 actgtgtcca gagacatggg cctcccactc ctgttccttg ctagccctgg ggtggcgtct 10620 gcctaggagc tggctggcag gtgttgggac ctgctgctcc atggatgcat gccctaagag 10680 tgtcactgag ctgtgttttg tctgagcctc tctcggtcaa cagcaaagct tggtgtcttg 10740 gcactgttag tgacagagcc cagcatccct tctgcccccg ttccagctga catcttgcac 10800 ggtgacccct tttagtcagg agagtgcaga tctgtgctca tcggagactg ccccacggcc 10860 ctgtcagagc cgccactcct atccccaggc caggtccctg gaccagcctc ctgtttgcag 10920 gcccagagga gccaagtcat taaaatggaa gtggattctg gatggccggg ctgctgctga 10980 tgtaggagct ggatttggga gctctgcttg ccgactggct gtgagacgag gcaggggctc 11040 tgcttcctca gccctagagg cgagccaggc aaggttggcg actgtcatgt ggcttggttt 11100 ggtcatgccc gtcgatgttt tgggtattga atgtggtaag tggaggaaat gttggaactc 11160 tgtgcaggtg ctgccttgag acccccaagc ttccacctgt ccctctccta tgtggcagct 11220 ggggagcagc tgagatgtgg acttgtatgc tgcccacata cgtgaggggg agctgaaagg 11280 gagcccctcc tctgagcagc ctctgccagg cctgtatgag gcttttccca ccagctccca 11340 acagaggcct cccccagcca ggaccacctc gtcctcgtgg cggggcagca ggagcggtag 11400 aaaggggtcc gatgtttgag gaggccctta agggaagcta ctgaattata acacgtaaga 11460 aaatcaccat tccgtattgg ttgggggctc ctgtttctca tcctagcttt ttcctggaaa 11520 gcccgctaga aggtttggga acgaggggaa agttctcaga actgttggct gctccccacc 11580 cgcctcccgc ctcccccgca ggttatgtca gcagctctga gacagcagta tcacaggcca 11640 gatgttgttc ctggctagat gtttacattt gtaagaaata acactgtgaa tgtaaaacag 11700 agccattccc ttggaatgca tatcgctggg ctcaacatag agtttgtctt cctcttgttt 11760 acgacgtgat ctaaaccagt ccttagcaag gggctcagaa caccccgctc tggcagtagg 11820 tgtcccccac ccccaaagac ctgcctgtgt gctccggaga tgaatatgag ctcattagta 11880 aaaatgactt cacccacgca tatacataaa gtatccatgc atgtgcatat agacacatct 11940 ataattttac acacacacct ctcaagacgg agatgcatgg cctctaagag tgcccgtgtc 12000 ggttcttcct ggaagttgac tttccttaga cccgccaggt caagttagcc gcgtgacgga 12060 catccaggcg tgggacgtgg tcagggcagg gctcattcat tgcccactag gatcccactg 12120 gcgaagatgg tctccatatc agctctctgc agaagggagg aagactttat catgttccta 12180 aaaatctgtg gcaagcaccc atcgtattat ccaaattttg ttgcaaatgt gattaatttg 12240 gttgtcaagt tttgggggtg ggctgtgggg agattgcttt tgttttcctg ctggtaatat 12300 cgggaaagat tttaatgaaa ccagggtaga attgtttggc aatgcactga agcgtgtttc 12360 tttcccaaaa tgtgcctccc ttccgctgcg ggcccagctg agtctatgta ggtgatgttt 12420 ccagctgcca agtgctcttt gttactgtcc accctcattt ctgccagcgc atgtgtcctt 12480 tcaaggggaa aatgtgaagc tgaaccccct ccagacaccc agaatgtagc atctgagaag 12540 gccctgtgcc ctaaaggaca cccctcgccc ccatcttcat ggagggggtc atttcagagc 12600 cctcggagcc aatgaacagc tcctcctctt ggagctgaga tgagccccac gtggagctcg 12660 ggacggatag tagacagcaa taactcggtg tgtggccgcc tggcaggtgg aacttcctcc 12720 cgttgcgggg tggagtgagg ttagttctgt gtgtctggtg ggtggagtca ggcttctctt 12780 gctacctgtg agcatccttc ccagcagaca tcctcatcgg gctttgtccc tcccccgctt 12840 cctccctctg cggggaggac ccgggaccac agctgctggc cagggtagac ttggagctgt 12900 cctccagagg ggtcacgtgt aggagtgaga agaaggaaga tcttgagagc tgctgaggga 12960 ccttggagag ctcaggatgg ctcagacgag gacactcgct tgccgggcct gggcctcctg 13020 ggaaggaggg agctgctcag aatgccgcat gacaactgaa ggcaacctgg aaggttcagg 13080 ggccgctctt cccccatgtg cctgtcacgc tctggtgcag tcaaaggaac gccttcccct 13140 cagttgtttc taagagcaga gtctcccgct gcaatctggg tggtaactgc cagccttgga 13200 ggatcgtggc caacgtggac ctgcctacgg agggtgggct ctgacccaag tggggcctcc 13260 ttgtccaggt ctcactgctt tgcaccgtgg tcagagggac tgtcagctga gcttgagctc 13320 ccctggagcc agcagggctg tgatgggcga gtcccggagc cccacccaga cctgaatgct 13380 tctgagagca aagggaagga ctgacgagag atgtatattt aattttttaa ctgctgcaaa 13440 cattgtacat ccaaattaaa ggaaaaaaat ggaaaccatc a 13481 <210> 2 <211> 259 <212> DNA <213> Homo sapiens <220> <223> fragment of huntingtin (HTT) mRNA (residues 1 to 259) <400> 2 gctgccggga cgggtccaag atggacggcc gctcaggttc tgcttttacc tgcggcccag 60 agccccattc attgccccgg tgctgagcgg cgccgcgagt cggcccgagg cctccgggga 120 ctgccgtgcc gggcgggaga ccgccatggc gaccctggaa aagctgatga aggccttcga 180 gtccctcaag tccttccagc agcagcagca gcagcagcag cagcagcagc agcagcagca 240 gcagcagcag cagcagcag 259 <210> 3 <211> 31 <212> DNA <213> Homo sapiens <220> <223> fragment of huntingtin (HTT) mRNA (residues 181 to 211) <400> 3 gtccctcaag tccttccagc agcagcagca g 31 <210> 4 <211> 1266 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> pCLS9303 <400> 4 atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac 60 gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc 120 aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt 180 acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc 240 aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc 300 ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg 360 agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc 420 gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac 480 ttgaccggag acgcccgggg gatcaggtca cgtgcgtctc ggagcattgt tgcccagtta 540 tctcgccctg atccggcgtt ggccgcgttg accaacgacc acctcgtcgc cttggcctgc 600 ctcggcgggc gtcctgcgct ggatgcagtg aaaaagggat tgggggatcc tatcagccgt 660 tcccagctgg tgaagtccga gctggaggag aagaaatccg agttgaggca caagctgaag 720 tacgtgcccc acgagtacat cgagctgatc gagatcgccc ggaacagcac ccaggaccgt 780 atcctggaga tgaaggtgat ggagttcttc atgaaggtgt acggctacag gggcaagcac 840 ctgggcggct ccaggaagcc cgacggcgcc atctacaccg tgggctcccc catcgactac 900 ggcgtgatcg tggacaccaa ggcctactcc ggcggctaca acctgcccat cggccaggcc 960 gacgaaatgc agaggtacgt ggaggagaac cagaccagga acaagcacat caaccccaac 1020 gagtggtgga aggtgtaccc ctccagcgtg accgagttca agttcctgtt cgtgtccggc 1080 cacttcaagg gcaactacaa ggcccagctg accaggctga accacatcac caactgcaac 1140 ggcgccgtgc tgtccgtgga ggagctcctg atcggcggcg agatgatcaa ggccggcacc 1200 ctgaccctgg aggaggtgag gaggaagttc aacaacggcg agatcaactt cgcggccgac 1260 tgataa 1266 <210> 5 <211> 1284 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> pCLS9312 <400> 5 atgggcgatc ctaaaaagaa acgtaaggtc atcgataagg agaccgccgc tgccaagttc 60 gagagacagc acatggacag catcgatatc gccgatctac gcacgctcgg ctacagccag 120 cagcaacagg agaagatcaa accgaaggtt cgttcgacag tggcgcagca ccacgaggca 180 ctggtcggcc acgggtttac acacgcgcac atcgttgcgt taagccaaca cccggcagcg 240 ttagggaccg tcgctgtcaa gtatcaggac atgatcgcag cgttgccaga ggcgacacac 300 gaagcgatcg ttggcgtcgg caaacagtgg tccggcgcac gcgctctgga ggccttgctc 360 acggtggcgg gagagttgag aggtccaccg ttacagttgg acacaggcca acttctcaag 420 attgcaaaac gtggcggcgt gaccgcagtg gaggcagtgc atgcatggcg caatgcactg 480 acgggtgccc cgctcaactt gaccggagac gcccggggga tcaggtcacg tgcgtctcgg 540 agcattgttg cccagttatc tcgccctgat ccggcgttgg ccgcgttgac caacgaccac 600 ctcgtcgcct tggcctgcct cggcgggcgt cctgcgctgg atgcagtgaa aaagggattg 660 ggggatccta tcagccgttc ccagctggtg aagtccgagc tggaggagaa gaaatccgag 720 ttgaggcaca agctgaagta cgtgccccac gagtacatcg agctgatcga gatcgcccgg 780 aacagcaccc aggaccgtat cctggagatg aaggtgatgg agttcttcat gaaggtgtac 840 ggctacaggg gcaagcacct gggcggctcc aggaagcccg acggcgccat ctacaccgtg 900 ggctccccca tcgactacgg cgtgatcgtg gacaccaagg cctactccgg cggctacaac 960 ctgcccatcg gccaggccga cgaaatgcag aggtacgtgg aggagaacca gaccaggaac 1020 aagcacatca accccaacga gtggtggaag gtgtacccct ccagcgtgac cgagttcaag 1080 ttcctgttcg tgtccggcca cttcaagggc aactacaagg cccagctgac caggctgaac 1140 cacatcacca actgcaacgg cgccgtgctg tccgtggagg agctcctgat cggcggcgag 1200 atgatcaagg ccggcaccct gaccctggag gaggtgagga ggaagttcaa caacggcgag 1260 atcaacttcg cggccgactg ataa 1284 <210> 6 <211> 530 <212> PRT <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polypeptide <220> <223> TiFLAN <400> 6 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 1 5 10 15 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 20 25 30 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 35 40 45 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 50 55 60 Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 65 70 75 80 Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 85 90 95 Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 100 105 110 Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 115 120 125 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 130 135 140 Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 145 150 155 160 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 165 170 175 Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val 180 185 190 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 195 200 205 Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 210 215 220 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 225 230 235 240 Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala 245 250 255 Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly 260 265 270 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 275 280 285 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 290 295 300 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 305 310 315 320 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 325 330 335 Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 340 345 350 Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val 355 360 365 Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 370 375 380 Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 385 390 395 400 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 405 410 415 Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 420 425 430 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 435 440 445 Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val 450 455 460 Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 465 470 475 480 Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 485 490 495 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 500 505 510 Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 515 520 525 Leu Glu 530 <210> 7 <211> 530 <212> PRT <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polypeptide <220> <223> TiFLAN2_T01.1 <400> 7 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 1 5 10 15 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 20 25 30 His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 35 40 45 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 50 55 60 Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 65 70 75 80 Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 85 90 95 Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 100 105 110 Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 115 120 125 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 130 135 140 Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 145 150 155 160 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 165 170 175 Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 180 185 190 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 195 200 205 Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 210 215 220 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 225 230 235 240 Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 245 250 255 Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 260 265 270 Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 275 280 285 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 290 295 300 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 305 310 315 320 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 325 330 335 Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 340 345 350 Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 355 360 365 Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 370 375 380 Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 385 390 395 400 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 405 410 415 Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 420 425 430 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 435 440 445 Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 450 455 460 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 465 470 475 480 Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu 485 490 495 Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 500 505 510 Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 515 520 525 Leu Glu 530 <210> 8 <211> 936 <212> PRT <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polypeptide <220> <223> pCLS9984 <400> 8 Met Gly Asp Pro Lys Lys Lys Arg Lys Val Ile Asp Tyr Pro Tyr Asp 1 5 10 15 Val Pro Asp Tyr Ala Ile Asp Ile Ala Asp Leu Arg Thr Leu Gly Tyr 20 25 30 Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val 35 40 45 Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His 50 55 60 Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val 65 70 75 80 Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala 85 90 95 Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala 100 105 110 Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp 115 120 125 Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val 130 135 140 Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn 145 150 155 160 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 165 170 175 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 180 185 190 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 195 200 205 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 210 215 220 Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 225 230 235 240 Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 245 250 255 Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 260 265 270 Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 275 280 285 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 290 295 300 Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 305 310 315 320 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 325 330 335 Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val 340 345 350 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 355 360 365 Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 370 375 380 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 385 390 395 400 Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala 405 410 415 Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly 420 425 430 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 435 440 445 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 450 455 460 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 465 470 475 480 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 485 490 495 Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 500 505 510 Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val 515 520 525 Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 530 535 540 Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 545 550 555 560 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 565 570 575 Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 580 585 590 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 595 600 605 Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val 610 615 620 Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 625 630 635 640 Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 645 650 655 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 660 665 670 Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 675 680 685 Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro Ala Leu Ala 690 695 700 Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu Gly Gly Arg 705 710 715 720 Pro Ala Leu Asp Ala Val Lys Lys Gly Leu Gly Asp Pro Ile Ser Arg 725 730 735 Ser Gln Leu Val Lys Ser Glu Leu Glu Glu Lys Lys Ser Glu Leu Arg 740 745 750 His Lys Leu Lys Tyr Val Pro His Glu Tyr Ile Glu Leu Ile Glu Ile 755 760 765 Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu Glu Met Lys Val Met Glu 770 775 780 Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly Lys His Leu Gly Gly Ser 785 790 795 800 Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val Gly Ser Pro Ile Asp Tyr 805 810 815 Gly Val Ile Val Asp Thr Lys Ala Tyr Ser Gly Gly Tyr Asn Leu Pro 820 825 830 Ile Gly Gln Ala Asp Glu Met Gln Arg Tyr Val Glu Glu Asn Gln Thr 835 840 845 Arg Asn Lys His Ile Asn Pro Asn Glu Trp Trp Lys Val Tyr Pro Ser 850 855 860 Ser Val Thr Glu Phe Lys Phe Leu Phe Val Ser Gly His Phe Lys Gly 865 870 875 880 Asn Tyr Lys Ala Gln Leu Thr Arg Leu Asn His Ile Thr Asn Cys Asn 885 890 895 Gly Ala Val Leu Ser Val Glu Glu Leu Leu Ile Gly Gly Glu Met Ile 900 905 910 Lys Ala Gly Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn 915 920 925 Gly Glu Ile Asn Phe Ala Ala Asp 930 935 <210> 9 <211> 2814 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> pCLS9984 <400> 9 atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac 60 gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc 120 aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt 180 acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc 240 aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc 300 ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg 360 agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc 420 gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac 480 ttgacccccc agcaggtggt ggccatcgcc agcaatggcg gtggcaagca ggcgctggag 540 acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg 600 gtggccatcg ccagccacga tggcggcaag caggcgctgg agacggtcca gcggctgttg 660 ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat 720 aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc 780 cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg 840 ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccccag 900 caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg 960 ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc agcaggtggt ggccatcgcc 1020 agcaataatg gtggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc 1080 caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag 1140 caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc 1200 ccggagcagg tggtggccat cgccagcaat attggtggca agcaggcgct ggagacggtg 1260 caggcgctgt tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc 1320 atcgccagca ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg 1380 ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc 1440 ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc 1500 ttgaccccgg agcaggtggt ggccatcgcc agcaatattg gtggcaagca ggcgctggag 1560 acggtgcagg cgctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg 1620 gtggccatcg ccagcaataa tggtggcaag caggcgctgg agacggtcca gcggctgttg 1680 ccggtgctgt gccaggccca cggcttgacc ccggagcagg tggtggccat cgccagccac 1740 gatggcggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc 1800 cacggcttga ccccggagca ggtggtggcc atcgccagca atattggtgg caagcaggcg 1860 ctggagacgg tgcaggcgct gttgccggtg ctgtgccagg cccacggctt gaccccccag 1920 caggtggtgg ccatcgccag caataatggt ggcaagcagg cgctggagac ggtccagcgg 1980 ctgttgccgg tgctgtgcca ggcccacggc ttgacccctc agcaggtggt ggccatcgcc 2040 agcaatggcg gcggcaggcc ggcgctggag agcattgttg cccagttatc tcgccctgat 2100 ccggcgttgg ccgcgttgac caacgaccac ctcgtcgcct tggcctgcct cggcgggcgt 2160 cctgcgctgg atgcagtgaa aaagggattg ggggatccta tcagccgttc ccagctggtg 2220 aagtccgagc tggaggagaa gaaatccgag ttgaggcaca agctgaagta cgtgccccac 2280 gagtacatcg agctgatcga gatcgcccgg aacagcaccc aggaccgtat cctggagatg 2340 aaggtgatgg agttcttcat gaaggtgtac ggctacaggg gcaagcacct gggcggctcc 2400 aggaagcccg acggcgccat ctacaccgtg ggctccccca tcgactacgg cgtgatcgtg 2460 gacaccaagg cctactccgg cggctacaac ctgcccatcg gccaggccga cgaaatgcag 2520 aggtacgtgg aggagaacca gaccaggaac aagcacatca accccaacga gtggtggaag 2580 gtgtacccct ccagcgtgac cgagttcaag ttcctgttcg tgtccggcca cttcaagggc 2640 aactacaagg cccagctgac caggctgaac cacatcacca actgcaacgg cgccgtgctg 2700 tccgtggagg agctcctgat cggcggcgag atgatcaagg ccggcaccct gaccctggag 2760 gaggtgagga ggaagttcaa caacggcgag atcaacttcg cggccgactg ataa 2814 <210> 10 <211> 936 <212> PRT <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polypeptide <220> <223> pCLS16715 <400> 10 Met Gly Asp Pro Lys Lys Lys Arg Lys Val Ile Asp Tyr Pro Tyr Asp 1 5 10 15 Val Pro Asp Tyr Ala Ile Asp Ile Ala Asp Leu Arg Thr Leu Gly Tyr 20 25 30 Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val 35 40 45 Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His 50 55 60 Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val 65 70 75 80 Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala 85 90 95 Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala 100 105 110 Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp 115 120 125 Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val 130 135 140 Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn 145 150 155 160 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 165 170 175 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 180 185 190 His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 195 200 205 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 210 215 220 Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 225 230 235 240 Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 245 250 255 Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 260 265 270 Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 275 280 285 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 290 295 300 Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 305 310 315 320 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 325 330 335 Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 340 345 350 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 355 360 365 Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 370 375 380 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 385 390 395 400 Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 405 410 415 Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 420 425 430 Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 435 440 445 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 450 455 460 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 465 470 475 480 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 485 490 495 Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 500 505 510 Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 515 520 525 Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 530 535 540 Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu 545 550 555 560 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 565 570 575 Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 580 585 590 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 595 600 605 Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 610 615 620 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 625 630 635 640 Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu 645 650 655 Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 660 665 670 Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 675 680 685 Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro Ala Leu Ala 690 695 700 Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu Gly Gly Arg 705 710 715 720 Pro Ala Leu Asp Ala Val Lys Lys Gly Leu Gly Asp Pro Ile Ser Arg 725 730 735 Ser Gln Leu Val Lys Ser Glu Leu Glu Glu Lys Lys Ser Glu Leu Arg 740 745 750 His Lys Leu Lys Tyr Val Pro His Glu Tyr Ile Glu Leu Ile Glu Ile 755 760 765 Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu Glu Met Lys Val Met Glu 770 775 780 Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly Lys His Leu Gly Gly Ser 785 790 795 800 Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val Gly Ser Pro Ile Asp Tyr 805 810 815 Gly Val Ile Val Asp Thr Lys Ala Tyr Ser Gly Gly Tyr Asn Leu Pro 820 825 830 Ile Gly Gln Ala Asp Glu Met Gln Arg Tyr Val Glu Glu Asn Gln Thr 835 840 845 Arg Asn Lys His Ile Asn Pro Asn Glu Trp Trp Lys Val Tyr Pro Ser 850 855 860 Ser Val Thr Glu Phe Lys Phe Leu Phe Val Ser Gly His Phe Lys Gly 865 870 875 880 Asn Tyr Lys Ala Gln Leu Thr Arg Leu Asn His Ile Thr Asn Cys Asn 885 890 895 Gly Ala Val Leu Ser Val Glu Glu Leu Leu Ile Gly Gly Glu Met Ile 900 905 910 Lys Ala Gly Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn 915 920 925 Gly Glu Ile Asn Phe Ala Ala Asp 930 935 <210> 11 <211> 2814 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> pCLS16715 <400> 11 atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac 60 gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc 120 aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt 180 acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc 240 aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc 300 ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg 360 agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc 420 gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac 480 ttgacccccc agcaggtggt ggccatcgcc agcaataatg gtggcaagca ggcgctggag 540 acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg 600 gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg 660 ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat 720 aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc 780 cacggcttga ccccggagca ggtggtggcc atcgccagca atattggtgg caagcaggcg 840 ctggagacgg tgcaggcgct gttgccggtg ctgtgccagg cccacggctt gaccccccag 900 caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg 960 ctgttgccgg tgctgtgcca ggcccacggc ttgaccccgg agcaggtggt ggccatcgcc 1020 agccacgatg gcggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc 1080 caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag 1140 caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc 1200 ccggagcagg tggtggccat cgccagccac gatggcggca agcaggcgct ggagacggtc 1260 cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccggagca ggtggtggcc 1320 atcgccagcc acgatggcgg caagcaggcg ctggagacgg tccagcggct gttgccggtg 1380 ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc 1440 ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc 1500 ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag 1560 acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg 1620 gtggccatcg ccagcaatat tggtggcaag caggcgctgg agacggtgca ggcgctgttg 1680 ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat 1740 aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc 1800 cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg 1860 ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccggag 1920 caggtggtgg ccatcgccag caatattggt ggcaagcagg cgctggagac ggtgcaggcg 1980 ctgttgccgg tgctgtgcca ggcccacggc ttgacccctc agcaggtggt ggccatcgcc 2040 agcaatggcg gcggcaggcc ggcgctggag agcattgttg cccagttatc tcgccctgat 2100 ccggcgttgg ccgcgttgac caacgaccac ctcgtcgcct tggcctgcct cggcgggcgt 2160 cctgcgctgg atgcagtgaa aaagggattg ggggatccta tcagccgttc ccagctggtg 2220 aagtccgagc tggaggagaa gaaatccgag ttgaggcaca agctgaagta cgtgccccac 2280 gagtacatcg agctgatcga gatcgcccgg aacagcaccc aggaccgtat cctggagatg 2340 aaggtgatgg agttcttcat gaaggtgtac ggctacaggg gcaagcacct gggcggctcc 2400 aggaagcccg acggcgccat ctacaccgtg ggctccccca tcgactacgg cgtgatcgtg 2460 gacaccaagg cctactccgg cggctacaac ctgcccatcg gccaggccga cgaaatgcag 2520 aggtacgtgg aggagaacca gaccaggaac aagcacatca accccaacga gtggtggaag 2580 gtgtacccct ccagcgtgac cgagttcaag ttcctgttcg tgtccggcca cttcaagggc 2640 aactacaagg cccagctgac caggctgaac cacatcacca actgcaacgg cgccgtgctg 2700 tccgtggagg agctcctgat cggcggcgag atgatcaagg ccggcaccct gaccctggag 2760 gaggtgagga ggaagttcaa caacggcgag atcaacttcg cggccgactg ataa 2814 <210> 12 <211> 69 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> TIFLAN <400> 12 tctcaagatt tcgctgcagc agcagcagca gcagcagcag cagcagcagc agcagcagca 60 gcagcagca 69 <210> 13 <211> 46 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> TIFLAN2 <400> 13 tgtgatcccc ccagcagcag cagcagcagc agcagcagca gcagca 46 <210> 14 <211> 530 <212> PRT <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polypeptide <220> <223> TiCAG <400> 14 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 1 5 10 15 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 20 25 30 His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly 35 40 45 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 50 55 60 Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 65 70 75 80 Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 85 90 95 Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 100 105 110 Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 115 120 125 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 130 135 140 Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 145 150 155 160 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 165 170 175 Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val 180 185 190 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 195 200 205 Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 210 215 220 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 225 230 235 240 Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 245 250 255 Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 260 265 270 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 275 280 285 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 290 295 300 His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly 305 310 315 320 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 325 330 335 Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 340 345 350 Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 355 360 365 Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 370 375 380 Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 385 390 395 400 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 405 410 415 Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 420 425 430 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 435 440 445 Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 450 455 460 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 465 470 475 480 Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 485 490 495 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 500 505 510 Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg Pro Ala 515 520 525 Leu Glu 530 <210> 15 <211> 942 <212> PRT <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polypeptide <220> <223> pCLS9996 <400> 15 Met Gly Asp Pro Lys Lys Lys Arg Lys Val Ile Asp Lys Glu Thr Ala 1 5 10 15 Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser Ile Asp Ile Ala Asp 20 25 30 Leu Arg Thr Leu Gly Tyr Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro 35 40 45 Lys Val Arg Ser Thr Val Ala Gln His His Glu Ala Leu Val Gly His 50 55 60 Gly Phe Thr His Ala His Ile Val Ala Leu Ser Gln His Pro Ala Ala 65 70 75 80 Leu Gly Thr Val Ala Val Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro 85 90 95 Glu Ala Thr His Glu Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly 100 105 110 Ala Arg Ala Leu Glu Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly 115 120 125 Pro Pro Leu Gln Leu Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg 130 135 140 Gly Gly Val Thr Ala Val Glu Ala Val His Ala Trp Arg Asn Ala Leu 145 150 155 160 Thr Gly Ala Pro Leu Asn Leu Thr Pro Gln Gln Val Val Ala Ile Ala 165 170 175 Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 180 185 190 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 195 200 205 Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 210 215 220 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val 225 230 235 240 Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val 245 250 255 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 260 265 270 Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu 275 280 285 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 290 295 300 Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 305 310 315 320 Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 325 330 335 Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 340 345 350 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 355 360 365 His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly 370 375 380 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 385 390 395 400 Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His 405 410 415 Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 420 425 430 Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala 435 440 445 Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 450 455 460 Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala 465 470 475 480 Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 485 490 495 Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 500 505 510 Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 515 520 525 Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln 530 535 540 Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 545 550 555 560 Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 565 570 575 Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala 580 585 590 Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 595 600 605 Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 610 615 620 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 625 630 635 640 His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 645 650 655 Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 660 665 670 Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn 675 680 685 Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg 690 695 700 Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu 705 710 715 720 Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu 725 730 735 Gly Asp Pro Ile Ser Arg Ser Gln Leu Val Lys Ser Glu Leu Glu Glu 740 745 750 Lys Lys Ser Glu Leu Arg His Lys Leu Lys Tyr Val Pro His Glu Tyr 755 760 765 Ile Glu Leu Ile Glu Ile Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu 770 775 780 Glu Met Lys Val Met Glu Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly 785 790 795 800 Lys His Leu Gly Gly Ser Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val 805 810 815 Gly Ser Pro Ile Asp Tyr Gly Val Ile Val Asp Thr Lys Ala Tyr Ser 820 825 830 Gly Gly Tyr Asn Leu Pro Ile Gly Gln Ala Asp Glu Met Gln Arg Tyr 835 840 845 Val Glu Glu Asn Gln Thr Arg Asn Lys His Ile Asn Pro Asn Glu Trp 850 855 860 Trp Lys Val Tyr Pro Ser Ser Val Thr Glu Phe Lys Phe Leu Phe Val 865 870 875 880 Ser Gly His Phe Lys Gly Asn Tyr Lys Ala Gln Leu Thr Arg Leu Asn 885 890 895 His Ile Thr Asn Cys Asn Gly Ala Val Leu Ser Val Glu Glu Leu Leu 900 905 910 Ile Gly Gly Glu Met Ile Lys Ala Gly Thr Leu Thr Leu Glu Glu Val 915 920 925 Arg Arg Lys Phe Asn Asn Gly Glu Ile Asn Phe Ala Ala Asp 930 935 940 <210> 16 <211> 2832 <212> DNA <213> Artificial sequence <220> <223> Description of artificial sequence: Synthetic polynucleotide <220> <223> pCLS9996 <400> 16 atgggcgatc ctaaaaagaa acgtaaggtc atcgataagg agaccgccgc tgccaagttc 60 gagagacagc acatggacag catcgatatc gccgatctac gcacgctcgg ctacagccag 120 cagcaacagg agaagatcaa accgaaggtt cgttcgacag tggcgcagca ccacgaggca 180 ctggtcggcc acgggtttac acacgcgcac atcgttgcgt taagccaaca cccggcagcg 240 ttagggaccg tcgctgtcaa gtatcaggac atgatcgcag cgttgccaga ggcgacacac 300 gaagcgatcg ttggcgtcgg caaacagtgg tccggcgcac gcgctctgga ggccttgctc 360 acggtggcgg gagagttgag aggtccaccg ttacagttgg acacaggcca acttctcaag 420 attgcaaaac gtggcggcgt gaccgcagtg gaggcagtgc atgcatggcg caatgcactg 480 acgggtgccc cgctcaactt gaccccccag caggtggtgg ccatcgccag caataatggt 540 ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc 600 ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag 660 acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg 720 gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg 780 ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat 840 aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc 900 cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg 960 ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccccag 1020 caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg 1080 ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc agcaggtggt ggccatcgcc 1140 agcaataatg gtggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc 1200 caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag 1260 caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc 1320 ccccagcagg tggtggccat cgccagcaat ggcggtggca agcaggcgct ggagacggtc 1380 cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc 1440 atcgccagca ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg 1500 ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc 1560 ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc 1620 ttgacccccc agcaggtggt ggccatcgcc agcaatggcg gtggcaagca ggcgctggag 1680 acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg 1740 gtggccatcg ccagcaataa tggtggcaag caggcgctgg agacggtcca gcggctgttg 1800 ccggtgctgt gccaggccca cggcttgacc ccggagcagg tggtggccat cgccagccac 1860 gatggcggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc 1920 cacggcttga ccccccagca ggtggtggcc atcgccagca atggcggtgg caagcaggcg 1980 ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gacccctcag 2040 caggtggtgg ccatcgccag caatggcggc ggcaggccgg cgctggagag cattgttgcc 2100 cagttatctc gccctgatcc ggcgttggcc gcgttgacca acgaccacct cgtcgccttg 2160 gcctgcctcg gcgggcgtcc tgcgctggat gcagtgaaaa agggattggg ggatcctatc 2220 agccgttccc agctggtgaa gtccgagctg gaggagaaga aatccgagtt gaggcacaag 2280 ctgaagtacg tgccccacga gtacatcgag ctgatcgaga tcgcccggaa cagcacccag 2340 gaccgtatcc tggagatgaa ggtgatggag ttcttcatga aggtgtacgg ctacaggggc 2400 aagcacctgg gcggctccag gaagcccgac ggcgccatct acaccgtggg ctcccccatc 2460 gactacggcg tgatcgtgga caccaaggcc tactccggcg gctacaacct gcccatcggc 2520 caggccgacg aaatgcagag gtacgtggag gagaaccaga ccaggaacaa gcacatcaac 2580 cccaacgagt ggtggaaggt gtacccctcc agcgtgaccg agttcaagtt cctgttcgtg 2640 tccggccact tcaagggcaa ctacaaggcc cagctgacca ggctgaacca catcaccaac 2700 tgcaacggcg ccgtgctgtc cgtggaggag ctcctgatcg gcggcgagat gatcaaggcc 2760 ggcaccctga ccctggagga ggtgaggagg aagttcaaca acggcgagat caacttcgcg 2820 gccgactgat aa 2832