본문 바로가기
HOME> 논문 > 논문 검색상세

학위논문 상세정보

Codon Based Encoding for Genome Sequence Analysis 원문보기

  • 저자

    Mst. Rokeya Reaz

  • 학위수여기관

    경희대학교

  • 학위구분

    국내석사

  • 학과

    컴퓨터공학과

  • 지도교수

  • 발행년도

    2014

  • 총페이지

    56 p.

  • 키워드

  • 언어

    eng

  • 원문 URL

    http://www.riss.kr/link?id=T13536605&outLink=K  

  • 초록

    Splice site identication in pre-mRNA is an important task for identifying new genes and their structures and functions. Gene identication also plays an important role in molecular biology for diagnosing disease and drug discovery. For this reason, much research has been performed to predict genes structure and function. In this thesis we propose a splice site prediction method using the RSCU (relative synonymous codon usage) values of introns and exons. Our method encodes DNA sequences with the RSCU values of 59 codons (leaving out three stop codons and two unique codons) to search for introns and exons. We then embed 118 dimensional feature vectors into the input vectors of binary SVM (support vector machine) to establish a learning model and classify test data. Furthermore, we extend our encoding approaches on nucleotide's chemical properties in the era of evolutionary relationship analysis on mtDNA sequence of 8 different species. Extensive performance study shows that our method can provide better performance than existing encoding methods based on several performance criteria such as classication accuracy, sensitivity (Sn), specicity (Sp), auROC (area under Receiver Operating Characteristics) and degree of similarity dissimilarity. Key words: RSCU, SVM, gene classication, splice site, HS3D, mtDNA, Hexag- onal Model i


 활용도 분석

  • 상세보기

    amChart 영역
  • 원문보기

    amChart 영역