Selenoprotein gene is very important in all the genomes, and searching the new selenoprotein gene from genome is significant for exploring the biological function of selenoprotein. In this paper, we introduce the structural characteres of the selenocysteine insertion sequences (SECIS), the bioinformatics methods of searching selenoprotein from genome and its research progress. Then, we search and verify the Selenoprotein gene in the Rat genome, and compare them with the Selenoprotein genes which been known. By this way, we confirmed the rationality of our method. Next, base on the above method, we disposal the rice genome and have gotten 1176 SECIS structures, analysis their 3'UTR and open read frames(ORF), and estimate their conservative character. In the end, we have gotten 15 sequences and their secondary structures from rice genome which maybe selenoprotein The main contents and results in this paper are as follow: 1. Download the selenoproteines of rat from the NCBI nucleotide database, analysis their SECIS elements and corresponding sequences with SECISearch2.19 program. Base on the conservative character of the 3'UTR region of Selenoprotein, Search the similar sequences in the genome database. We researched several rat Selenoproteines in this paper, including glutathione peroxide, Deiodinase, Thioredoxin reductase , SelM , SelP ,SelW and SPS2 Selenoprotein etc. With the consequence, we know there are some nucleotide sequences in mouse and other eukaryote genome which are similar to the Selenoprotein of rat. It tells us the Selenoprotein gene has high conservative character. 2. Search the SECIS element in the rice genome with the SECISearch2.19 program .and gain 1176 secondary structures from rice genome of type I and type II. Analysis their sequences and structures character and get their CDS regions. 3. Find the sequences accord with the standard one and standard two, and get their sequences of 3'UTR and ORF, analysis their conservative character with the Blast N program. Base on the consequences, we can conclude some sequences may be the Selenoprotein. 4. Search the ORF sequences. Normally, AUG codon is the start codon of the ORF and UAA,UAG,UGA are the stop codons as the DNA translation. UGA codon has a dual function in that it signals both the termination of protein synthesis and incorporation of the amino acid selenocysteine. With the Genetool bioinformatics program, we can find the ORF regions correspond to the SECIS elements and estimate the UGA codon in ORF is the incorporation codon or terminal codon.In the end, we have a conclusion that there are 15 Selenoprotein candidates in rice genome. |