NONHSAT041623
Annotated Information
Transcriptomic Nomenclature
Function
Regulation
Expression
Allelic Information and Variation
Evolution
Labs working on this lncRNA
References
Basic Information
Transcript ID | NONHSAT041623 |
Source | NONCODE4.0 |
Same with | , |
Classification | intergenic |
Length | 3197 nt |
Genomic location | chr15-:35473213..35512541 |
Exon number | 6 |
Exons | 35473213..35473692,35475352..35475454,35475505..35476202,35479650..35480640,35504992..35505452,35512076..35512541 |
Genome context | |
Sequence |
000001 CAAATCTACT ACACTAATCC AAAATATTAA TAGAGGAAAC TGTGGGGAGA GGTTGGGAGG TGATGTGGGA CCTCTTGGTA 000080
000081 TTTCCCACTC AACTTTTCTG TAAACGTAAA ACTGCCCTCC CAAAATAGTC TATTTTTAAA AAATAAAAGT AGTCAAAAAT 000160
000161 CTAAAAGAAA AAAATTATAG TAGACAGAGA TATATGATAT CACAAGTATA GTAAAATGTT AATAGTGGTA GTGTATGTGG 000240
000241 ATGTTCCTTA TTATACTCTG CTTTGTGTTT GACATTTTTC ATAATAAATG TGTAACAAAG TCATTGTGGG TAATATTAAA 000320
000321 AGAAAGCTTA GCATAGTAAG GTTCAGTTGA ACCTGTGGGA CTTAATTCTA ATTCCATTAT GGATTTCTCA TATGATTCTA 000400
000401 CTAAAAAATG TAACTTGATT TAACTACTTG AATTACTTAT CTAAATTATG AAACCCAGTT GGCCCTTGAA ATGCAAATAG 000480
000481 GAGATCATTT CCCCTGCCCT TAAGAAAGGT TGGAACATGA CAAACCTTGA ATCAGTCTTG GAAAGAATCA TGTAATTACA 000560
000561 GGAAAGTGTG AGAATTCCAG GCCTTACCTC AGACAGATAA GGACTAGCTA TACATGAAAC GCAGACTGTG GCTAACTTTA 000640
000641 ATTCTTTCCC AAATCAGATT TTCATGAGAC ATTTTCAGTT CTTTCAGTGA GGCAAATGAA TTTCTTTAAT ATTTTACATA 000720
000721 TCCATTGTTA AAATTCAATG CAAAAACAAC TCTAATGTCA AATATAAATT ATTTTGTTGT TAAGTTTAAT TTTGAATAAT 000800
000801 CTTCTTCTCT TTTCTTTGGC ATTAGAACAA ATTGCAATTA CTGAAGTCCA AAGAATTATA AAGCACTATT TAAGCATTTA 000880
000881 CATTAGATAT CAAAATTGAA GAATACAAAG ACTAGTTTAT TAATGGATGC ATTCCTTCTG TCTTGATTCT TCCCTTGGGG 000960
000961 TGGCCTGTCT GCATGCACAG TGGCCTACTA GCACTTGAGA AGTGAGCATG CACAGTGGGT TTACTGGAGT TGTACACATG 001040
001041 CTCACTTGAG GCATTTTTCC CTTACCCGTC CAATGTTCCT AGAGGAAGCT CATCCACCAT TTTGCCTCTT AGTGTGCATA 001120
001121 GGTGAGCCCA CTCAGCCAAC TCCCGAGATC TTATGAGGAA GCTGATAATC ACTAGTTTCA GATATTTCTC TCTATTGGGA 001200
001201 GAGTGCCTTT CCCTGGCACT GTCTGTGACC AATTATTATT TTAGAGAGAC AGTTAACAGC TGCCTGACCA TCACCTGATG 001280
001281 GTCATCTGAC ATTCCTAGTG CAGGGGGGAT GCCCTCTCCT GCCCTGCTCA TGTCTGACTA GCTACCTACT GTAGCAGCAC 001360
001361 CACATCTGGC CAATTTTTGT ATTTTTTGTA GAGACAGAGT TTCATCATGT TGCCCAGGCT GGTCTTGAAC TCCTGGACTC 001440
001441 AAGTGATCTG CCTGCCTTAG CCTCCCAAAG TGCTGGGATT ACAGGCATGA GCCACCTCAC CTGGCCTGAG ATGTTTGTGG 001520
001521 GGGTTTTGCT TTTGTTTTTG TAAATTAGAA ACAGGGTCTC TCTATGTTGG CAAGGCTGGT CTTGAACTTC TGGGCTCAAT 001600
001601 CAATCCTCCC ACCTCAGCCT CCCGAAGTGT TGGGATTACA GGCATGAGCC ATTGCAGCCA GCCTGAATTC TAATTCTAAT 001680
001681 TGGTAAAAAT TAGGACTGTG CCCTTGCCAT TCCCTTCTTT TAAACTTGTA ATTCAATTAC TTGGCATGAC TGGTCCTTAT 001760
001761 ATTATAAAAT ATACTAAATC AAAAATCATC ATAATAAAAG CAGCTCCTTG AGCAACTGGA CAAAAAAAGC AGCTGGGCAT 001840
001841 AAACACTAGT AAAGTTTAAA CAATTGTATT TGGCAAGTAT TCCCCAACGT ATAAACAATT ACCACTAAAA AAAATCTGGG 001920
001921 TGAATTCATA CTGATGATCA TCTCTCTAAC AGTGTTTTCT AACATCTAAG TACTACTTTT AGCCCTAGAA AAGGATTTTA 002000
002001 ATGTTTAGAA AAGAACATTT TAATAAAGCC TTGCTAATAT TGACATATAT AGGCATCTTA ACTTGATTGA AATTCTGTCT 002080
002081 TCCCAAAGAT GGTTCTATTT TAACATTGTA TATGAAGGAA GAAATCCTAG CCTAATAATT TGAAATTATG TTTTTTATAA 002160
002161 CAGAATTCAT TTGCTTTTAT TTTAACATCA TAATAAGTGT ATCACTTTGA TCCTTACACA GTATCTTTTA TCTGTTGTGT 002240
002241 AACAATCATG CTACTGAATG TTAGATATTA CATGACATAT GAAGGAAGTT TAACTTGAGC AGTTACTGAG TTCATCCAGA 002320
002321 CTTGAAGCCT CTGTTTGTCA AGAAGTTAAT ATTTTTAAAG ATACACTGAC TTTATAATCT GTAATTATTT TCTTATCAAT 002400
002401 TACATTTTTA TAGTTTTCTG ATTAGCTGGC ACTTGGAGCT TAGTGTTACC GTTTAGCTCT GCAGGGTCCT TTTAGACACT 002480
002481 ATTATTGTAA AGAAACATCA AAAATCAAAG TGAATTCTTA GTATTATACT TGAAGCAGTA GAGGCGCAAT TGCTGAAACA 002560
002561 GCAGCCAAGC ATTGTGCTGT AGAACTAACT CACAGCTTGG TGCCCAGTCC CTGCTGTGCT AATGAGCTTT AACCTGTTTC 002640
002641 ATTCTTACTA TGCTTACTGT TTAGATACTT TGCTTGCAAA TTAATTTTTA GCTGAGAGAA CACATGAAAG AGGATATTTT 002720
002721 TTTTTTTTAC TAAAAAGCAG AGTTTTTTTT TATAACTGAC AAGAGAATCT GAGGGCAAGA ATAGTGAACA CAGGGAGGAT 002800
002801 GGCTGCTATA TTTAGTGCCT GCGTTTACAT ATGTTGGCAC CAAATGAATA CCTTAACTAA AACAAAGTGA AAAGTTTGCT 002880
002881 GAAGCTTCCA GAGGGAGGCA AAGGAGAAAG AATGTTTAGG TGCTTGCTGG CACTAACACA AATGTAAACA CAGGCATGAG 002960
002961 TAATGTGACA ATGCCCAAAT ATAAGGCAAT CATCGAGACA CATCTGTAAA TACTACAAAA TGTTATTATT ATTTTAACTC 003040
003041 TGAGATGGGA AATAGTCCTA TCCCCATTTT ACAAATGAGA AAATTGAAAA GTTAAGTGAC TGCCCTAAGA GCTCAAAGCC 003120
003121 AATTGGTGGT GGAACCATAT TTTAAGCCTC AGTTGTCTGG CTAGTATCAG TGCCCTTAAT CAGTGGGAGT AGTAACT |
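The exon coordinates in the Basic Information table can be checked against the reported transcript length with a short script. The sketch below is illustrative only: the name `exon_lengths` is hypothetical, and reading each `start..end` pair as 1-based and inclusive is an assumption that this page does not confirm. Under that reading the six exons sum to 3199 nt, two more than the 3197 nt listed under Length, so the coordinate convention may be worth checking against NONCODE itself.

```python
# Exon spans copied from the "Exons" row of the Basic Information table.
# Assumption: coordinates are 1-based and inclusive (not stated on the page).
EXONS = (
    "35473213..35473692,35475352..35475454,35475505..35476202,"
    "35479650..35480640,35504992..35505452,35512076..35512541"
)


def exon_lengths(spec):
    """Return the length of each 'start..end' span, counting both endpoints."""
    lengths = []
    for span in spec.split(","):
        start, end = (int(x) for x in span.split(".."))
        lengths.append(end - start + 1)
    return lengths


lengths = exon_lengths(EXONS)
total = sum(lengths)
print(lengths)  # per-exon lengths in listed order
print(total)    # summed exon length under the inclusive assumption
```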
Predicted Small Protein
Name | NONHSAT041623_smProtein_926:997 |
Length | 24 |
Molecular weight | 2491.8191 |
Aromaticity | 0.0869565217391 |
Instability index | 87.0695652174 |
Isoelectric point | 5.92596435547 |
Runs | 5 |
Runs residual | 0.0103519668737 |
Runs probability | 0.0155272919979 |
Amino acid sequence | MHSFCLDSSLGVACLHAQWPTST |
Secondary structure | LLLEEELLLHHHHLLLLLLLLLL |
PRMN | - |
PiMo | - |
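The predicted peptide can be reproduced from the transcript sequence with a few lines of Python. This is a minimal sketch under stated assumptions, not the pipeline that produced the table: the `926:997` suffix in the Name field is read here as zero-based, inclusive offsets into the transcript (an inference from the sequence, not something the page states), the 72 nt in `ORF` are copied from sequence positions 927 to 998 in the numbered listing, and `translate` is a hypothetical helper that uses the standard genetic code. Under these assumptions the Length of 24 appears to count the stop codon, while the peptide itself has 23 residues, and the aromaticity is the F/W/Y fraction of those 23 residues.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Transcript positions 927..998 (1-based) copied from the Sequence field.
# Assumption: this is the region named by the "926:997" suffix.
ORF = (
    "ATGCATTCCTTCTGTCTTGATTCTTCCCTTGGGG"
    "TGGCCTGTCTGCATGCACAGTGGCCTACTAGCACTTGA"
)


def translate(nt):
    """Translate a nucleotide string codon by codon; '*' marks a stop."""
    codons = [nt[i:i + 3] for i in range(0, len(nt) - 2, 3)]
    return "".join(CODON_TABLE[c] for c in codons)


protein = translate(ORF)       # includes the trailing stop symbol
peptide = protein.rstrip("*")  # residues only
aromaticity = sum(peptide.count(r) for r in "FWY") / len(peptide)

print(peptide)
print(len(protein), len(peptide))
print(aromaticity)
```

Molecular weight, instability index, and isoelectric point are not recomputed here, because they depend on parameter tables that this page does not specify.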