Input sequence

Protein name Cleavage and polyadenylation specificity factor subunit 1
Organism Arabidopsis thaliana Length 1442
Disorder content 13.9% ProS content 5.6%
IDEAL NA UniProt Q9FGR0

Prediction

Order Disorder ProS BLAST:PDB BLAST:PDB BLAST:PDB RPS-BLAST:PDB RPS-BLAST:PDB RPS-BLAST:PDB RPS-BLAST:Pfam HMMER:Pfam SEG:LCR 0 1442 1-29 30-38 39-49 50-55 56-77 78-83 84-94 95-125 126-130 131-216 217-224 225-229 230-443 444-447 448-496 497-500 501-585 586-601 602-613 614-630 631-676 677-685 686-728 729-732 733-755 756-777 778-821 822-840 841-880 881-886 887-916 917-1055 1056-1071 1072-1321 1322-1329 1330-1442 1-29 39-49 56-77 95-125 131-216 230-443 501-585 602-613 631-676 686-728 733-755 778-821 841-880 917-1055 1072-1321 1330-1442 84-94 217-224 448-496 586-601 614-630 729-732 756-777 822-840 887-916 1056-1071 1322-1329 84-87 217-224 474-492 614-624 629-630 756-762 898-916 1056-1062 1068-1071 992-1436 240-449 495-662 942-1435 423-663 1-460 1097-1403 1079-1376 381-394 672-693 4a0aA(372-777) 5e-15 21.9% 4a0aA(156-341) 0.0003 23.81% 2b5lA(337-479) 4e-07 23.81% 2b5lA(645-1124) 5e-48 18.77% 2b5lA(268-480) 4e-14 20.16% 4a0aA(1-352) 8e-28 17.61% PF03178(12-300) 4e-39 35.06% PF03178 2e-55 194.7%

Sequence

MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV
VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA
VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF
PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN
LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV
IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA
SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS
VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQDT
IGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQSN
YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAADE
DEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGAR
ILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCTV
SISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGDI
YCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHELEYELNKNSEDNTSSKEI
KNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDSTKAENSLSSENPAALN
SSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGSRPGWCMLFR
ERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDNYWPVQKIPL
KATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSSDDLQRTYTV
EEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYVQG
EDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW
NGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLD
CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL
QMVSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAF
RQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSVGTS
FL

Region:1-1442
Length:1442aa
Label:Full sequence
Reset:click