Input sequence

Protein name Centromere protein F
Organism Homo sapiens Length 3210
Disorder content 33.5% ProS content 27.6%
IDEAL NA UniProt P49454

Prediction

Order Disorder ProS BLAST:PDB BLAST:PDB BLAST:PDB BLAST:PDB BLAST:PDB BLAST:PDB RPS-BLAST:PDB RPS-BLAST:PDB RPS-BLAST:PDB RPS-BLAST:PDB RPS-BLAST:Pfam RPS-BLAST:Pfam RPS-BLAST:Pfam RPS-BLAST:Pfam RPS-BLAST:Pfam RPS-BLAST:Pfam RPS-BLAST:Pfam HMMER:SCOP SEG:LCR COILS:coiledCoil 0 3210 1-19 20-138 139-163 164-191 192-272 273-501 502-515 516-706 707-717 718-766 767-824 825-873 874-884 885-996 997-1014 1015-1163 1164-1195 1196-1251 1252-1278 1279-1327 1328-1644 1645-1747 1748-1885 1886-1920 1921-1927 1928-1955 1956-1985 1986-2013 2014-2020 2021-2187 2188-2202 2203-2381 2382-2395 2396-2812 2813-2813 2814-2959 2960-3210 20-138 164-191 273-501 516-706 718-766 825-873 885-996 1015-1163 1196-1251 1279-1327 1645-1747 1886-1920 1928-1955 1986-2013 2021-2187 2203-2381 2396-2812 2814-2959 1-19 139-163 192-272 502-515 707-717 767-824 874-884 997-1014 1164-1195 1252-1278 1328-1644 1748-1885 1921-1927 1956-1985 2014-2020 2188-2202 2382-2395 2813-2813 2960-3210 1-19 139-163 192-194 200-211 218-228 233-239 502-515 707-717 767-824 874-884 997-1014 1164-1195 1252-1278 1328-1644 1748-1885 1921-1927 1956-1985 2014-2020 2188-2202 2382-2395 2813-2813 2960-2995 3028-3035 3047-3053 3069-3113 3189-3193 3204-3210 420-583 1000-1170 1881-2136 2567-2862 2375-2508 2112-2321 1992-2304 2396-2731 2596-2886 278-873 1-131 179-263 2409-2548 2227-2366 3065-3109 281-901 2611-2890 753-890 108-119 265-276 386-399 592-616 906-917 936-960 1220-1234 1471-1494 1567-1590 1678-1691 2157-2171 2322-2349 2519-2531 2899-2916 2934-2952 2995-3012 20-138 164-191 273-501 516-706 718-766 825-873 885-996 1015-1163 1196-1251 1279-1327 1645-1747 1886-1920 1928-1955 1986-2013 2021-2181 2203-2381 2396-2812 2814-2959 4tqlA(3-182) 2e-08 25.4% 4tqlA(1-156) 0.0002 22.81% 4tqlA(3-222) 0.0004 21.8% 1c1gA(4-284) 2e-07 22.15% 1c1gA(103-236) 1e-06 24.63% 1c1gA(1-189) 0.0002 21.8% 4rh7A(1594-1923) 2e-06 12.99% 4rh7A(1610-1923) 2e-05 12.2% 4rh7A(1610-1908) 0.0003 13.2% 4rh7A(1594-2204) 0.0009 13.47% PF10481(1-131) 3e-33 90.84% PF10481(112-194) 2e-11 58.82% PF10473(1-140) 2e-17 68.57% PF10473(1-140) 9e-17 75.71% PF10490(5-49) 8e-16 84.44% PF12128(238-845) 8e-06 19.05% PF12128(221-487) 4e-05 23.57% a.7.1 0.00086 15.1%

Sequence

MSWALEEWKEGLPTRALQKIQELEGQLDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKT
EGTNLKRENQRLMEICESLEKTKQKISHELQVKESQVNFQEGQLNSGKKQIEKLEQELKR
CKSELERSQQAAQSADVSLNPCNTPQKIFTTPLTPSQYYSGSKYEDLKEKYNKEVEERKR
LEAEVKALQAKKASQTLPQATMNHRDIARHQASSSVFSWQQEKTPSHLSSNSQRTPIRRD
FSASYFSGEQEVTPSRSTLQIGKRDANSSFFDNSSSPHLLDQLKAQNQELRNKINELELR
LQGHEKEMKGQVNKFQELQLQLEKAKVELIEKEKVLNKCRDELVRTTAQYDQASTKYTAL
EQKLKKLTEDLSCQRQNAESARCSLEQKIKEKEKEFQEELSRQQRSFQTLDQECIQMKAR
LTQELQQAKNMHNVLQAELDKLTSVKQQLENNLEEFKQKLCRAEQAFQASQIKENELRRS
MEEMKKENNLLKSHSEQKAREVCHLEAELKNIKQCLNQSQNFAEEMKAKNTSQETMLRDL
QEKINQQENSLTLEKLKLAVADLEKQRDCSQDLLKKREHHIEQLNDKLSKTEKESKALLS
ALELKKKEYEELKEEKTLFSCWKSENEKLLTQMESEKENLQSKINHLETCLKTQQIKSHE
YNERVRTLEMDRENLSVEIRNLHNVLDSKSVEVETQKLAYMELQQKAEFSDQKHQKEIEN
MCLKTSQLTGQVEDLEHKLQLLSNEIMDKDRCYQDLHAEYESLRDLLKSKDASLVTNEDH
QRSLLAFDQQPAMHHSFANIIGEQGSMPSERSECRLEADQSPKNSAILQNRVDSLEFSLE
SQKQMNSDLQKQCEELVQIKGEIEENLMKAEQMHQSFVAETSQRISKLQEDTSAHQNVVA
ETLSALENKEKELQLLNDKVETEQAEIQELKKSNHLLEDSLKELQLLSETLSLEKKEMSS
IISLNKREIEELTQENGTLKEINASLNQEKMNLIQKSESFANYIDEREKSISELSDQYKQ
EKLILLQRCEETGNAYEDLSQKYKAAQEKNSKLECLLNECTSLCENRKNELEQLKEAFAK
EHQEFLTKLAFAEERNQNLMLELETVQQALRSEMTDNQNNSKSEAGGLKQEIMTLKEEQN
KMQKEVNDLLQENEQLMKVMKTKHECQNLESEPIRNSVKERESERNQCNFKPQMDLEVKE
ISLDSYNAQLVQLEAMLRNKELKLQESEKEKECLQHELQTIRGDLETSNLQDMQSQEISG
LKDCEIDAEEKYISGPHELSTSQNDNAHLQCSLQTTMNKLNELEKICEILQAEKYELVTE
LNDSRSECITATRKMAEEVGKLLNEVKILNDDSGLLHGELVEDIPGGEFGEQPNEQHPVS
LAPLDESNSYEHLTLSDKEVQMHFAELQEKFLSLQSEHKILHDQHCQMSSKMSELQTYVD
SLKAENLVLSTNLRNFQGDLVKEMQLGLEEGLVPSLSSSCVPDSSSLSSLGDSSFYRALL
EQTGDMSLLSNLEGAVSANQCSVDEVFCSSLQTYVDSLKAENLVLSTNLRNFQGDLVKEM
QLGLEEGLVPSLSSSCVPDSSSLSSLGDSSFYRALLEQTGDMSLLSNLEGVVSANQCSVD
EVFCSSLQEENLTRKETPSAPAKGVEELESLCEVYRQSLEKLEEKMESQGIMKNKEIQEL
EQLLSSERQELDCLRKQYLSENEQWQQKLTSVTLEMESKLAAEKKQTEQLSLELEVARLQ
LQGLDLSSRSLLGIDTEDAIQGRNESCDISKEHTSETTERTPKHDVHQICDKDAQQDLNL
DIEKITETGAVKPTGECSGEQSPDTNYEPPGEDKTQGSSECISELSFSGPNALVPMDFLG
NQEDIHNLQLRVKETSNENLRLLHVIEDRDRKVESLLNEMKELDSKLHLQEVQLMTKIEA
CIELEKIVGELKKENSDLSEKLEYFSCDHQELLQRVETSEGLNSDLEMHADKSSREDIGD
NVAKVNDSWKERFLDVENELSRIRSEKASIEHEALYLEADLEVVQTEKLCLEKDNENKQK
VIVCLEEELSVVTSERNQLRGELDTMSKKTTALDQLSEKMKEKTQELESHQSECLHCIQV
AEAEVKEKTELLQTLSSDVSELLKDKTHLQEKLQSLEKDSQALSLTKCELENQIAQLNKE
KELLVKESESLQARLSESDYEKLNVSKALEAALVEKGEFALRLSSTQEEVHQLRRGIEKL
RVRIEADEKKQLHIAEKLKERERENDSLKDKVENLERELQMSEENQELVILDAENSKAEV
ETLKTQIEEMARSLKVFELDLVTLRSEKENLTKQIQEKQGQLSELDKLLSSFKSLLEEKE
QAEIQIKEESKTAVEMLQNQLKELNEAVAALCGDQEIMKATEQSLDPPIEEEHQLRNSIE
KLRARLEADEKKQLCVLQQLKESEHHADLLKGRVENLERELEIARTNQEHAALEAENSKG
EVETLKAKIEGMTQSLRGLELDVVTIRSEKENLTNELQKEQERISELEIINSSFENILQE
KEQEKVQMKEKSSTAMEMLQTQLKELNERVAALHNDQEACKAKEQNLSSQVECLELEKAQ
LLQGLDEAKNNYIVLQSSVNGLIQEVEDGKQKLEKKDEEISRLKNQIQDQEQLVSKLSQV
EGEHQLWKEQNLELRNLTVELEQKIQVLQSKNASLQDTLEVLQSSYKNLENELELTKMDK
MSFVEKVNKMTAKETELQREMHEMAQKTAELQEELSGEKNRLAGELQLLLEEIKSSKDQL
KELTLENSELKKSLDCMHKDQVEKEGKVREEIAEYQLRLHEAEKKHQALLLDTNKQYEVE
IQTYREKLTSKEECLSSQKLEIDLLKSSKEELNNSLKATTQILEELKKTKMDNLKYVNQL
KKENERAQGKMKLLIKSCKQLEEEKEILQKELSQLQAAQEKQKTGTVMDTKVDELTTEIK
ELKETLEEKTKEADEYLDKYCSLLISHEKLEKAKEMLETQVAHLCSQQSKQDSRGSPLLG
PVVPGPSPIPSVTEKRLSSGQNKASGKRQRSSGIWENGRGPTPATPESFSKKSKKAVMSG
IHPAEDTEGTEFEPEGLPEVVKKGFADIPTGKTSPYILRRTTMATRTSPRLAAQKLALSP
LSLGKENLAESSKPTAGGSRSQKVKVAQRSPVDSGTILREPTTKSVPVNNLPERSPTDSP
REGLRVKRGRLVPSPKAGLESNGSENCKVQ

Region:1-3210
Length:3210aa
Label:Full sequence
Reset:click