CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-000907
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Zinc finger C3H1 domain-containing protein 
Protein Synonyms/Alias
 Coiled-coil domain-containing protein 131; Proline/serine-rich coiled-coil protein 2 
Gene Name
 ZFC3H1 
Gene Synonyms/Alias
 CCDC131; KIAA0546; PSRC2 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
880EQLQATEKILNVNRMubiquitination[1, 2]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789
Functional Description
  
Sequence Annotation
 REPEAT 1361 1400 TPR 1.
 REPEAT 1381 1413 HAT 1.
 REPEAT 1401 1434 TPR 2.
 REPEAT 1415 1446 HAT 2.
 REPEAT 1503 1535 HAT 3.
 REPEAT 1602 1635 TPR 3.
 REPEAT 1636 1669 TPR 4.
 REPEAT 1745 1778 TPR 5.
 REPEAT 1759 1794 HAT 4.
 REPEAT 1804 1843 HAT 5.
 REPEAT 1919 1951 HAT 6.
 ZN_FING 1185 1206 C3H1-type.
 MOD_RES 2 2 N-acetylalanine.
 MOD_RES 15 15 Phosphoserine.
 MOD_RES 28 28 Phosphoserine.
 MOD_RES 34 34 Phosphoserine.
 MOD_RES 352 352 Phosphoserine.
 MOD_RES 714 714 Phosphoserine.
 MOD_RES 717 717 Phosphoserine.
 MOD_RES 719 719 Phosphoserine.
 MOD_RES 809 809 Phosphoserine.
 MOD_RES 949 949 Phosphoserine.
 MOD_RES 953 953 Phosphoserine.
 MOD_RES 998 998 Phosphoserine.
 MOD_RES 1046 1046 Phosphoserine.
 MOD_RES 1303 1303 Phosphoserine.
 MOD_RES 1304 1304 Phosphoserine.  
Keyword
 Acetylation; Alternative splicing; Coiled coil; Complete proteome; Metal-binding; Phosphoprotein; Polymorphism; Reference proteome; Repeat; TPR repeat; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1989 AA 
Protein Sequence
MATADTPAPA SSGLSPKEEG ELEDGEISDD DNNSQIRSRS SSSSSGGGLL PYPRRRPPHS 60
ARGGGSGGGG GSSSSSSSSQ QQLRNFSRSR HASERGHLRG PSSYRPKEPF RSHPPSVRMP 120
SSSLSESSPR PSFWERSHLA LDRFRFRGRP YRGGSRWSRG RGVGERGGKP GCRPPLGGGA 180
GSGFSSSQSW REPSPPRKSS KSFGRSPSRK QNYSSKNENC VEETFEDLLL KYKQIQLELE 240
CINKDEKLAL SSKEENVQED PKTLNFEDQT STDNVSITKD SSKEVAPEEK TQVKTFQAFE 300
LKPLRQKLTL PGDKNRLKKV KDGAKPLSLK SDTTDSSQGL QDKEQNLTRR ISTSDILSEK 360
KLGEDEEELS ELQLRLLALQ SASKKWQQKE QQVMKESKEK LTKTKTVQQK VKTSTKTHSA 420
KKVSTTAKQA LRKQQTKAWK KLQQQKEQER QKEEDQRKQA EEEERRKREE EIRKIRDLSN 480
QEEQYNRFMK LVGGKRRSRS KSSDPDLRRS LDKQPTDSGG GIYQYDNYEE VAMDTDSETS 540
SPAPSPVQPP FFSECSLGYF SPAPSLSLPP PPQVSSLPPL SQPYVEGLCV SLEPLPPLPP 600
LPPLPPEDPE QPPKPPFADE EEEEEMLLRE ELLKSLANKR AFKPEETSSN SDPPSPPVLN 660
NSHPVPRSNL SIVSINTVSQ PRIQNPKFHR GPRLPRTVIS LPKHKSVVVT LNDSDDSESD 720
GEASKSTNSV FGGLESMIKE ARRTAEQASK PKVPPKSEKE NDPLRTPEAL PEEKKIEYRL 780
LKEEIANREK QRLIKSDQLK TSSSSPANSD VEIDGIGRIA MVTKQVTDAE SKLKKHRILL 840
MKDESVLKNL VQQEAKKKES VRNAEAKITK LTEQLQATEK ILNVNRMFLK KLQEQIHRVQ 900
QRVTIKKALT LKYGEELARA KAVASKEIGK RKLEQDRFGP NKMMRLDSSP VSSPRKHSAE 960
LIAMEKRRLQ KLEYEYALKI QKLKEARALK AKEQQNISPV VEEEPEFSLP QPSLHDLTQD 1020
KLTLDTEEND VDDEILSGSS RERRRSFLES NYFTKPNLKH TDTANKECIN KLNKNTVEKP 1080
ELFLGLKIGE LQKLYSKADS LKQLILKTTT GITEKVLHGQ EISVDVDFVT AQSKTMEVKP 1140
CPFRPYHSPL LVFKSYRFSP YYRTKEKLPL SSVSYSNMIE PDQCFCRFDL TGTCNDDDCQ 1200
WQHIQDYTLS RKQLFQDILS YNLSLIGCAE TSTNEEITAS AEKYVEKLFG VNKDRMSMDQ 1260
MAVLLVSNIN ESKGHTPPFT TYKDKRKWKP KFWRKPISDN SFSSDEEQST GPIKYAFQPE 1320
NQINVPALDT VVTPDDVRYF TNETDDIANL EASVLENPSH VQLWLKLAYK YLNQNEGECS 1380
ESLDSALNVL ARALENNKDN PEIWCHYLRL FSKRGTKDEV QEMCETAVEY APDYQSFWTF 1440
LHLESTFEEK DYVCERMLEF LMGAAKQETS NILSFQLLEA LLFRVQLHIF TGRCQSALAI 1500
LQNALKSAND GIVAEYLKTS DRCLAWLAYI HLIEFNILPS KFYDPSNDNP SRIVNTESFV 1560
MPWQAVQDVK TNPDMLLAVF EDAVKACTDE SLAVEERIEA CLPLYTNMIA LHQLLERYEA 1620
AMELCKSLLE SCPINCQLLE ALVALYLQTN QHDKARAVWL TAFEKNPQNA EVFYHMCKFF 1680
ILQNRGDNLL PFLRKFIASF FKPGFEKYNN LDLFRYLLNI PGPIDIPSRL CKGNFDDDMF 1740
NHQVPYLWLI YCLCHPLQSS IKETVEAYEA ALGVAMRCDI VQKIWMDYLV FANNRAAGSR 1800
NKVQEFKFFT DLVNRCLVTV PARYPIPFSS ADYWSNYEFH NRVIFFYLSC VPKTQHSKTL 1860
ERFCSVMPAN SGLALRLLQH EWEESNVQIL KLQAKMFTYN IPTCLATWKI AIAAEIVLKG 1920
QREVHRLYQR ALQKLPLCAS LWKDQLLFEA SEGGKTDNLR KLVSKCQEIG VSLNELLNLN 1980
SNKTESKNH 1989 
Gene Ontology
 GO:0005622; C:intracellular; IEA:InterPro.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0006396; P:RNA processing; IEA:InterPro. 
Interpro
 IPR003107; HAT.
 IPR019607; Putative_zinc-finger_domain.
 IPR013026; TPR-contain_dom. 
Pfam
 PF10650; zf-C3H1 
SMART
 SM00386; HAT 
PROSITE
 PS50005; TPR
 PS50293; TPR_REGION
 PS50103; ZF_C3H1 
PRINTS