CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041104
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 MCG1556, isoform CRA_b 
Protein Synonyms/Alias
 Protein Zfp407 
Gene Name
 Zfp407 
Gene Synonyms/Alias
 mCG_1556 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
1356DSSIIRIKTEDGELVubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2246 AA 
Protein Sequence
MDSENKHDND EDESVKGAKD PRSVSSRDDS SEPVSDDMAN VSENFTSKRV FSESSNSDGV 60
AAEEYRSERA SKRLRTEALE PETREQGACG LGTVEANVHL AEMGKEAFPA NCSEGGSALP 120
SVFSPSCDFS TSNILPLKVD EVKKPAREMV SLVPERHNPF PPEERSISCS FGSVETGLRC 180
GVCGCSFSSC SALEKHVECH VEEGKERTCC HCSHRAESSS SPHEHVRHTH GPQKVFSCDL 240
CGFQCAEENL LNAHYLGKTH LRRQNLAARG GFVQILTKPS FPKKACVMGT KNVRTKPRAS 300
KPIAKTGDSR VVPSTGNDFK DCRGAPSEDS AGGSELLVEM VPSRKISSGK AHVVEENVSF 360
GVAQNPENQN KQLGGLVSSE GLLNKPESTK NALQMAHVSI STTSRPRSER NLLVLGNSFR 420
RRSGTFTLKG QAKKRFNLLG INKRGTNETQ RMYMKHFRTQ MKTNAQPVLE QVEMSKDVQS 480
LCVTTSDNPE VMQDKTAFCL SGSTRLGSLP VKPADHQLSV QCTCTECGQI AANKTDFEVH 540
VKQCHGREMQ FHCQTCDFSS PSRRDLEEHV HSNQHQHMTP VLSCQCCSFI SLNETGLRDH 600
MKEKHGMGFF CTSCNLFLSE KDVEEHRATE SHNSLVVQPK TASSLSGDSV LPFSTVESEN 660
PADSREDSGK AAQEGPAESR ASHGTEARHS SKPQFQCKKC FYKTRSSTVL TRHIKLRHGQ 720
DYHFLCKACN LYSLSKEGME KHIKRSKHLE NAKKNNIGLS FEECIERVCI GANDKKEESS 780
VSGSGRPEGH VGVQSQEHSQ REQSMLTPKE LPQSGVITKE DELGLATTPK RGRPKGSISR 840
TCSHCGLLAS SITNLTVHIR RKHSHQYSYL CKVCKYYTVT KGDMERHCAT KKHKGRVEIE 900
ANGKQSSDIV VGPEGGNLEA CKDSTSLAVT VSDAQASKPA KSDTRTLETP GVGIGNAGDA 960
EAGSVFPSGD GELSSQLSDK KGQLSLETED LLQLDDACSQ REVAGSSDNK CLQCEFSAHS 1020
AASLELHVKR KHTKEFQFYC MACDYYAVTR REMTRHAATE KHRMKRQSYL SASSVEAGSS 1080
EISKNITIPE EEHSHNSEEF QIHPHQSSGT LQCRNPADCS ILDDNTNLGM SKVLCAPDSV 1140
TVVTEQESNF SEGHSFCETL QQPLVKDKSM KPREIVSPNT PSNLSLPGCL QSENLASSAV 1200
DCETAKKNRD VLDAVGDRST PCEDEGGSVD DSEEKILDKS PCPGDPDGGH SAESTSSVVM 1260
KIPREQLDLD GGGQNKVGCE QTSEDLKDVQ ANPILENKEI LINSQEEAEV ILEEDAPTSN 1320
GTADSNDVYE TIISIDDKGQ TMYSFGRFDS SIIRIKTEDG ELVEQPEEGL TATGGRVSEL 1380
PLKDCAQGLK KKKVEGGSFG ESTRIRCDDC GFLADGLSGL NVHIAMKHPT KEKHFHCLLC 1440
GKSFYTESNL HQHLASAGHM RNEQASVEEL PEGGATFKCV KCTEPFDSEQ NLFLHIKGQH 1500
EELLREVNKY IVEDTEQINR EREENQGNVC KYCGKMCRSS NSMAFLAHIR THTGSKPFKC 1560
KICHFATAQL GDARNHVKRH LGMREYKCHV CGVAFVMKKH LNTHLLGKHG VGTPKERKFT 1620
CHLCDRSFTE KWALNNHMKL HTGEKPFKCT WPTCHYSFLT ASAMKDHYRT HTGEKSFLCD 1680
LCGFAGGTRH ALTKHRRQHT GEKPFKCDEC NFASTTQSHL TRHKRVHTGE KPYRCPWCDY 1740
RSNCAENIRK HILHTGKHEG VKMYNCPKCD YGTNVPVEFR NHLKEQHPDI ENPDLAYLHA 1800
GIVSKSYECR LKGQGATFVE TDSPFTAATL AEESPVKERS LRSSKRQAAS PEQVQQVIII 1860
QGYDGEFALD ASVEETAAAT LQTLAMAGQV ARVVHITEHG QVIATSQNGS HVGSVVPGPI 1920
LPEQLADGTT QVVVMGGSME SHSVDEALSP GAAVIQQVTK QEVLSLSEAG VPPSDNSSAL 1980
DALLCAVTEL GEVEGRVGHE EKGRPSHKDV LIQLPSQEAA QAHAKAEATE AQLFQDVEES 2040
PASMEVLTQV VRPSTIITSQ ERAQVAFKKM VQGVLQFAVC DTAAASQLIK DGVTQVIVNE 2100
EGAVHMVAGE GSQFIMQEAE THGLRVPAEH MDLVESEGEI SQIIVTEELV QAMVRESNSS 2160
FPEGATHYIV TELPPGVQED TGVYSHTVIE TASSPEILQA GAALSAEAVG SSSTEQLTSM 2220
VIYTQDGSPA ATVIQSQREN SELQEA 2246 
Gene Ontology
 GO:0003676; F:nucleic acid binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR007087; Znf_C2H2.
 IPR015880; Znf_C2H2-like.
 IPR013087; Znf_C2H2/integrase_DNA-bd.
 IPR003604; Znf_U1. 
Pfam
 PF00096; zf-C2H2 
SMART
 SM00355; ZnF_C2H2
 SM00451; ZnF_U1 
PROSITE
 PS00028; ZINC_FINGER_C2H2_1
 PS50157; ZINC_FINGER_C2H2_2 
PRINTS