CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-032783
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 E1a binding protein P400, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_080190 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position    Peptide    Type    References
936    RGCQQGEKEIKAELR    acetylation    [1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2924 AA 
Protein Sequence
MSLPPQWQPP GHSVSSEQDS VSAFSPADPS TLVRRPASSS LSDCPCVAGS SGGSVADAQP 60
WSVPCHVAAA SSAVSSQRAL STSAATPSGG LRVAQKSSEA APRALHAVAD PMSLNAPPIH 120
KSGSPPALSC DVSGNRNGQA PSAAPLESGP SAASPWRRTM GGQGVGEGLP FQQVRLSFSS 180
RVQNGDESKS ACVSASSDHL RDAWADGIQD GAQPGSRDRP RRRRGRVTSR DSSEHDGASL 240
SAATLPGGWP LEECVQRNTA SDDGDASVAG DEPVASGSCG DWPALREKEW LELSNPSASC 300
CSAPSASVSD VAGPLASSAC FPHADSSAFS SRACAVAPGR QAQLAVRWEA FRHPALTERE 360
ERERTILRVQ EEQRQVLLEI QQQREEARRR GEELPDEPPP ELACKMCGVD DSGNLLHPGC 420
VDMDEFLASL GGLYDVGLNQ EEEKTVQEEL QSLQERLQKL LVKMGRSSGR GEGGGPVSQR 480
EPARVPEVVF QNILEKEIRS FQDLVIDEHK EKRKLFRQLA GGCRRHVETV EKKKQMKAEE 540
EERRLRAVAK STCGPVEVFW RRIERLVWER EKRQLQRQLH EKKKQRLDRL VSEAMQQCRR 600
LAQGLRKPCL RSTAAQHDGS KRLTSETTMS SRRNSFVSDE EAKTERQGPG RSESRFSSGR 660
RAKAKTEGGE EDDGEKEWTA SMFAKDQEEE DDRLEAEMER EEDEEQEDLQ SELQGLQDEA 720
SIPVEELLKR TYGVEGGLTQ LASDRQREQK EKAHRADKDE EDGEEEDGEA EDGEEGDGEE 780
EDGEEGDGEE EDGEEGDGEE EDGEEGDGEE EDGEEGDGEE EDRENPWSPL GHSKEQEEED 840
ERLDVEMEKE EEDEQEDTEA ELRGLQDEAA MPVDELLKRI YGVEGGEAQL AADKRREVDG 900
KARREKAVKE AEASGGRGQA DRQEEREERG CQQGEKEIKA ELRGREAAAE KIGDTEHSAD 960
DDDDNRSEFS LDGGAFGKQK EEDEELDAAM EAEEEEEEED ELKRLQEDAE LPIEELIRRF 1020
GAPSSGRMEE EEEEGTSSED EVVIPVQRSL RRRRQRGDSS CEVKQEASSP CSEVRSVSPG 1080
QPVEERDSVK RERGASEDRE EADKREAKTG DASSADDSRD DLSRREQSER EGNRQLRWPS 1140
LSDAEPRSPA GARVQNMKRE EGDSKRVLAV KPSKAEVVCV CSPHSAGAVA KKSTEEENLA 1200
GFSSALSQGP HGEGRGGQSS EKTKSASDTE PSPQPRYLSS NPAPALVRAT LRTYQSEGVQ 1260
WLFALHDKGL NGILADEMGL GKTLQTIVLL ARLALERGVW GPHLIVVPTS VMLNWEREFF 1320
KFCPGFKVLV YFGSAQERAK KRTGWSRPYA FHVCIASYST VVKDAQIFRR KKWYSLVLDE 1380
AQNIKNFHSR RWQTLLTFNT QHRLLLTGTP LQNNLAELWS LMHFLMPTVF QSHDDFKEWF 1440
GDPLTAAIEQ EQVSEHQQLL EKLHALLRPY LLRRLKKDVE KQMPRKYEHV VRCSLTKRQK 1500
CLYDEFMQRR QVQQTMAAGN YRGMMNILMQ LRKVCNHPDL FEPRPIETPV GGGGVNALSY 1560
DIPAMICLWL HEPWNWCIEE RFRRVTLPII SLIHYEILFS SLQHGLAQNL SPLRLLAPAS 1620
SSSLLQFSPF DLLSAATPLE ECDLLSLPQP VYTRRLHADN SRNSSASGPP CSSSCSSSVA 1680
AGVTRFPALS PRVPVENQGN ALPLSLPLQE DERIQEFILP CQSPESQISS GHVSEPFPGA 1740
AGSSHKVPSV WSAETARHMC VDMRVQGCPG SLAPCASPAA ATAPSGPPQG PQVPSSSPSL 1800
VQSPVHPSFP VASALPLSRP GTGSQDSVGS PPSGVSPNST FQSHSSSHCS LLSGPAPHSL 1860
GEREAVPPLR SAERTFSWVH PDADREGVSA SVGGEALDRG EFLGPIGGSS QGPSPGGGRA 1920
SPLTPVDNSS GPDCDEPGEE GQRESRATEE RLFTASGVHA SSPSRVALEA RGDENPLEGP 1980
PLFESGEAAE AVGSRGMRPL SPAEDACVQG FAGVPPSLRA AEPALATAAV PASRAASGAS 2040
SLQASPCETP TTEKKLGSSG GPPTGLRVSA RAAARRCREE RLAGHDESVK NEGKQPFLSD 2100
ASGSAMPPSS VLPLSATKRR RLMTAAMSPG EGRRNGEGTE SDVLSLGTLL EVPLLRNTTD 2160
DGSTAPRAGA SASASSALPA NSVSLVLTQP FGSASIPDLD GFLAAHARRR CCGRDRRLLL 2220
RNCFPVPPAF VSHLMRQQED SEKDGKVMED EDLVLNGDAE AAALSSGLAD SLSSRAGAAG 2280
GTGGVGSPPA LERGGKARRQ AMRGIKRAGT LWRRWISSER CEMQPALCLV DATGGESFLD 2340
SFTPGESEFT AGEDRGMHKD TQTTFLARPA EPFSAAVDAN EGTRMRKAEF WRSEEMRCEA 2400
SQAMFLFASS LSLASPPLFG GRDTRELLRK EICQSPLNPV GSVHEPIRAS GDFLARTETL 2460
RRITPTPTEV FERDQSVILR CSVLCNPRVQ PGPPRIFLRG PGGIQARENS FSAFAESLEF 2520
LQGSAAELHE AVERQRRIFP HKQTLQDDCG KLIVLAELLT KLRADGHRCL LFTQFSKMLD 2580
VLESWINHQG FTYVRLDGST KVDQRQRVVT RFNANPRIFL FISSTRAGGV GLNLTGADTV 2640
IFYDTDWNPA MDRQAMDRCH RIGQTRDVHV YRLVTEHSIE ENIWRKQLQK RLLDEVVVDR 2700
GLFTMENTTR EGHLGQQTQD KEAAREWFAN AETLKDLLAS PEESRAKSGF KDDIYADRIL 2760
HDSAEDNPDD TEASTHVPRS GKRGAGLGGG GEFEAAILEV EDVEDVAAMQ QTTREEKQAK 2820
QELQQDFRGE KVGEEGTVDL NGALERMPAL AAYCVRLINE NKPPSLLAQI AQLKTQVRAE 2880
GDEDEEKPND EDRQSESEEP QRSSEDDGPA LWESEIESTE EDDE 2924 
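The sequence rows above are laid out as ten-residue groups followed by a running residue count, so a listed modification site can be checked against them. The following is a minimal sketch under that assumed layout, using only the row that ends at residue 960. The helper names `parse_row` and `window` are illustrative and not part of CPLM, and the 7-residue flank is inferred from the 15-residue peptide listed for position 936.

```python
# One row copied from the Protein Sequence block:
# ten-residue groups, then the running count of the last residue in the row.
ROW = "KARREKAVKE AEASGGRGQA DRQEEREERG CQQGEKEIKA ELRGREAAAE KIGDTEHSAD 960"


def parse_row(row):
    """Return (residues, 1-based position of the first residue) for one row."""
    *groups, end = row.split()
    residues = "".join(groups)
    return residues, int(end) - len(residues) + 1


def window(residues, start, position, flank=7):
    """Return the residue at a 1-based position with `flank` residues per side."""
    i = position - start  # convert the 1-based protein position to a row index
    return residues[max(0, i - flank): i + flank + 1]


residues, start = parse_row(ROW)
site = residues[936 - start]            # residue at the annotated position
peptide = window(residues, start, 936)  # 15-residue window around it

print(start, site, peptide)  # prints: 901 K RGCQQGEKEIKAELR
```

The residue recovered at position 936 is a lysine, and the surrounding window matches the peptide given in the Lysine Modification table.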
Gene Ontology
 GO:0005524; F:ATP binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0004386; F:helicase activity; IEA:InterPro.
 GO:0016740; F:transferase activity; IEA:UniProtKB-KW. 
Interpro
 IPR014012; Helicase/SANT-assoc_DNA-bd.
 IPR014001; Helicase_ATP-bd.
 IPR001650; Helicase_C.
 IPR027417; P-loop_NTPase.
 IPR000330; SNF2_N.
 IPR003903; Ubiquitin-int_motif. 
Pfam
 PF00271; Helicase_C
 PF00176; SNF2_N 
SMART
 SM00487; DEXDc
 SM00490; HELICc 
PROSITE
 PS51192; HELICASE_ATP_BIND_1
 PS51194; HELICASE_CTER
 PS51204; HSA
 PS50330; UIM 
PRINTS