CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032669
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcriptional adaptor, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_072430 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
241PPPATSEKVAVSVCRacetylation[1, 2]
1137LTETQAEKGEAREKQacetylation[2]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2697 AA 
Protein Sequence
MSQAAVSAGD TADATVSSVS ARLSPSLSTS QRDAGTKGEG SASTQASMPD ADERSSRSEK 60
KGATAKERFE EPVEDRTNAR EPENVNEKLE DEPGDKWRKL KVSRASKHAL KKSSETEKVT 120
RRGVEAMLSN RLSVSRLKNA MCLPTPETER PDRGSGGGEA ARMRTRVIAR RVHVSSSPKQ 180
SSQRSSVLST GTPSPFFPSS PRRASGRSYV RVSPSPAPRS REVEARGTLS PASPPPATSE 240
KVAVSVCRHR GKEESSPSVT RSQKPRWSSA ASRRSEGTSG RRNLPAVRRL LRSAGRGSPK 300
TERVESDSQS RCRGDSSPSA SSASSPMSGK GTCVAPRTPA PVSTQRAPRM RSLRRPLGTF 360
TTRRQRATPS ALSGSRPGLR SERRAEGSRS RRGVSPVASS SDAVLRDADE DPEEGDGARE 420
RLHGTRRRLP VSARAGVSTR SSAAGSSSEG YVGSQGFPGM PEDGGTPPAS PRTEPERRRG 480
DRERPVAGFP GLRQRHLAPG EEPPTPRPGG VSPRSSPMTP EEREKVGTRR GYGIGGIGDG 540
GRQKKAKTEA EKDDDTRALC GQNEERLVGS TPNERSPMRP ILSFSPSSPA RKKDGLQARD 600
GFFGARWRVN AQTGLKELDP GPLGVDFHCN VCGVDVSVGQ WRVRCAECDD FDLCVFCFAR 660
GRETGTHLNT HAYRPVPPNR QEIFAPNWTA DEEQMLLEGV SRFGLGNWND VASLVNRVAL 720
RAKTKQQCEQ HYMSVYIDSG GIPAALWEQE IPAPKVHSPP EASPSSLIFS PRRDASSDKD 780
EAASPLQKAA GAGPRDGAGE DGGLLGSNGD TSATTPDGEA AMPQNAAKTG DRRHGDEAEQ 840
RRERAQRGEN ENRSERDERT EEEKMFGTLL ADTESDEEES NQRTLAELQA DSNSWHPPWI 900
AKPVPPQPHH NNLQGYLPLR GDFDVEYDNH AEALLADMTI EPHESPSEKA LKLSIVEAYN 960
CRLDERIYRK RTILWRHWDD PKIANREKTG SLLERRYWQQ LKPVQRFHND SEHIALIRSL 1020
VTYAEAMERC RLLKEWRSLD LHTLQHVVDY EAEKQKRGAA CRPAAFRSLF DAPSVSGAPS 1080
ADDSFKATSA GILSDAPAGE HMTVAPKSAV TSTSAERPSS LSAAAPVALL TETQAEKGEA 1140
REKQSSRSSS PPPSAFAPSA AIDSASGTSR KGGRQGGPET RVKDGGPDEE SEADAREQGL 1200
DEGAKAKKRG RDEELSEGQA HADGLERSEV DLCRALQLPP VLYQLVIQAL KAQAQMLPTV 1260
EKDMLRKKRK RISTIGSLWD FDVRLDVEQK AGMLRDASSP KVILSSPGNQ TVGSPPSPFA 1320
VSFASSVPPA AGQATDLRTL GGEAASSQRF AGGREASTSR ASAPGDGPPL PGGSAQLASA 1380
QGLPQSETHA YEPNRQSLHP GYGASQNPSR LMSSPSPSAY TARSGSSRHG TPDAFASPGT 1440
LAPSPQVYGG ASRSGPEAGR GQTAHSASPP VSSLPQSFMH LHRQTLPGTH PSFAYSNAAY 1500
SHTVACSNGL QSFHPAAYST YRSNGSSVDV GPSGYALSPA GFCGAGASLG PGVSQNGFYA 1560
SRAYYGTPSG VPFLPQAPSL RAANTALQRL QQQQQLQSFL LAAGRGGRPT AHLGWGQQQP 1620
GNALLGNVIS TGGGAGVGGG HDFLIPSAHQ NVANLSHMSP AYLAGNGHPY LSSLSYYGNP 1680
SNGAAMHAAS GYGQLNLPSQ FTAYQSSLLQ PGASPSRQGG GTGPFLSQPN PGHRDLGAEA 1740
GAQGAGDPQG PGRRHPGMLA SGGGTRIETS TAPSRLFQRE NQMGSCRPAW PSGDSLAHTS 1800
NSLETDGEPL HTGQGLDRSE RQDGETSQGD AACGAGERMK KATGEGTMGS DAEAQTGIDG 1860
AKIQASFSSG ESLFASGPSV LSNTGSLRSV VAGLPAGSAG LCGPGGEGGG GALAERRKHE 1920
RLRVPPDSQE RQRAVPTPLS QEGKREEERE KWKEFVLHKL VPRVRGVRYD AVSHHWVAQW 1980
PAVFTASPFD SVQSFREGGV PPATLAVSVE NSGETPTTRI PASGGPETEG ATRSLAFSIK 2040
RLGFEGAWFQ ACEARRKEAE KARDWGLVAE VRAAETAAPS VFAALQDELP SLPFSSNDLA 2100
SAFAGLSNTT GENKGTGGLQ NASFSTGAKK DEVRDEEDRR EKPGRRLPSA ASPVPAAVKG 2160
GNEAGAGALS SLTRAAAHQL TIEGGLNSAN SEVVFASPMA GLPRHAGSSR ATDGLLQLRG 2220
SPPGQELIQA SASTGKRQVG DEPVLADAQN APRGVLQLGG QSLHVTSALA SPGISGVGRE 2280
RLLNGRPCVV KSLHGLRLER NGGACGGLGP WTGGGASAVG ASLVEMHRRL PLAHAGKETP 2340
SPGIGGARKA MAASTVVAHA NGGDGRAMLL ASGLSPVSGG VKHRELPNNL AGGQFRAQRD 2400
DAKAAFCNGG ASCNGASVST LPAPALYATG KPRGFPGRGG GSTHAQSGVS GRGHGERRQG 2460
GSVGTGEEKN GQGSSAQPTL AWSDDDEGAN SLEPSPAARK ATACRGSREV SLATDSAAAE 2520
GPRGSSAQTV ALPEEQRRTT GALLSSSVGA VERSVSETPV AEREASSSFV QGPGSLHASE 2580
DISDNDASQR QHRMWAGGPR CGVEGAREEE AGWARDDVEL QNQSSGLERD KERQAVESEA 2640
DTVEAARCDE NREEGKGENV AVVDAHPTSR ETTVSEGFAC SAPPLLSASG QENETER 2697 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-KW.
 GO:0003682; F:chromatin binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR009057; Homeodomain-like.
 IPR001005; SANT/Myb.
 IPR017884; SANT_dom.
 IPR000433; Znf_ZZ. 
Pfam
 PF00249; Myb_DNA-binding
 PF00569; ZZ 
SMART
 SM00717; SANT
 SM00291; ZnF_ZZ 
PROSITE
 PS51293; SANT
 PS50135; ZF_ZZ_2 
PRINTS