CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032665
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 AT hook motif-containing protein, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_093380 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
2863NGEHEGEKRCRKELGacetylation[1, 2]
2912KPRQEPSKSIPGDDKacetylation[2]
3239EAARTADKRLERPGEacetylation[2]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3767 AA 
Protein Sequence
MRRLSISAGT PSTKGGGKAC VVSASDVPPS LSRASESHLP RPREEPAQAI PAALPFSPAS 60
SLPASPGSGG SGVCTPDSRG QAGAAGGPQG GRRGRKSVQT VKAGQSPAPL VAGSPRGDRL 120
PCQKEEGSVS EQKDLLSVSR CSASPAGPPS VGAEAPLSSL SRSRRPSVAD ARSPLLAASA 180
AVPSATSALR PSRAETVSLT GASPPPAVVE PQPAGDGIAE KASSGRTGSS EEDQDAPGVF 240
SQRASRSKED GRDVALLGGQ DERREELREE RGRGRRQGSV AESRLQAPAS KTEREGKKAE 300
ADKKEETGKS TGRQRLPPSR CRHRVEEPRW DEEDESELSE YAGSASGVNA GFPGAEKLEK 360
ECVRSREEKR RRRDEKGNET GSPQPERGGE EDARGAMRHG EAGKDPAADA SGDAREGAPL 420
PEQADSEKEG AEKKARQQPG RTEESRGKRE NARGEEEQEE AEKKRGKKED LRKKKVSSVR 480
RPDEEKEETE VDVVMTDANE AEREAVHAER RESAESGEGA EAREEKKRRV EGREPAKLEE 540
AMGEAREAQS DGRREEDGRG SSPGEAEGPE GLDRDSHGVC RGNLGGDRGR NSHAEKETSE 600
SLLGETAKAA KQLEEREGGR GSEVQQENET EKGERRRGLK SDGRSKTGLG RDADHGKATR 660
RSEKPPEESR QQTSASPSST SSSSSAAVVG TESDACDASA EKKERGGSPG RAALAAPAAD 720
SLQVCDADPA SACVGTLKRT GECSAEGDEV AAVQSHGKAR LGEKREIPQG AVAGLDSERR 780
RKGERKEARE SSLSVSGSRK IQDAYLSEVA AEDADPGDRL AASEVKSSAS SEAPASPGTA 840
REDGASRLAA TSPSRSQGGG ASTVATRSSS RIAAKTQVPE PASRREEREE RRQREDKLWE 900
RVHRRRDSEK EEKGGEIRYE REEKSGEETE ANTEDAKKGG ERSSTSPESE LLPSRQKSSS 960
DFKGPVKGSR RGPVTLFSEP SAPPANDEPS DAGGATPSSE AKLGTGRPKR SASAKAFSAE 1020
KVNRSRKAAS EKEDSGVHTP QNSSDASRPR AESATEAAPR GPEGPGTSDD PAAGVVVSGS 1080
DDRAAGSASK QRPLDLMSPT RSAASPLERP SAPVSPVAEA VSSASPLASQ REAAPARRSA 1140
RQQQRQAALA TSLSRCLEGD GAEYVEAQPR SSKEGQDSLK DIAPRVADAA ASSPDSTEET 1200
PGSDASATSA DPSGGAAESG RRGGRGGSGS GASSGCGAAV ETRRSLLGLT ITNSPYWSRP 1260
STHGSAAGGS SASARGSNGD AAKAGKEGRD SETQAKSAGS SLAVRPRLRP GKASNSASGP 1320
AGDAGASGTL PSRRGPGAAA SPSERACEKK AENRESGEEH RESGEREREL KREASDEMAK 1380
NGGCRREDVR KARRGEEEKR GKADERERRK AEEDLEKESR ERGAKKAGHG SGSGTRSGLL 1440
PRLRTRGFGR EEQGENTDGE NEDAGLSGTQ RRDFLPTRGT QSAKRSRRAA EAMAAVAAGA 1500
AAAARERRRQ GSTRHLGDQR EEEGGEEREE EGEEDRISFF RHEGASSPES RQESAALLGF 1560
SSTSISRDTS PEKDDGGLRS RRLSVSSLQR EEEKTRGEAR ETRASKARRS AVARDEAEKR 1620
EGESEKKEET QVTEESQKDE GKEDLKELER RRRRLTRGRE EENEWPKFGR GRTRRASLLS 1680
SASLSRHGNG SRAAAAGGDE QEAYEELRQK SREEEKEMKE ATEKDKGEAG GETEEHGELS 1740
SSPVAVAPRS TRGFTCVKRQ SPSLGEASRL LDSTEGEEKR EGEEKKGEES GSEGPSVSPS 1800
HGVRAASRQS KHEASRESGD FEQDEKKGEE ADPDPGDAAS AEERGRGEGG LNREGDREGD 1860
EGEGGEEIEK KEEKSLAGTE ERETETGEGR DDASAPSADR DRHSQEDRHE KRKAKETSAD 1920
RPADGEEKDS VADGVVRRTG TRLSAARQMT PKNEEGESAG AKKDGKQAEE GGGEVTAFKA 1980
VIAKLVAFNE ADKVRYPQWY TGPVGAFEPR HNQLCWWRFR VQAKYQPGRV VMDFNEEIRC 2040
GRLTEAVKQQ LFVTCVRRTH GTGSAQHLVC RSNTDETGGP AEPPHIGDIV IVQNLEDLKF 2100
SVCNWKVTKP LLFSLPLVWA NKRMCEPVGR LCFFQPFPAG EMQVRLERAS GSALGVLNSL 2160
TRAKKFLQIR EKYSGWDWTA FQKEQDRLVA QRRRGKADAS AALEDESAPS ETSGSTGRLG 2220
QAPASPPFGS SSPFSSSACT ARSSGAKAGA SGREDAAFSG ASSLRSAVGD TPTMSTSTFS 2280
APSASPRRVV YATLPPHIAA ARARVTLGAG GAGPSLQAIP VLSSLTQSLA SSSLLDDDSL 2340
SLLADLPPSH PLFLNLLNQS REPPTPRADA VVGALPGLSG QWIRMLSQLI LRRDALLHLL 2400
QPVSLDLLHN PTRVFGPLPA LLPPDLLLQV SLATGDAGPA CHLLTANLYL ARLADNAAVR 2460
TLDPAQKRIR VLQSSRAETA DRSSFAERLL GARDEKRARK WSAERHTDAE KPHRPKKKRR 2520
RDPNEPPRRR GRPPSDVSSF AARETSAGPQ SSSAPKGPSS SSLTSSSLSS SSWSSSSSSS 2580
SSSSAACAAS LRRSLLLPAS SSSQSENAPP VSYEMGHSIT FSVGLTYGRL FLPSLISELS 2640
PNKRRSVHGR CSDDEVSETE LLETLDRRSR AKRLFDRQIT CAPWIGCCPS QSAELAAALA 2700
LGTAKTEPSS SSLSSSSDLL ASSAVSHGEK WTTERQERTL EAARPAGEET PSAGEKEDAR 2760
EMGGDTNETA FREAASLERD APGEREAVVR EEQKTEEEQA KESVAKEDKK LNQSRHANSG 2820
DAPPSSLETS SLDRCFLREV SASFPARESV SDKSRNGEHE GEKRCRKELG TVRVKKQMTL 2880
WSSWGLAKKR EDPPEKSSQL TSLSKPRQEP SKSIPGDDKQ EANRASSQRS SLSARGLGDQ 2940
VSLDSGAQQS RLRATLRVGD EKAVDEERET SANVEAGEGE AASGHKVGKP EGTKTDGIRG 3000
DNPGDERRAG EEAGSWNSNL IFFSPLSVCS QADGEEQIEE ESRYSSCLAG ESCANVEQER 3060
VKGERGDAES ARHSPGQAKR ARDMPEGDRQ HEGEQKKFLD CVPLAAAEAS GVWSDIAGKT 3120
QKTPASEAPR GRPDEKTGDA EETRRVAACF SSSPLCRAEH ELFSEGESEL QAAMRLSVSD 3180
FEEEEEGDIL RKEEREVPGV ACDPARPSAS CAAASSPAAK EREDEGRSGV SEAARTADKR 3240
LERPGESTER TQGRERERST GSGGDVAAEE GDEWDPRPPL SLARSAGLKE EKSRDRDQRE 3300
RTEERREGKY AREGDETHRK RQTRRDSGRE LEMPNASREP RFPFCTPGPY ESVNLNLLCL 3360
HNSCDDEDFL LRVALEKVSS KPPPLDELER CFTTHGERFD LHWLDRRLAN TQKLQKLQPE 3420
LVRLLTSYRT RMATASRLAA SSSSAPPAPS ASASSSPSSA SASRSASFSG SSRSTSSVAP 3480
APRASSSFSV ARLAEGLGAR EKSFQTLKEK QEAGAVLLRQ ALRLFAKEKE HLLQKRAKCR 3540
EKREKDGPPL SSWLPAVETL RTKKRSDDAA LLPRLQPSYD WQVEALAALP NRSLPADNEA 3600
RKGEREDARK DEQKEKEIPV EFEALFNTLR ILAKQKVSST LPSSSSSLSS SLSSSLPSPR 3660
SSSLSGVRCA GSGASGGEGR EGRAGDEKGD KGAVSPDSVE GNACEDPEKR EREEERSVRE 3720
ALLQEELKEI EEWRQIFVEL LSMQPGTPKN PHVLRGPVDG FAVPARG 3767 
Gene Ontology
 GO:0003677; F:DNA binding; IEA:InterPro. 
Interpro
 IPR017956; AT_hook_DNA-bd_motif. 
Pfam
  
SMART
 SM00384; AT_hook 
PROSITE
  
PRINTS