CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032624
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 MIF4G domain-containing protein, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_016290 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1199GSAGGAGKHDAHRRGacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2668 AA 
Protein Sequence
MSAVADGASS GTIHQSLQNS RVELFAEQSS SVGAGSGNHP LFECSDFLSS CDSALPAAPQ 60
YPPFSVGGTR ESPFVSGGPP TFGHPLSVSA SFAAPCSQRG TPAGGSFLSP AQSWLKTYFP 120
SENAVNSRSS LEPVILPGLP SASSPSGCVP AFPTFFPVPV GPTPLSGVAA PCRSLAKPQP 180
ASRLQLGEEP EGREAARTGR GGLFPSLGAP VTMPPPPASV ATPTAAESAN GGAAAPVAPS 240
PSGNAGPMRS QLRADAVEFR PAATTAAVVP GSFKGASGGV QAAPPAAPAG GVQHSGAAGA 300
TPASPSSMGL SGVNNGATPA SPSSATTYNS THQFNRNAQE FVPGRPFVVY PSSPSPPGAP 360
PPPMAGALAR SAPAQAAGLG PATAPGATQN VGVPPVPGVV RGASPPAGAP ATLPGYAPQV 420
VGRGAGHLAP GAHHSQPFIP VPGSLHFAPM PHPVPAQFAA PMNGALYAGQ PAPGPVDHLV 480
VAAVGAQSDA ALAAQLHDKA ATGAAVPASF VYPYAAPHPH LVGPGGPNCP PSEFLGVGAP 540
YTMGSPATGP GSGLGRGGRG GRLGRGGRTG DMVNYRVSGG AGRGAGRGIP NAPGGSGRLQ 600
GGAPHHGRGG AAAGPTQAGG AQGAAWGGQQ GGGQALGGPL GRTSHQGAGA AGASKASTAG 660
VSEEVKSQPP PPPAAAQGTT RMPAGTSFRD KLMQGNAKTT EHAEGAPGAS PAPAKTGATS 720
GAQERALPSS SLTPTSTSGT GTAVGGGASP SGSSTNDGTR SPVCSGTGAV PEATKTRPAT 780
APGAGALSAT PGSLSYAGAA AGAAGASRPS SVPQKLSEEA PKQGKTSPEK GTPRATGGTA 840
ATPSWSQKAG SAAAPVKENA AQAAAGSATA TKENAASTQT RRSSHAECAT PSRRGSIRPA 900
TPVVSAGQGR DRAGEASRPR GSDASDNAPV RGQEAAPEQE AVEKSTGESA TTAPQGPRSW 960
AERMKTNASK PVVVTPSPSA RKSPANASST NGGSGAAARA GASPVSPATK SGLGTGRGRG 1020
DAPMAPESTD QQGSSQTQAV EKKREQSVEV PSASAPAPGK PTGWAARVSA PSPQPAATAA 1080
APAAEKTESV SKEVGREGDN EAVANFVRGS RDDTKVAKAD GKKEVASAGE AASVVELSQE 1140
VAEAAASTAA PSGPMTWAER ARRVKEKPVP VAEPAKPSPP SSGGETVSEA RGSAGGAGKH 1200
DAHRRGEKDR RSREEGRGST SHGYAERAGH HRREHHRDHH RGSRGEASEE AAGPAASLQG 1260
QAHHAHEGAQ HRQGRGGNGA EPQAPESQPT QSTQPQGSAG ADQEGGARRR GSNAGAAALP 1320
AGDGKVAETA GPQKPAGCYA AVVAASQPAQ APRRWGPQAG QQPQAQQVAG GKPVQAAFEE 1380
ASTPAIPAGD KGTRPAPGGA WRAQSAAAVG GSGGSAAAGE GQGAARDAAG KEEEPHVLWP 1440
RNRPKLKKVV VEQEEEKEAP LLPGQEQKFA LPQGRKDQLR AEERRRAKQQ VLKQEEEDDE 1500
ATAMQPSPEA HGPTVGALGV AGGFLPPPPC VIPPPPPTGG APVSSRGFAT TASLVGSVFS 1560
TPTTLEPPEP AAAPAMSQNS SAVSLVACVS SADVQKPVPT VVVEAPAEPM GFAVSLAKKE 1620
AGAEAANVVT KLGNLPPVSA GFTEPPPPPP PSEDAFLGQA MKAEMYHKLS PRGRRPSDMS 1680
PRANRQAARD ECRSGRASRS PRGDNGQGRR LSASQVQKDL ELEAKRESHL SPRAASRSPR 1740
SLRTHSPASG SASPRDASPQ ANRLLLPDGA VAQREGQRSR ADSAHSSSPR RRCSRAASPA 1800
PVPPPLPAAG EFGDSPAASA GSSSPSDFPS PSTPGSATHR QPLAPSSTRS AGESPDALSS 1860
PQSKASSSPP SASSLLSDGP SRSAANGPVS AQGSGDACAG LYSKRLLVAY RLSKQAASVP 1920
ALISICAMSH AAATQQVQPG GDFWHHGRQG HGDDSWKSGG RGGSGRTGGD GLLHAMHRKN 1980
EGKRAMGSGS QVGNRNEDGW RVGDGSHSRQ ADGGRGSGSL RDWRRGEDSR GGLSAMAGGL 2040
GGFFNQERPD PGEFRRMQQS LPPPPSHAIL KASESSWVKK QAEQKKDEQQ QLLRRLKGHL 2100
NKLTLEKFEK LYPQILNAGI KEKEEVNALM KMVFEKAVSQ HHFIQMYVQL CSRLKDDLKQ 2160
ILADDAKGSY FRRILINQCE DSFVANLEPM RVPEGLDEDE AFEFAQLYKA RMKGNMIFVG 2220
ELLKSRMISH RILLECIDRL LQKRLECIDI SGGEDQGVPH MEALCAFLHT VGPFFENPKW 2280
KFYEEFCERI KVVQKLQKDE SLPFRVRCLL KDVLDNRAEK WRKKFAATKE GPTRLSELHQ 2340
QAQLEQQQQQ YSAGILGGKG RGFDTRGGDD GGWEMATSRR GVGPAKAKPG MNPMPMNANS 2400
RMGGGETSKP MSAFSALGRM RSEREKERES GRDDNLRREN WGSSFSKRSP ANASASATPV 2460
DASSSHTGSA SGPSSLLGRA PAEPSSPAKD SSRRKSGGEP AAPKTSSGAA PMSEEQQVAA 2520
ARSEMKDLVQ SMDLNTALEM LKEMNLSQSV HKEIFDEWLK YSIEALVADK EHVNKQRAIA 2580
FQLFVQVCQK GILQAEALKA CLKQFMSNSS AEEGEDGEAV EDSPYSDLKI DVPRLTDFMK 2640
EFLQTLEAAD PEHQVLSTST LDDLKGKL 2668 
Gene Ontology
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0003723; F:RNA binding; IEA:InterPro.
 GO:0016070; P:RNA metabolic process; IEA:InterPro. 
Interpro
 IPR016024; ARM-type_fold.
 IPR016021; MIF4-like_typ_1/2/3.
 IPR003890; MIF4G-like_typ-3. 
Pfam
 PF02854; MIF4G 
SMART
 SM00543; MIF4G 
PROSITE
  
PRINTS