CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-032650
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcription factor S-II central domain-containing protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_065540 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position  Peptide          Type         References
1067      RREAASGKESGGTGS  acetylation  [1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1313 AA 
Protein Sequence
MVDPELMDNA ESGAVDVEDL SAPSLDPPPS PSSPPSPREL ARLEPSESPQ TPPVCLFQLA 60
GETFYLHDWV RLKSSGERDW VAQIVAYFPG SDSEKSGGVE DADGGSGGRI LCRWGWDPRD 120
LRREACPALP LQSYSRFEVI PAINFCDENP IQSIKAKVRV EKYEKWKRRA AFPPLVRCTA 180
DDEEDVDKDK SGQSGNAAQR DSDPGQTCMP SVLFYRHSHE IETGFHPSLV ADVFRFKRIS 240
RSATARGGSL SADLAVAGSP SSGTKSEHGG DEIAQAIDLG SEGEGPAAKG GFSDARSGDF 300
AAPTYRRYPS LRNPDVRHFF CCNCRSIFAP PDDVAPPVWP DPEEVSQAEG GRYLLDAGIS 360
ASYHPAVLPP EGPGSAAGPH SAGEATGPSR QREEAKATPQ PPQLKLLLLE EFRKSTGDRP 420
LSVLSPVQLP FVWMRTPAEG DFIILCWKCT GKRRKRREES AGRKPAVAAT SASRDSGVVN 480
RRKRRAETED EAGEGPPAGH SLRRRAGQRG EKPARENEAS HALSQQTTAQ KIAGDSMAFE 540
ETGQPKDRSE VSYGETGEKS QGEGSGSENT ARSGPRQEDA EPSRKRRKAA SGVARVAAAM 600
AAAEASSDAD MDEDSDADFV AHSEEDSGGR CSAAAAEPRA KLKRSSRVAG DAEPPEAASP 660
SRKARKESAP ANGTSSSSGP AARRKSSGTS AAAPLGSRRS SAGGGLLSGP GSGATRPGGP 720
SDWQAGNQAR VDRLTEAFRL GIRELEEEAK QSLPGEAGAD ALAATKASGV FGTKGAEGAA 780
ASAKAEGGEV AERLMTSPGS DSGAGRGSGG SGPEGEGGTE PSKTLPDLLF ATAEACAKAV 840
NAAFVSQHKT LDSRRQKQRF FELLSNLKRE NNQELRKKVL TGQISVRRLV TLESAELAPS 900
FVRKEREEER ERHFRQAVLL HEAPAAALAR LRKTHKGIEP VIEDPDLLQT SPPARGIAPP 960
IITVEKTAAA AKEKRIRVRA ETDLRGSSSP SQSSGSSSSD SDDSASASDS DGSESEETSG 1020
SEDGETSAES GSESESSCEL RREKLGGGAN GESGHLAQAR REAASGKESG GTGSGVQEQL 1080
ASLKRLFSQA TSGDFGADSC RTPAVGGERA SEGGILKRGD GDAGERGEEG TENDGVQKNR 1140
QQRERRVAFS EETHTADGPT GKGIVSKSKK RRHTLARFSS SASRSCATLP RLPELLPNPP 1200
EEDGTERAGN GRENGRKGEA AAARAHRQSS VLGVPLDCSS LSPDACKDRV RHLLERVPLV 1260
GMQPPPSRPG NEDGEGLQTA RTTPLLDYKA VVASMFRYLN VAFDRVTLDL QRR 1313 
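
The acetylation site listed under Lysine Modification (position 1067, peptide RREAASGKESGGTGS) can be cross-checked against the sequence block above. The following is a minimal sketch, not part of the CPLM record: it assumes each row of the block ends with the 1-based position of its last residue, and it copies only the two rows covering residues 961-1080. The helper names `parse_rows`, `residue_at` and `window` are hypothetical, and the 7-residue flank is inferred from the 15-residue peptide shown in this entry.

```python
# Two consecutive rows copied verbatim from the Protein Sequence block
# (the trailing number is the 1-based position of the row's last residue).
ROWS = """\
IITVEKTAAA AKEKRIRVRA ETDLRGSSSP SQSSGSSSSD SDDSASASDS DGSESEETSG 1020
SEDGETSAES GSESESSCEL RREKLGGGAN GESGHLAQAR REAASGKESG GTGSGVQEQL 1080
"""


def parse_rows(text):
    """Join consecutive rows; return (position of first residue, residues)."""
    residues = []
    last_position = None
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        # Assumption: the final token of every row is the running residue count.
        last_position = int(tokens[-1])
        residues.append("".join(tokens[:-1]))
    sequence = "".join(residues)
    start = last_position - len(sequence) + 1
    return start, sequence


def residue_at(start, sequence, position):
    """Residue at a 1-based protein position within the parsed fragment."""
    return sequence[position - start]


def window(start, sequence, position, flank=7):
    """Peptide spanning `flank` residues on each side of `position`."""
    i = position - start
    return sequence[max(0, i - flank): i + flank + 1]


start, fragment = parse_rows(ROWS)
site_residue = residue_at(start, fragment, 1067)
site_peptide = window(start, fragment, 1067)
print(start, site_residue, site_peptide)
```

If the record is internally consistent, the residue at position 1067 is a lysine (K) and the extracted window matches the peptide in the Lysine Modification table. Passing the full 1313-residue block to the same helper would give a start position of 1.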
Gene Ontology
 GO:0006351; P:transcription, DNA-dependent; IEA:InterPro. 
Interpro
 IPR003618; TFIIS_cen_dom. 
Pfam
 PF07500; TFIIS_M 
SMART
  
PROSITE
 PS51321; TFIIS_CENTRAL 
PRINTS