CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032823
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcription factor with AP2 domain(S), putative 
Protein Synonyms/Alias
  
Gene Name
 ApiAP2 
Gene Synonyms/Alias
 PF13_0235 
Created Date
 July 27, 2013 
Organism
 Plasmodium falciparum (isolate 3D7) 
NCBI Taxa ID
 36329 
Lysine Modification
Position
Peptide
Type
References
399HDNYSIKKLYHNNNFacetylation[1]
Reference
 [1] Extensive lysine acetylation occurs in evolutionarily conserved metabolic pathways and parasite-specific functions during Plasmodium falciparum intraerythrocytic development.
 Miao J, Lawrence M, Jeffers V, Zhao F, Parker D, Ge Y, Sullivan WJ Jr, Cui L.
 Mol Microbiol. 2013 Aug;89(4):660-75. [PMID: 23796209
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3858 AA 
Protein Sequence
MQNKSEFNST NSKEFYQDEY DNVPCHRNEY INNQHKNVDY TNNNYVHEMS LINNTENLQN 60
SNNSNLISVK NELTQIYNNF PFTYEDVDIT CKEKKNDQIK MYEKENVNIY NEQNEQNEQN 120
DQNDQNEQNE QYEQNEQNEQ NEQYEQNEQS EQNNPFFLGS YKKENDIIPC DNNIIPCNNE 180
SYNLHINKRS TNEPLPKDEY LYNDKNYEIS NKNNNNNNNN NNNNNNNNNY SNNNYSNNNF 240
SNNLYVNKHD MVNYYMKDNN EKLLNPSNFN NIYHHNEYPD NNNNCNVNSF LKNNVGENNK 300
YNNYIVHDNM FDNINNVNCV NNNIHSESYN KPTIYKNVMN YHNKNNYDMV TYSNNNCNIS 360
NNNYEDIISP YNTYNDANCK FQNFNNYNMP MHDNYSIKKL YHNNNFYYNN NINNNHYQNN 420
YHHNNIYNNI DTAQNINIPF LNNYNENINT GLYNNMINNN MNTQYNATPM LNQMFLNYNN 480
DEHVKNIHNE NNLSSYKNIP IVQNNTFINN EESHFIEDKK LYYNNMIDSN NMDILNYKHP 540
IYNNMNNSVQ RKEKNVKTIP SYMNNLSDQH ISDKDNIVWK QTYNNINIVG DKKENIDEIY 600
DMNINISALN KEKSNINLEK YKTDCYNSNM MGPINMDNKM NGENIMSGDN KMNGGNIMSG 660
DNKMNGGNIM SGDNKMNGGN IMSGDNKMNG ENIMSGDNKM NGENIMSGEN IMSGENIMNG 720
ENIMNGDNKM NGENIMSDDN NYNSPWDVNF PNSLYNNNCI SNKSKYTNRS NSYELSLFKD 780
NKKNNINSNS NSNNNNNNNS NNNKYDDMLN KKMLYSIYLQ SLLNNLNNEI YNDNEHDDDI 840
STSVFYKDYL KRNNINNQLI LYNKKRNISN RFFNNIPNIN NYFSYLNYYI ESLRLLYNKL 900
LRNRNLILIL AKEIFSKNKK LDKTKKNIYT SNNEKYSYIR QLLSVLLPQP PIYPPFEIWF 960
FMSKKCASQL HKLHLTIYMV YQKIIYNFSN ILLQINNDMN ITKPNKIQKD INESNDYLNL 1020
YDHNENEKKN IKYDDKHEES YNQRLYNLKK TNLDTNQNDS NSQIQLDKPI ISNTLPTTDE 1080
KMIISLNSHN NDDRIINNMD TNMLIQINKK LEYIITSINN INKIIMNKKN VQNILQGHHN 1140
INEKMYNMHP INPIQTSNLR TFESISDNII SCDDENNINI IEDEENFKNT KENKNETTNE 1200
KTNETTNEKT NETTNETTNE KTNENTNETT NETTNENTND IDYINIDKEL SNNISTNKQI 1260
HINNNQIIAY NNLEYNSLFN KWNNIKSDIH NYIYEENPFN ILDNIKYELK DVYNQMMNLK 1320
KTDNILTIND KLLHNLLKLD NSTHQMDIPN ISYDNKWKGS SNNHNNDLNN NNNNYYHNYY 1380
LNNNNFNENK NVLEELLSNN NCFNRINNQF KNKDNFKEPN LMTHNDIADD ISICSNEDVN 1440
DIIYNINRTD DILKRLMYLS NNYYNNMIEY DNYILHSYQH NDHILNELNN NTVTYYNNIK 1500
KKKNFNNQLL PRTKRNEDFI FNNNHIVTPN NIYNNNNNNN KLADSEIMEN VMQTCNQNNY 1560
QFNRTESNDN LKNIPQENNF LLPNSNMEKF GELINVDMNN IENNINMFNN MYNNVNNNIN 1620
GNNINGNNIN GNNIIGNNIN GNNINGNNIN GNNINNNNIN SNNINSNNIN SNNINSNNIN 1680
SNNINSNNIN SNNINSNNIN SNNINSNNIN SNIYNNMYNM NYKNNVISSN NNYINNINNE 1740
QHSNPMNYNT YGYDLSNNVN HLFYNNNIMD GRYNISNNNI IQNENIKGIY PHIYKSNHMV 1800
NVETNNFDIY DNVPNKYTSM DNKDFVGEYI SDREERYYDI NILNDENNIN KNIINNNIND 1860
MNVYDNNSIH SNNNNNFDYI KNIDNNNNNI NDRVFDSRNS YECSFPLLTN GPYYYENENN 1920
TSSNILCNQD VNLTNNNFIN YESNISNNYN QLISDNNNEG NINMNKNFNV FNNNVLINNC 1980
TINDNNVDNN KMIDSNMIDG NMIDSNMIDS NMNGNNMNEN NMNENNIIEN NMNGNNIIEN 2040
NMNGNNIIEN NMNGNNIIEN NMNGNNIIDN NMIDNNLIDN NMIDNNLIDN NLIDNNMIDN 2100
NMMDNNMYDH INNFTINSPM IDMNSFEEDP KNYNMIHINE TDIINLTNED LINNSENYYK 2160
HDNTSSYLLN YQNCKNFELK NVSNVEIEQT NIMDSNNTYN KITEENETNS SHILLEEKIN 2220
KTNDMCKNDT EINDHSLAIL LKKNPSEKKK IMNNHHNNNK KNNKKKKKDD LHEHNNKEKL 2280
QNCHNEYLDS NDYNSDNNTN KDNVFVINNI KNTNCNINQN NDIISKNVDN SSKNNDISNE 2340
KDNINKMKLK KMLEITNKLI GKKYRGICYD PTRNGWSTFV YKDGVRYKKF FSSFKYGNLL 2400
AKKKCIEWRL KNLNPSSHAY SFSLKAKEEF NEVLNDNYKD VGILYDNKKD SNNNNNIKNN 2460
GNQDDDGNRD ILYVNAFLNI YNDENVKEKC FTDDNDNVKD NEKNTKEIKI IKNKIPPPKK 2520
KTVQNIRTNE KCNEEKEIYC GDNTNFNVTN NHNNNNNSNG NNSNGNNSNG NNSNMLDERQ 2580
HSYYNDTINE KKKKIINKNT INILHDDIEQ TYSKNNNLCD NNLSNLLNEK KNNALNTGKR 2640
KNDRENYTNN HMNTDSFLRN NSGDNLMDKN KNCKLFSNTL ENDETYSCDT QNVPDDVLKN 2700
NEYKRKKMCT VSEAKIIKDT NDFVDMMYTL VERNNTFDNN SDLKKDQMDN KKEQEDVEKD 2760
VKENEKNDIK EYDEHKDVHV DVEEGLKKTT RNHLININNC DTYNHENNNN QNYLIHNSTN 2820
NTKTKNKSCI DNIDLTLNNI NCEIPLIEDN YLYDKDKIMN DINTYENIED INGKHHNSCN 2880
FMNVDYKKDK HVTSYNDNLK NNEEDIFFNN SNKNNFVLHN NNSNKIKNDE NYFAYKDKLL 2940
KYMNTCKYTN DEEKHIFLQF LKIKTEWVLV ELNNLEEKYH HYFRNKIESF YKNYLLKVKN 3000
NKKQNENIKE LEIIFENKLN TPWMCMIFPY YFISYFNYKI FCILKCTKKK RVRKNTNLIK 3060
DKILTSKLKG VNFIKYKKAW CFTYVDVDDK KKKKIFPVND YGFVESKALS ILFRKSFFFH 3120
LNKIHNFLKN ILYNKNNISN TMINQGNTYN NINEYIDHYK KYEEFLVCYG KIIYFNEMKN 3180
IYICSENKQN ALNYMPKEIK RQICNKMCSH KEYNMDNSMD NSMDNNIDNN IDNNIDNNID 3240
NNIDNNIDNN IDNNLCKNLC NKIDPINLLI TTRYYFFKKS KIMNFPKGLV YLSGYFLWLV 3300
AFLNNEKQEI IISFSVRKYS YDIAKAKCLE CYYCLLHKYK FKPLNISGVI DIVLESDIEC 3360
KNYNLLDYSS EDIMSLEYLF LYFSPSSYVL HNNTLYKKIS RKNQELHKQY DKLFPGQYVH 3420
LNTYQTYEIT NNDLIDNNIK CANAEMVSNI YNDKLYQDDK NKFISPRNEQ NILGTMIYNN 3480
QEEYHDVNMD QINKNKEEPI TNITKNISND LSPLIYDHQN ITYKKNDKKS IPLCFSDDEE 3540
AKEINDKIKK KKDINLNCMS NLHNEISKKS ANINITNHNI NVQMNDSMNG HLVDEKIHMD 3600
NKNNTNKIDN NINNIHNINN IHNNNNSCSS CCFYVKHVNR ESDNGNSENL FTSKTFQDTY 3660
KQTYINNDYY KNILNKNICF FFDNNIQDIY LKKYYDELFN EKSNEDENIK NEHILFKRID 3720
KKEENIGIFI LLNCQWLSDS FVNNVNQIET KYADIYSFKN YLNTCEEVYN WKHKKNFIKD 3780
CAEISKKFPR IVGVHYDSYA TAWVVNCSFN KKRHDKKFSV KTFGFLQARK LAIEYRERWI 3840
QLKAFHRLNN LKKKTEKL 3858 
Gene Ontology
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro. 
Interpro
 IPR001471; AP2/ERF_dom. 
Pfam
 PF00847; AP2 
SMART
  
PROSITE
  
PRINTS