Showing Protein Histone-lysine N-methyltransferase SETD1B (HMDBP09034)
Identification | |||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
HMDB Protein ID | HMDBP09034 | ||||||||||||||||
Secondary Accession Numbers |
|
||||||||||||||||
Name | Histone-lysine N-methyltransferase SETD1B | ||||||||||||||||
Synonyms |
|
||||||||||||||||
Gene Name | SETD1B | ||||||||||||||||
Protein Type | Unknown | ||||||||||||||||
Biological Properties | |||||||||||||||||
General Function | Involved in nucleotide binding | ||||||||||||||||
Specific Function | Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overalpping localization with SETD1A suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression. Specifically tri-methylates 'Lys-4' of histone H3 in vitro. | ||||||||||||||||
Pathways |
|
||||||||||||||||
Reactions |
|
||||||||||||||||
GO Classification |
|
||||||||||||||||
Cellular Location |
|
||||||||||||||||
Gene Properties | |||||||||||||||||
Chromosome Location | 12 | ||||||||||||||||
Locus | 12q24.31 | ||||||||||||||||
SNPs | SETD1B | ||||||||||||||||
Gene Sequence |
>5772 bp ATGGAGAACAGTCACCCCCCCCACCACCACCACCAGCAGCCCCCGCCGCAGCCCGGCCCT TCGGGCGAGAGGAGGAACCACCATTGGAGAAGTTACAAGTTGATGATTGACCCGGCTCTG AAAAAGGGGCATCATAAACTGTACCGCTACGATGGGCAGCATTTCAGCCTGGCGATGTCC AGCAACCGCCCGGTGGAAATTGTCGAAGATCCCCGGGTCGTCGGGATCTGGACCAAAAAC AAGGAGCTGGAGCTGTCGGTGCCCAAATTCAAGATCGATGAGTTCTACGTGGGCCCGGTG CCTCCGAAGCAGGTGACATTTGCCAAGCTGAATGATAACATCCGTGAAAACTTCCTGAGG GACATGTGCAAGAAGTATGGGGAGGTGGAGGAGGTGGAGATTTTGTACAACCCCAAGACC AAGAAGCACCTGGGCATCGCCAAGGTGGTCTTTGCCACGGTCCGGGGAGCCAAGGATGCC GTTCAGCACTTGCACAGCACTTCCGTCATGGGCAACATTATCCACGTGGAGCTGGACACC AAAGGGGAAACCCGAATGCGGTTCTATGAACTGTTGGTCACTGGCCGATACACCCCCCAG ACCCTCCCAGTGGGCGAGCTGGACGCTGTCTCTCCAATCGTGAATGAGACCCTGCAGCTG TCAGATGCCCTGAAGCGCCTCAAGGATGGAGGCCTGTCTGCAGGCTGTGGCTCCGGCTCC TCCTCTGTCACCCCCAATAGCGGTGGGACACCCTTCTCCCAGGACACAGCTTATTCCAGC TGCCGCCTGGACACACCCAACTCCTATGGACAGGGCACCCCGCTCACACCGCGCCTGGGC ACCCCTTTCTCACAGGACTCCAGCTACTCCAGCCGCCAGCCCACACCCTCATACCTCTTC AGCCAGGACCCTGCAGTGACCTTCAAGGCCCGGCGCCACGAGAGCAAGTTCACGGACGCC TACAACCGCCGCCACGAACATCATTATGTACACAATTCTCCCGCGGTCACTGCGGTGGCC GGGGCCACAGCCGCTTTCCGGGGTTCCTCGGACCTCCCGTTCGGAGCAGTCGGCGGCACT GGGGGCAGCAGCGGTCCCCCGTTCAAGGCTCAACCACAGGATTCAGCCACATTTGCCCAC ACTCCACCACCCGCCCAAGCAACCCCTGCTCCTGGATTCAAGTCTGCTTTCTCTCCGTAT CAGACCCCAGTGGCCCACTTCCCTCCACCCCCGGAAGAGCCCACCGCCACAGCCGCTTTT GGGGCCCGCGACAGTGGGGAGTTCCGGAGGGCACCGGCGCCCCCACCCCTGCCACCTGCT GAGCCTCTGGCCAAGGAGAAGCCAGGCACGCCACCCGGCCCGCCGCCCCCCGACACCAAC AGCATGGAGCTGGGCGGCCGGCCCACCTTCGGCTGGAGTCCTGAGCCCTGTGACAGCCCT GGCACGCCCACGCTGGAGTCGTCCCCTGCAGGGCCAGAGAAACCCCACGACAGCCTGGAC TCGCGCATCGAGATGCTGCTGAAGGAGCAGCGCACCAAGCTGCTCTTCCTGAGGGAGCCG GACTCGGACACCGAGCTGCAGATGGAGGGCAGCCCCATCTCCTCCTCCTCCTCCCAGCTC TCCCCACTGGCCCCCTTTGGCACCAACTCCCAGCCAGGCTTCCGGGGCCCCACGCCCCCC TCGTCACGCCCCTCCAGCACCGGCCTGGAGGATATCAGCCCAACACCCCTCCCAGACTCC GACGAGGACGAGGAGCTCGACCTGGGCCTTGGGCCTCGGCCTCCACCTGAGCCAGGCCCC CCGGACCCTGCTGGGCTTCTGAGCCAGACAGCTGAGGTGGCCTTGGACCTGGTTGGAGAC AGAACCCCGACCTCAGAGAAGATGGATGAGGGCCAGCAGTCCTCAGGCGAGGACATGGAG ATCTCGGATGACGAGATGCCCTCGGCCCCCATCACCAGCGCTGACTGCCCCAAGCCCATG GTGGTGACCCCAGGAGCGGCAGCCGTGGCAGCCCCTTCTGTGCTAGCCCCAACCCTGCCG CTGCCCCCGCCACCTGGCTTCCCCCCGCTGCCCCCCCCACCACCACCACCCCCACCGCAG CCTGGCTTCCCCATGCCCCCACCGCTGCCCCCACCGCCGCCCCCACCCCCTCCAGCCCAC CCTGCTGTGACAGTGCCCCCACCACCCTTGCCAGCGCCGCCTGGAGTCCCGCCCCCACCC ATCCTGCCACCACTGCCCCCCTTTCCGCCGGGCCTGTTCCCTGTGATGCAGGTGGACATG AGCCACGTGCTGGGTGGCCAGTGGGGCGGCATGCCCATGTCCTTCCAGATGCAAACGCAG GTGCTCAGCCGGCTGATGACGGGCCAGGGCGCCTGCCCCTACCCGCCCTTCATGGCCGCT GCGGCCGCCGCTGCCTCAGCTGGGCTCCAGTTTGTCAACCTGCCGCCCTACCGGGGCCCC TTCTCCCTGAGCAACTCCGGCCCAGGCCGCGGGCAGCACTGGCCACCACTGCCCAAGTTT GACCCGTCAGTGCCTCCACCAGGCTACATGCCACGCCAGGAGGACCCACACAAAGCCACG GTGGATGGCGTCCTGCTGGTGGTCCTCAAAGAACTCAAGGCCATCATGAAGCGTGACCTG AACCGCAAGATGGTGGAAGTGGTGGCTTTCCGGGCCTTTGACGAGTGGTGGGACAAGAAG GAGCGGATGGCCAAGGCCTCGCTGACCCCGGTGAAGTCGGGCGAGCACAAGGACGAGGAC AGGCCGAAGCCCAAGGACCGCATCGCCTCGTGCCTGCTGGAGTCATGGGGCAAGGGCGAG GGCCTGGGCTACGAGGGCCTGGGCCTGGGCATTGGGCTGCGTGGGGCCATTCGCCTGCCC TCCTTCAAGGTCAAGAGGAAGGAGCCACCAGACACCACCTCATCTGGCGACCAGAAGCGG CTGCGGCCCTCGACCTCTGTGGATGAGGAAGATGAAGAGTCCGAGCGAGAGCGAGACCGG GATATGGCAGACACCCCCTGTGAGCTCGCCAAGCGGGACCCCAAGGGCGTGGGTGTGCGG CGGCGGCCGGCGCGGCCTCTGGAGCTGGACAGTGGTGGGGAGGAGGACGAGAAGGAGTCA TTGTCGGAGGAACAGGAGAGCACCGAGGAGGAAGAGGAGGCGGAGGAGGAGGAGGAGGAG GAAGATGACGACGATGACGACAGTGATGACCGGGACGAGTCTGAGAACGATGACGAGGAC ACAGCCCTGTCAGAGGCGAGTGAGAAGGACGAAGGGGACTCGGATGAAGAGGAGACAGTG AGCATTGTAACCTCCAAGGCCGAAGCCACGTCGTCCAGTGAGAGTTCCGAGTCTTCTGAG TTTGAGTCAAGCTCCGAGTCCTCGCCCTCATCCTCGGAGGATGAGGAGGAGGTAGTGGCC AGGGAAGAGGAGGAAGAAGAGGAGGAGGAGGAGATGGTGGCCGAGGAAAGCATGGCTTCT GCAGGCCCTGAGGACTTTGAGCAGGACGGGGAGGAAGCGGCTCTGGCCCCGGGGGCACCT GCAGTGGACTCGTTGGGCATGGAAGAGGAGGTGGACATCGAGACTGAGGCTGTGGCCCCT GAGGAGCGGCCCTCCATGCTGGACGAGCCCCCCTTGCCTGTGGGTGTTGAAGAGCCAGCG GACTCCAGGGAGCCGCCTGAGGAACCAGGCCTGAGCCAGGAAGGGGCCATGTTGCTGTCT CCAGAGCCCCCTGCCAAGGAGGTGGAGGCTCGACCCCCATTGTCCCCTGAGCGAGCTCCA GAACATGACCTGGAAGTGGAGCCGGAGCCCCCTATGATGCTCCCCTTGCCGCTGCAACCA CCATTGCCGCCCCCACGACCACCCCGGCCACCCAGCCCACCGCCGGAGCCTGAGACCACA GATGCCTCACACCCATCTGTCCCTCCGGAGCCCCTTGCCGAGGACCACCCCCCGCATACT CCAGGCCTCTGTGGCAGCCTGGCCAAGTCGCAGAGCACAGAGACGGTGCCAGCCACACCA GGCGGGGAGCCCCCGCTATCAGGGGGCAGCAGTGGCCTGTCCCTGAGCTCTCCGCAAGTG CCCGGCAGCCCCTTCTCCTACCCAGCCCCGTCCCCTAGCTTGAGCAGTGGGGGCCTCCCT CGGACACCTGGCCGGGACTTCAGCTTCACACCCACCTTCTCCGAGCCCAGCGGGCCCTTG CTCCTGCCCGTCTGCCCACTCCCCACTGGCCGACGCGATGAACGCTCCGGGCCCCTGGCC TCCCCGGTGCTCCTGGAGACGGGCCTGCCCCTCCCTCTGCCCCTTCCCCTGCCCTTGCCC TTGGCATTGCCCGCCGTCTTGCGGGCCCAGGCTCGTGCGCCCACCCCGCTGCCACCCCTG CTGCCCGCCCCCCTGGCCTCTTGCCCTCCCCCAATGAAGAGGAAGCCGGGCCGGCCCCGG CGATCCCCACCATCTATGCTCTCCTTGGATGGGCCCTTGGTCCGACCACCAGCAGGGGCC GCCCTTGGAAGGGAACTCCTGCTCCTGCCGGGCCAGCCACAGACCCCCGTCTTCCCCAGC ACCCATGACCCCCGGACGGTGACCCTGGACTTCCGGAACGCGGGGATCCCAGCCCCTCCA CCACCCCTTCCCCCCCAGCCACCCCCACCCCCACCTCCCCCACCTGTAGAGCCCACCAAG CTGCCCTTTAAGGAGCTAGACAACCAGTGGCCCTCCGAGGCCATTCCTCCGGGCCCCCGT GGGCGCGATGAGGTCACTGAGGAATACATGGAGTTGGCCAAGAGCCGGGGGCCGTGGCGC CGGCCACCTAAGAAGCGCCATGAGGACCTGGTGCCACCTGCGGGCTCGCCCGAACTCTCG CCACCCCAGCCCCTCTTCCGGCCCCGCTCGGAGTTTGAGGAGATGACCATCCTGTATGAC ATCTGGAACGGTGGCATCGATGAGGAGGACATCCGCTTCCTGTGTGTCACCTACGAGCGA CTGCTACAGCAGGACAATGGCATGGACTGGCTTAACGACACGCTCTGGGTCTACCATCCC TCCACCAGCCTCTCTTCAGCTAAGAAGAAGAAACGGGACGATGGCATCCGCGAGCACGTG ACGGGCTGTGCCCGCAGTGAGGGCTTCTACACCATCGACAAGAAGGACAAGCTCAGATAC CTCAACAGCAGCCGTGCCAGCACCGATGAGCCCCCCGCAGACACCCAGGGCATGAGCATC CCAGCACAGCCCCACGCCTCCACCCGGGCAGGCTCGGAGCGGCGTTCGGAGCAGCGCCGC CTGCTGTCCTCCTTCACTGGCAGCTGTGACAGTGACCTGCTCAAGTTCAACCAGCTCAAG TTCCGGAAGAAAAAGCTCAAGTTCTGCAAGAGCCACATTCACGACTGGGGCTTGTTCGCC ATGGAGCCCATCGCGGCTGACGAGATGGTCATCGAGTACGTGGGCCAGAATATCCGTCAG GTGATCGCAGACATGCGGGAGAAGCGTTATGAGGACGAGGGCATCGGGAGCAGCTACATG TTCCGGGTGGACCATGACACCATCATCGACGCCACCAAGTGCGGCAACTTCGCGCGCTTC ATCAACCACAGCTGCAACCCCAACTGCTATGCCAAGGTGATCACGGTGGAGTCACAGAAG AAGATAGTCATCTACTCGAAGCAGCACATTAACGTCAATGAGGAGATTACCTATGACTAT AAGTTCCCCATCGAGGACGTCAAGATCCCCTGCCTCTGTGGCTCCGAGAACTGCCGGGGG ACCCTCAACTAG |
||||||||||||||||
Protein Properties | |||||||||||||||||
Number of Residues | 1923 | ||||||||||||||||
Molecular Weight | 208729.73 | ||||||||||||||||
Theoretical pI | 4.954 | ||||||||||||||||
Pfam Domain Function | |||||||||||||||||
Signals | Not Available | ||||||||||||||||
Transmembrane Regions | Not Available | ||||||||||||||||
Protein Sequence |
>Histone-lysine N-methyltransferase SETD1B MENSHPPHHHHQQPPPQPGPSGERRNHHWRSYKLMIDPALKKGHHKLYRYDGQHFSLAMS SNRPVEIVEDPRVVGIWTKNKELELSVPKFKIDEFYVGPVPPKQVTFAKLNDNIRENFLR DMCKKYGEVEEVEILYNPKTKKHLGIAKVVFATVRGAKDAVQHLHSTSVMGNIIHVELDT KGETRMRFYELLVTGRYTPQTLPVGELDAVSPIVNETLQLSDALKRLKDGGLSAGCGSGS SSVTPNSGGTPFSQDTAYSSCRLDTPNSYGQGTPLTPRLGTPFSQDSSYSSRQPTPSYLF SQDPAVTFKARRHESKFTDAYNRRHEHHYVHNSPAVTAVAGATAAFRGSSDLPFGAVGGT GGSSGPPFKAQPQDSATFAHTPPPAQATPAPGFKSAFSPYQTPVAHFPPPPEEPTATAAF GARDSGEFRRAPAPPPLPPAEPLAKEKPGTPPGPPPPDTNSMELGGRPTFGWSPEPCDSP GTPTLESSPAGPEKPHDSLDSRIEMLLKEQRTKLLFLREPDSDTELQMEGSPISSSSSQL SPLAPFGTNSQPGFRGPTPPSSRPSSTGLEDISPTPLPDSDEDEELDLGLGPRPPPEPGP PDPAGLLSQTAEVALDLVGDRTPTSEKMDEGQQSSGEDMEISDDEMPSAPITSADCPKPM VVTPGAAAVAAPSVLAPTLPLPPPPGFPPLPPPPPPPPPQPGFPMPPPLPPPPPPPPPAH PAVTVPPPPLPAPPGVPPPPILPPLPPFPPGLFPVMQVDMSHVLGGQWGGMPMSFQMQTQ VLSRLMTGQGACPYPPFMAAAAAAASAGLQFVNLPPYRGPFSLSNSGPGRGQHWPPLPKF DPSVPPPGYMPRQEDPHKATVDGVLLVVLKELKAIMKRDLNRKMVEVVAFRAFDEWWDKK ERMAKASLTPVKSGEHKDEDRPKPKDRIASCLLESWGKGEGLGYEGLGLGIGLRGAIRLP SFKVKRKEPPDTTSSGDQKRLRPSTSVDEEDEESERERDRDMADTPCELAKRDPKGVGVR RRPARPLELDSGGEEDEKESLSEEQESTEEEEEAEEEEEEEDDDDDDSDDRDESENDDED TALSEASEKDEGDSDEEETVSIVTSKAEATSSSESSESSEFESSSESSPSSSEDEEEVVA REEEEEEEEEEMVAEESMASAGPEDFEQDGEEAALAPGAPAVDSLGMEEEVDIETEAVAP EERPSMLDEPPLPVGVEEPADSREPPEEPGLSQEGAMLLSPEPPAKEVEARPPLSPERAP EHDLEVEPEPPMMLPLPLQPPLPPPRPPRPPSPPPEPETTDASHPSVPPEPLAEDHPPHT PGLCGSLAKSQSTETVPATPGGEPPLSGGSSGLSLSSPQVPGSPFSYPAPSPSLSSGGLP RTPGRDFSFTPTFSEPSGPLLLPVCPLPTGRRDERSGPLASPVLLETGLPLPLPLPLPLP LALPAVLRAQARAPTPLPPLLPAPLASCPPPMKRKPGRPRRSPPSMLSLDGPLVRPPAGA ALGRELLLLPGQPQTPVFPSTHDPRTVTLDFRNAGIPAPPPPLPPQPPPPPPPPPVEPTK LPFKELDNQWPSEAIPPGPRGRDEVTEEYMELAKSRGPWRRPPKKRHEDLVPPAGSPELS PPQPLFRPRSEFEEMTILYDIWNGGIDEEDIRFLCVTYERLLQQDNGMDWLNDTLWVYHP STSLSSAKKKKRDDGIREHVTGCARSEGFYTIDKKDKLRYLNSSRASTDEPPADTQGMSI PAQPHASTRAGSERRSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFA MEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARF INHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRG TLN |
||||||||||||||||
External Links | |||||||||||||||||
GenBank ID Protein | 210032580 | ||||||||||||||||
UniProtKB/Swiss-Prot ID | Q9UPS6 | ||||||||||||||||
UniProtKB/Swiss-Prot Entry Name | SET1B_HUMAN | ||||||||||||||||
PDB IDs | |||||||||||||||||
GenBank Gene ID | NM_015048.1 | ||||||||||||||||
GeneCard ID | SETD1B | ||||||||||||||||
GenAtlas ID | SETD1B | ||||||||||||||||
HGNC ID | HGNC:29187 | ||||||||||||||||
References | |||||||||||||||||
General References |
|