- C11orf1
-
Chromosome 11 open reading frame 1 Identifiers Symbols C11orf1; FLJ23499 External IDs MGI: 1915971 HomoloGene: 11242 GeneCards: C11orf1 Gene Gene Ontology Molecular function • molecular_function Cellular component • nucleus Biological process • biological_process Sources: Amigo / QuickGO Orthologs Species Human Mouse Entrez 64776 68721 Ensembl ENSG00000137720 ENSMUSG00000037971 UniProt Q9H5F2 Q9D131 RefSeq (mRNA) NM_022761 NM_023483.3 RefSeq (protein) NP_073598 NP_075972.2 Location (UCSC) Chr 11:
111.75 – 111.76 MbChr 9:
50.57 – 50.58 MbPubMed search [1] [2] Chromosome 11 open reading frame one, also known as C11orf1, is a protein-coding gene.[1] It has been found by yeast two hybrid screen to bind to SETDB1 a histone protein methyltranferase enzyme. SETDB1 has been implicated in Huntington's disease, a neurodegenerative disorder.[2]
C11orf1 is a nuclear protein with unknown function but has been shown to show preferential expression in some disease states in microarray data.[3][2]
Contents
Species distribution
C11orf1 shows conservation through mammals and orthologs can be found in sea squirts and sea anemone. The below table shows some orthologs found using BLAST.[4]
Species Organism Common Name NCBI Accession Sequence Identity Expected value Length (AAs) Gene Common Name Homo sapiens Human CAG33659 100% 8e−86 150 C11orf1 Bos taurus Bovine NP_001033266.1 85% 1e−70 149 UPF0686 protein C11orf1 homolog Canis lupus familiaris Dog XP_536577.1 88% 3e−68 485 PREDICTED: hypothetical protein XP_536577 [Canis familiaris] Mus musculus Mouse NP_075972.2 78% 4e−65 466 hypothetical protein LOC68721 [Mus musculus] Ciona intestinalis Sea Squirt XP_002127073.1 49% 3e−23 156 PREDICTED: similar to predicted protein [Ciona intestinalis] Gene
C11orf1 is located on chromosome 11 and is neighbored by:
- FDXACB1-201
- ALG9-201
- ALG9-202
- AP001781.5-201
Protein
Structure
This protein is part of the UPF0686 superfamily. This family is characterized by the presence of a domain of unknown function (DUF)1143 shared by the family.[5] This family DUF1143 has a domain that includes almost all,149, of the 150 amino acids in the human ortholog. C11orf1 has six spicesomal variants and one unspliced variant.
Predicted properties
The following properties of C11orf1 were predicted using bioinformatic analysis:
- Molecular Weight: 17.76 KDal[6]
- Isoelectric point: 8.38[7]
- Post-translational modification: twelve possible post-translational modifications are predicted:
- Two O-(N-acetylaminogalactosyl)-L-threonine Glycosylations at position 138 and 142 on the protein sequence[8]
- Two O-phospho-L-serine Phosphorylation sites at 112 and 141.[8]
- Four O-phospho-L-threonine Phosphorylation sites at 59, 99, 113, and 138.[8]
- Four O4'-phospho-L-tyrosine Phosphorylation sites at 64, 101, 105 and 143.[8]
Secondary structure is slightly in disagreement depending on the algorithm used to predict.
Tissue distribution
C11orf1 appears to be ubiquitously expressed at low levels but particularly high expression in the parathyroid. Expression data indicate expression in most tissues.[9] This gene has also been found in one experiment to be under expressed in Huntington's disease patients while SETDB1 is over-expressed.[10]
Binding partners
The human protein SET domain bifurcated 1, was found to be a binding partner for C11orf1 by Yeast Two Hybrid.[11]
References
- ^ "Entrez Gene: C11orf1". http://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=64776.
- ^ a b Thomas EA, Coppola G, Desplats PA, Tang B, Soragni E, et al. (June 2008). "CThe HDAC inhibitor 4b ameliorates the disease phenotype and transcriptional abnormalities in Huntington’s disease transgenic mice.". Proc. Natl. Acad. Sci. USA 105 (1): 15564–69. doi:10.1073/pnas.0804249105. PMC 2563081. PMID 18829438. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2563081.
- ^ name=Atlas micro>"E-AFMX-6: Transcription profiling of caudate nucleus, frontal cortex, and cerebellum samples from 44 Huntingtons disease HD-gene-positive cases and 36 age- and sex-matched controls". http://www.ebi.ac.uk/microarray-as/atlas/gene?gid=ENSG00000137720.
- ^ "BLAST: Basic Local Alignment Search Tool". National Center for Biotechnology Information, United States National Institutes of Health. http://blast.ncbi.nlm.nih.gov/Blast.cgi. Retrieved 2009-05-07.
- ^ "CDD: Conserved Domain Database (NCBI)". http://www.ncbi.nlm.nih.gov/Structure/cdd/cddsrv.cgi?uid=115278.
- ^ Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America 89 (6): 2002–6. doi:10.1073/pnas.89.6.2002. PMC 48584. PMID 1549558. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=48584.
- ^ "PI Program (Isoelectric Point Prediction)". http://www.embl-heidelberg.de/cgi/pi-wrapper.pl.
- ^ a b c d "UniProt Database". http://www.uniprot.org/uniprot/Q9H5F2.
- ^ "Unigene (EST profile viewer) Human C11orf1". http://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.17546.
- ^ name=Atlas micro>"E-AFMX-6 Transcription profiling of caudate nucleus, frontal cortex, and cerebellum samples from 44 Huntingtons disease HD-gene-positive cases and 36 age- and sex-matched controls". http://www.ebi.ac.uk/microarray-as/atlas/gene?gid=ENSG00000137720.
- ^ "SET domain, bifurcated 1 [Homo sapiens"]. Protein. National Center for Biotechnology Information, United States National Institutes of Health. http://www.ncbi.nlm.nih.gov/protein/168984228?log$=seqview_refseq_protein. Retrieved 2009-05-10.
Further reading
- Stelzl U, Worm U, Lalowski M, et al. (2005). "A human protein-protein interaction network: a resource for annotating the proteome.". Cell 122 (6): 957–68. doi:10.1016/j.cell.2005.08.029. PMID 16169070.
- Strausberg RL, Feingold EA, Grouse LH, et al. (2002). "Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences.". Proc. Natl. Acad. Sci. U.S.A. 99 (26): 16899–903. doi:10.1073/pnas.242603899. PMC 139241. PMID 12477932. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=139241.
- O'Brien KP, Tapia-Páez I, Stãhle-Bäckdahl M, et al. (2000). "Characterization of five novel human genes in the 11q13-q22 region.". Biochem. Biophys. Res. Commun. 273 (1): 90–4. doi:10.1006/bbrc.2000.2910. PMID 10873569.
- Gerhard DS, Wagner L, Feingold EA, et al. (2004). "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).". Genome Res. 14 (10B): 2121–7. doi:10.1101/gr.2596504. PMC 528928. PMID 15489334. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=528928.
- Ota T, Suzuki Y, Nishikawa T, et al. (2004). "Complete sequencing and characterization of 21,243 full-length human cDNAs.". Nat. Genet. 36 (1): 40–5. doi:10.1038/ng1285. PMID 14702039.
Categories:- Human proteins
- Chromosome 11 gene stubs
Wikimedia Foundation. 2010.