High Mobility Group Proteins
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
A 23-kDa HMG-box protein that binds to and distorts the minor grove of DNA.
RNA, Transfer, Asn
A transfer RNA which is specific for carrying asparagine to sites on the ribosomes in preparation for protein synthesis.
DNA-binding domains present in proteins of the HMG-box superfamily including the archetypal HMGB PROTEINS, a number of sequence specific TRANSCRIPTION FACTORS, and other DNA-BINDING PROTEINS. The domains consist of 70-80 amino acids that form an L-shaped fold from three alpha-helical segments. The domain has the capacity to recognize and/or induce specific DNA structures and effect the accessibility of the DNA to other proteins involved in transcription, recombination, or DNA repair. (Note that not all HIGH MOBILITY GROUP PROTEINS contain this domain.)
Protein Structure, Tertiary
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
Amino Acid Sequence
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
T-Box Domain Proteins
Proteins containing a region of conserved sequence, about 200 amino acids long, which encodes a particular sequence specific DNA binding domain (the T-box domain). These proteins are transcription factors that control developmental pathways. The prototype of this family is the mouse Brachyury (or T) gene product.
Part of a MESSENGER RNA molecule that undergoes a conformation change upon binding a specific metabolite or other small molecule thereby regulating the messenger RNA's transcription, post-transcriptional processing, transport, translation, or stability in response to varying levels of the metabolite or other small molecule.
LIM Domain Proteins
A large class of structurally-related proteins that contain one or more LIM zinc finger domains. Many of the proteins in this class are involved in intracellular signaling processes and mediate their effects via LIM domain protein-protein interactions. The name LIM is derived from the first three proteins in which the motif was found: LIN-11, Isl1 and Mec-3.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
Cell Cycle Proteins
Proteins that control the CELL DIVISION CYCLE. This family of proteins includes a wide variety of classes, including CYCLIN-DEPENDENT KINASES, mitogen-activated kinases, CYCLINS, and PHOSPHOPROTEIN PHOSPHATASES as well as their putative substrates such as chromatin-associated proteins, CYTOSKELETAL PROTEINS, and TRANSCRIPTION FACTORS.
Sequence Homology, Amino Acid
The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species.
Fas-Associated Death Domain Protein
A signal-transducing adaptor protein that associates with TNF RECEPTOR complexes. It contains a death effector domain that can interact with death effector domains found on INITIATOR CASPASES such as CASPASE 8 and CASPASE 10. Activation of CASPASES via interaction with this protein plays a role in the signaling cascade that leads to APOPTOSIS.
A conserved A-T rich sequence which is contained in promoters for RNA polymerase II. The segment is seven base pairs long and the nucleotides most commonly found are TATAAAA.
MADS Domain Proteins
A superfamily of proteins that share a highly conserved MADS domain sequence motif. The term MADS refers to the first four members which were MCM1 PROTEIN; AGAMOUS 1 PROTEIN; DEFICIENS PROTEIN; and SERUM RESPONSE FACTOR. Many MADS domain proteins have been found in species from all eukaryotic kingdoms. They play an important role in development, especially in plants where they have an important role in flower development.
Recombinant Fusion Proteins
Recombinant proteins produced by the GENETIC TRANSLATION of fused genes formed by the combination of NUCLEIC ACID REGULATORY SEQUENCES of one or more genes with the protein coding sequences of one or more genes.
5' Untranslated Regions
The sequence at the 5' end of the messenger RNA that does not code for product. This sequence contains the ribosome binding site and other transcription and translation regulating sequences.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
Protein Interaction Domains and Motifs
src Homology Domains
Regions of AMINO ACID SEQUENCE similarity in the SRC-FAMILY TYROSINE KINASES that fold into specific functional tertiary structures. The SH1 domain is a CATALYTIC DOMAIN. SH2 and SH3 domains are protein interaction domains. SH2 usually binds PHOSPHOTYROSINE-containing proteins and SH3 interacts with CYTOSKELETAL PROTEINS.
Transport proteins that carry specific substances in the blood or across cell membranes.
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
Nucleic Acid Conformation
Protein interaction domains of about 70-90 amino acid residues, named after a common structure found in PSD-95, Discs Large, and Zona Occludens 1 proteins. PDZ domains are involved in the recruitment and interaction of proteins, and aid the formation of protein scaffolds and signaling networks. This is achieved by sequence-specific binding between a PDZ domain in one protein and a PDZ motif in another protein.
The small RNA molecules, 73-80 nucleotides long, that function during translation (TRANSLATION, GENETIC) to align AMINO ACIDS at the RIBOSOMES in a sequence determined by the mRNA (RNA, MESSENGER). There are about 30 different transfer RNAs. Each recognizes a specific CODON set on the mRNA through its own ANTICODON and as aminoacyl tRNAs (RNA, TRANSFER, AMINO ACYL), each carries a specific amino acid to the ribosome to add to the elongating peptide chains.
Two-Hybrid System Techniques
Screening techniques first developed in yeast to identify genes encoding interacting proteins. Variations are used to evaluate interplay between proteins and other molecules. Two-hybrid techniques refer to analysis for protein-protein interactions, one-hybrid for DNA-protein interactions, three-hybrid interactions for RNA-protein interactions or ligand-based interactions. Reverse n-hybrid techniques refer to analysis for mutations or other small molecules that dissociate known interactions.
Amino Acid Motifs
Proteins prepared by recombinant DNA technology.
The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain).
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS which are proteins conjugated with nucleic acids, that are not necessarily present in the nucleus.
Protein Structure, Secondary
The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to alpha helices, beta strands (which align to form beta sheets) or other types of coils. This is the first folding level of protein conformation.
Adaptor Proteins, Signal Transducing
A broad category of carrier proteins that play a role in SIGNAL TRANSDUCTION. They generally contain several modular domains, each of which having its own binding activity, and act by forming complexes with other intracellular-signaling molecules. Signal-transducing adaptor proteins lack enzyme activity, however their activity can be modulated by other signal-transducing enzymes
The intracellular transfer of information (biological activation/inhibition) through a signal pathway. In each signal transduction system, an activation/inhibition signal from a biologically active molecule (hormone, neurotransmitter) is mediated via the coupling of a receptor/enzyme to a second messenger system or to an ion channel. Signal transduction plays an important role in activating cellular functions, cell differentiation, and cell proliferation. Examples of signal transduction systems are the GAMMA-AMINOBUTYRIC ACID-postsynaptic receptor-calcium ion channel system, the receptor-mediated T-cell activation pathway, and the receptor-mediated activation of phospholipases. Those coupled to membrane depolarization or intracellular release of calcium include the receptor-mediated activation of cytotoxic functions in granulocytes and the synaptic potentiation of protein kinase activation. Some signal transduction pathways may be part of larger signal transduction pathways; for example, protein kinase activation is part of the platelet activation signal pathway.
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
Promoter Regions, Genetic
TNF Receptor-Associated Death Domain Protein
A 34 kDa signal transducing adaptor protein that associates with TUMOR NECROSIS FACTOR RECEPTOR TYPE 1. It facilitates the recruitment of signaling proteins such as TNF RECEPTOR-ASSOCIATED FACTOR 2 and FAS ASSOCIATED DEATH DOMAIN PROTEIN to the receptor complex.
The first continuously cultured human malignant CELL LINE, derived from the cervical carcinoma of Henrietta Lacks. These cells are used for VIRUS CULTIVATION and antitumor drug screening assays.
The process of moving proteins from one cellular compartment (including extracellular) to another by various sorting and transport mechanisms such as gated transport, protein translocation, and vesicular transport.
Process of generating a genetic MUTATION. It may occur spontaneously or be induced by MUTAGENS.
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
The lipid- and protein-containing, selectively permeable membrane that surrounds the cytoplasm in prokaryotic and eukaryotic cells.
POU Domain Factors
A family of transcription factors characterized by the presence of a bipartite DNA-binding domain known as the POU domain. The POU domain contains two subdomains, a POU-specific domain and a POU-homeodomain. The POU domain was originally identified as a region of approximately 150 amino acids shared between the Pit-1, Oct-1, Oct-2, and Unc-86 transcription factors.
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
The part of a cell that contains the CYTOSOL and small structures excluding the CELL NUCLEUS; MITOCHONDRIA; and large VACUOLES. (Glick, Glossary of Biochemistry and Molecular Biology, 1990)
Extrachromosomal, usually CIRCULAR DNA molecules that are self-replicating and transferable from one organism to another. They are found in a variety of bacterial, archaeal, fungal, algal, and plant species. They are used in GENETIC ENGINEERING as CLONING VECTORS.
Intracellular Signaling Peptides and Proteins
Proteins and peptides that are involved in SIGNAL TRANSDUCTION within the cell. Included here are peptides and proteins that regulate the activity of TRANSCRIPTION FACTORS and cellular processes in response to signals from CELL SURFACE RECEPTORS. Intracellular signaling peptide and proteins may be part of an enzymatic signaling cascade or act through binding to and modifying the action of other signaling factors.
Gene Expression Regulation
Motifs in DNA- and RNA-binding proteins whose amino acids are folded into a single structural unit around a zinc atom. In the classic zinc finger, one zinc atom is bound to two cysteines and two histidines. In between the cysteines and histidines are 12 residues which form a DNA binding fingertip. By variations in the composition of the sequences in the fingertip and the number and spacing of tandem repeats of the motif, zinc fingers can form a large number of different sequence specific binding sites.
Within a eukaryotic cell, a membrane-limited body which contains chromosomes and one or more nucleoli (CELL NUCLEOLUS). The nuclear membrane consists of a double unit-type membrane which is perforated by a number of pores; the outermost membrane is continuous with the ENDOPLASMIC RETICULUM. A cell may contain more than one nucleus. (From Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
Gene Expression Regulation, Developmental
Caenorhabditis elegans Proteins
Hypoxia-Inducible Factor-Proline Dioxygenases
Amino Acid Substitution
The naturally occurring or experimentally induced replacement of one or more AMINO ACIDS in a protein with another. If a functionally equivalent amino acid is substituted, the protein may retain wild-type activity. Substitution may also diminish, enhance, or eliminate protein function. Experimentally induced substitution is often used to study enzyme activities and binding site properties.
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
A molecule that binds to another molecule, used especially to refer to a small molecule that binds specifically to a larger molecule, e.g., an antigen binding to an antibody, a hormone or neurotransmitter binding to a receptor, or a substrate or allosteric effector binding to an enzyme. Ligands are also molecules that donate or accept a pair of electrons to form a coordinate covalent bond with the central metal atom of a coordination complex. (From Dorland, 27th ed)
Gene Expression Regulation, Plant
Octamer Transcription Factor-6
Protein Structure, Quaternary
The characteristic 3-dimensional shape and arrangement of multimeric proteins (aggregates of more than one polypeptide chain).
Major constituent of the cytoskeleton found in the cytoplasm of eukaryotic cells. They form a flexible framework for the cell, provide attachment points for organelles and formed bodies, and make communication between parts of the cell possible.
A diverse class of enzymes that interact with UBIQUITIN-CONJUGATING ENZYMES and ubiquitination-specific protein substrates. Each member of this enzyme group has its own distinct specificity for a substrate and ubiquitin-conjugating enzyme. Ubiquitin-protein ligases exist as both monomeric proteins multiprotein complexes.
Members of the class of compounds composed of AMINO ACIDS joined together by peptide bonds between adjacent amino acids into linear, branched or cyclical structures. OLIGOPEPTIDES are composed of approximately 2-12 amino acids. Polypeptides are composed of approximately 13 or more amino acids. PROTEINS are linear polypeptides that are normally synthesized on RIBOSOMES.
A process whereby multiple RNA transcripts are generated from a single gene. Alternative splicing involves the splicing together of other possible sets of EXONS during the processing of some, but not all, transcripts of the gene. Thus a particular exon may be connected to any one of several alternative exons to form a mature RNA. The alternative forms of mature MESSENGER RNA produce PROTEIN ISOFORMS in which one part of the isoforms is common while the other parts are different.
Different forms of a protein that may be produced from different GENES, or from the same gene by ALTERNATIVE SPLICING.
A cell line generated from human embryonic kidney cells that were transformed with human adenovirus type 5.
Green Fluorescent Proteins
Genes whose expression is easily detectable and therefore used to study promoter activity at many positions in a target genome. In recombinant DNA technology, these genes may be attached to a promoter region of interest.
Nuclear Magnetic Resonance, Biomolecular
NMR spectroscopy on small- to medium-size biological macromolecules. This is often used for structural investigation of proteins and nucleic acids, and often involves more than one isotope.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
Death Domain Receptor Signaling Adaptor Proteins
Sequence Homology, Nucleic Acid
Cell lines whose original growing procedure consisted being transferred (T) every 3 days and plated at 300,000 cells per plate (J Cell Biol 17:299-313, 1963). Lines have been developed using several different strains of mice. Tissues are usually fibroblasts derived from mouse embryos but other types and sources have been developed as well. The 3T3 lines are valuable in vitro host systems for oncogenic virus transformation studies, since 3T3 cells possess a high sensitivity to CONTACT INHIBITION.
Sequence Analysis, DNA
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
Basic Helix-Loop-Helix Transcription Factors
A family of DNA-binding transcription factors that contain a basic HELIX-LOOP-HELIX MOTIF.
A gene silencing phenomenon whereby specific dsRNAs (RNA, DOUBLE-STRANDED) trigger the degradation of homologous mRNA (RNA, MESSENGER). The specific dsRNAs are processed into SMALL INTERFERING RNA (siRNA) which serves as a guide for cleavage of the homologous mRNA in the RNA-INDUCED SILENCING COMPLEX. DNA METHYLATION may also be triggered during this process.
Protein Interaction Mapping
Methods for determining interaction between PROTEINS.
Repetitive Sequences, Amino Acid
Electrophoresis, Polyacrylamide Gel
CELL LINE derived from the ovary of the Chinese hamster, Cricetulus griseus (CRICETULUS). The species is a favorite for cytogenetic studies because of its small chromosome number. The cell line has provided model systems for the study of genetic alterations in cultured mammalian cells.
One of the mechanisms by which CELL DEATH occurs (compare with NECROSIS and AUTOPHAGOCYTOSIS). Apoptosis is the mechanism responsible for the physiological deletion of cells and appears to be intrinsically programmed. It is characterized by distinctive morphologic changes in the nucleus and cytoplasm, chromatin cleavage at regularly spaced sites, and the endonucleolytic cleavage of genomic DNA; (DNA FRAGMENTATION); at internucleosomal sites. This mode of cell death serves as a balance to mitosis in regulating the size of animal tissues and in mediating pathologic processes associated with tumor growth.
Escherichia coli Proteins
Proteins obtained from ESCHERICHIA COLI.
RING Finger Domains
A zinc-binding domain defined by the sequence Cysteine-X2-Cysteine-X(9-39)-Cysteine-X(l-3)-His-X(2-3)-Cysteine-X2-Cysteine -X(4-48)-Cysteine-X2-Cysteine, where X is any amino acid. The RING finger motif binds two atoms of zinc, with each zinc atom ligated tetrahedrally by either four cysteines or three cysteines and a histidine. The motif also forms into a unitary structure with a central cross-brace region and is found in many proteins that are involved in protein-protein interactions. The acronym RING stands for Really Interesting New Gene.
A mixed-function oxygenase that catalyzes the hydroxylation of a prolyl-glycyl containing peptide, usually in PROTOCOLLAGEN, to a hydroxyprolylglycyl-containing-peptide. The enzyme utilizes molecular OXYGEN with a concomitant oxidative decarboxylation of 2-oxoglutarate to SUCCINATE. The enzyme occurs as a tetramer of two alpha and two beta subunits. The beta subunit of procollagen-proline dioxygenase is identical to the enzyme PROTEIN DISULFIDE-ISOMERASES.
Genes that encode highly conserved TRANSCRIPTION FACTORS that control positional identity of cells (BODY PATTERNING) and MORPHOGENESIS throughout development. Their sequences contain a 180 nucleotide sequence designated the homeobox, so called because mutations of these genes often results in homeotic transformations, in which one body structure replaces another. The proteins encoded by homeobox genes are called HOMEODOMAIN PROTEINS.
Compounds and molecular complexes that consist of very large numbers of atoms and are generally over 500 kDa in size. In biological systems macromolecular substances usually can be visualized using ELECTRON MICROSCOPY and are distinguished from ORGANELLES by the lack of a membrane structure.
Use of restriction endonucleases to analyze and generate a physical map of genomes, genes, or other segments of DNA.
Reverse Transcriptase Polymerase Chain Reaction
A variation of the PCR technique in which cDNA is made from RNA via reverse transcription. The resultant cDNA is then amplified using standard PCR protocols.
A highly conserved 76-amino acid peptide universally found in eukaryotic cells that functions as a marker for intracellular PROTEIN TRANSPORT and degradation. Ubiquitin becomes activated through a series of complicated steps and forms an isopeptide bond to lysine residues of specific proteins within the cell. These "ubiquitinated" proteins can be recognized and degraded by proteosomes or be transported to specific compartments within the cell.
Genetic Complementation Test
Saccharomyces cerevisiae Proteins
Enzymes that catalyze the cleavage of a phosphorus-oxygen bond by means other than hydrolysis or oxidation. EC 4.6.
Filamentous proteins that are the main constituent of the thin filaments of muscle fibers. The filaments (known also as filamentous or F-actin) can be dissociated into their globular subunits; each subunit is composed of a single polypeptide 375 amino acids long. This is known as globular or G-actin. In conjunction with MYOSINS, actin is responsible for the contraction and relaxation of muscle.
Tumor Cells, Cultured
Cells grown in vitro from neoplastic tissue. If they can be established as a TUMOR CELL LINE, they can be propagated in cell culture indefinitely.
Magnetic Resonance Spectroscopy
A long pro-domain caspase that contains a death effector domain in its pro-domain region. Caspase 8 plays a role in APOPTOSIS by cleaving and activating EFFECTOR CASPASES. Activation of this enzyme can occur via the interaction of its N-terminal death effector domain with DEATH DOMAIN RECEPTOR SIGNALING ADAPTOR PROTEINS.
Polymerase Chain Reaction
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
A polynucleotide consisting essentially of chains with a repeating backbone of phosphate and ribose units to which nitrogenous bases are attached. RNA is unique among biological macromolecules in that it can encode genetic information, serve as an abundant structural component of cells, and also possesses catalytic activity. (Rieger et al., Glossary of Genetics: Classical and Molecular, 5th ed)
Animals, Genetically Modified
Endosomal Sorting Complexes Required for Transport
A set of protein subcomplexes involved in PROTEIN SORTING of UBIQUITINATED PROTEINS into intraluminal vesicles of MULTIVESICULAR BODIES and in membrane scission during formation of intraluminal vesicles, during the final step of CYTOKINESIS, and during the budding of enveloped viruses. The ESCRT machinery is comprised of the protein products of Class E vacuolar protein sorting genes.
Cell Line, Tumor
A cell line derived from cultured tumor cells.
Sequence Analysis, Protein
A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence.
Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
Schizosaccharomyces pombe Proteins
Addition of methyl groups. In histo-chemistry methylation is used to esterify carboxyl groups and remove sulfate groups by treating tissue sections with hot methanol in the presence of hydrochloric acid. (From Stedman, 25th ed)
Protein Processing, Post-Translational
Any of various enzymatically catalyzed post-translational modifications of PEPTIDES or PROTEINS in the cell of origin. These modifications include carboxylation; HYDROXYLATION; ACETYLATION; PHOSPHORYLATION; METHYLATION; GLYCOSYLATION; ubiquitination; oxidation; proteolysis; and crosslinking and result in changes in molecular weight and electrophoretic motility.
SOX Transcription Factors
A large family of structurally-related transcription factors that were originally discovered based upon their close sequence homology to an HMG-box domain found in SEX-DETERMINING REGION Y PROTEIN. Many SOX transcription factors play important roles in regulating CELL DIFFERENTIATION. The numerous members of this family are organized in several subgroups according to structural identities found within the proteins.
Enzymes that catalyze the methylation of amino acids after their incorporation into a polypeptide chain. S-Adenosyl-L-methionine acts as the methylating agent. EC 2.1.1.
RNA, Small Interfering
Small double-stranded, non-protein coding RNAs (21-31 nucleotides) involved in GENE SILENCING functions, especially RNA INTERFERENCE (RNAi). Endogenously, siRNAs are generated from dsRNAs (RNA, DOUBLE-STRANDED) by the same ribonuclease, Dicer, that generates miRNAs (MICRORNAS). The perfect match of the siRNAs' antisense strand to their target RNAs mediates RNAi by siRNA-guided RNA cleavage. siRNAs fall into different classes including trans-acting siRNA (tasiRNA), repeat-associated RNA (rasiRNA), small-scan RNA (scnRNA), and Piwi protein-interacting RNA (piRNA) and have different specific gene silencing functions.