• encode
  • Gene transcripts that encode 30 proteins important in the detection of semiochemicals, including sex pheromones, were identified and characterized. (usda.gov)
  • This work provides a source of DNA sequences and the proteins that they encode which are important in the detection of semiochemicals such as pheromones. (usda.gov)
  • A simple gene prediction algorithm for prokaryotes might look for a start codon followed by an open reading frame that is long enough to encode a typical protein, where the codon usage of that region matches the frequency characteristic for the given organism's coding regions. (wikipedia.org)
  • The human and the murine GPR84 ORFs both encode proteins of 396 amino acid residues length with 85% identity and are therefore considered as orthologs. (wikipedia.org)
  • The first and second each contain 7 exons and encode the same full-length protein isoform a (786 amino acids, considered the major isoform), the third one contains 6 exons and encodes a shorter isoform b (732 amino acids) and the fourth one contains 5 exons and encodes the shortest isoform c (349 amino acids). (wikipedia.org)
  • secretory
  • TRAM2 is a component of the translocon, a gated macromolecular channel that controls the posttranslational processing of nascent secretory and membrane proteins at the endoplasmic reticulum (ER) membrane. (wikipedia.org)
  • transcriptome
  • This paper describes the generation and characterisation of full-length cDNA sequences from E. tenella second generation merozoites and provides new insights into the E. tenella transcriptome. (biomedcentral.com)
  • Here, we used next-generation sequencing (454 and Illumina) on different tissues from adult and larvae, including chemosensory organs and female ovipositors, to describe the chemosensory transcriptome of S. nonagrioides and identify key molecular components of the pheromone production and detection systems. (ijbs.com)
  • New high-throughput transcriptome sequencing technologies such as RNA-Seq and ChIP-sequencing open opportunities for incorporating additional extrinsic evidence into gene prediction and validation, and allow structurally rich and more accurate alternative to previous methods of measuring gene expression such as expressed sequence tag or DNA microarray. (wikipedia.org)
  • genome sequence
  • The data generated will be useful for the development and validation of diagnostic and control strategies for coccidiosis and will be of value in annotation of the E. tenella genome sequence. (biomedcentral.com)
  • With the bovine draft genome sequence in hand, and development of databases of DNA sequence variation across the genome, we can now turn our attention to leveraging the investment in genome sequence to improve our understanding of the impact and interplay of the genome and the production environment. (usda.gov)
  • Today, with comprehensive genome sequence and powerful computational resources at the disposal of the research community, gene finding has been redefined as a largely computational problem. (wikipedia.org)
  • peptide sequences
  • The output is the predicted peptide sequences in the FASTA format, and a definition line that includes the query ID, the translation reading frame and the nucleotide positions where the coding region begins and ends. (wikipedia.org)
  • cDNA library
  • More than 15,000 partial sequences were generated from the 5' and 3' ends of clones randomly selected from an E. tenella second generation merozoite full-length cDNA library. (biomedcentral.com)
  • In 1979 teams at Harvard and Caltech extended the basic idea of making DNA copies of mRNAs in vitro to amplifying a library of such in bacterial plasmids In 1982, the idea of selecting random or semi-random clones from such a cDNA library for sequencing was explored by Greg Sutcliffe and coworkers. (wikipedia.org)
  • Ensembl
  • For example, the RefSeq database contains transcript and protein sequence from many different species, and the Ensembl system comprehensively maps this evidence to human and several other genomes. (wikipedia.org)
  • codling moth
  • Researchers at the USDA-ARS Yakima Agricultural Research Laboratory (Wapato, WA) and Corn Insect and Crop Genetics Research Unit (Ames,IA), and Washington State University (Pullman, WA) have used next generation sequencing to determine gene expression in various codling moth tissues that respond to semiochemicals. (usda.gov)
  • The results of our EST project indicate that next generation sequencing is useful for the characterization of novel transcripts of interest, and that codling moth expresses a transcript encoding for a new member of the GOBP protein subfamily. (usda.gov)
  • similarity
  • A complex bootstrap resampling leads to an even distribution of scores of randomly similar sequences to assess randomness of the observed sequence similarity. (pubmedcentralcanada.ca)
  • As a result, problematic sequence alignments will contain sections of ambiguous indel positions and random similarity. (pubmedcentralcanada.ca)
  • Sequence identity and similarity were determined using BLAST with the reference human sequence as the query. (wikipedia.org)
  • Given a sequence, local alignment algorithms such as BLAST, FASTA and Smith-Waterman look for regions of similarity between the target sequence and possible candidate matches. (wikipedia.org)
  • A high degree of similarity to a known messenger RNA or protein product is strong evidence that a region of a target genome is a protein-coding gene. (wikipedia.org)
  • Amino acid sequence homology of this isoform of human CASS4 with other family members is 26% overall identity and 42% similarity. (wikipedia.org)
  • Using a yeast two-hybrid approach, the CASS4 protein SH3 domain was shown to interact with the FAK C-terminus, despite the lowest overall similarity to other SH3 domains in the CAS group. (wikipedia.org)
  • The CASS4 C-terminus has a short region of CAS family homology, but lacks obvious similarity at the level of primary amino acid sequence. (wikipedia.org)
  • Although this lack of sequence similarity may mean a reduced functionality of the CASS4 protein, molecular modeling analysis performed by Singh and colleagues using p130CAS/BCAR1 structures as templates suggested an almost identical fold between CASS4 and p130CAS/BCAR1 within their SH3 domains, and substantial similarity within 432-591 residues of CASS4 and 449-610 residues of p130Cas/BCAR1 at the level of secondary and tertiary structures. (wikipedia.org)
  • transcripts
  • Clustering of these sequences produced 1,529 unique transcripts (UTs). (biomedcentral.com)
  • Cloning and sequencing of the 698 nt transcript for CpomGOBP1 resulted in the prediction of a 163 amino acid coding region, and subsequent RT-PCR indicated that the transcripts were mainly expressed in antennae and mouthparts. (usda.gov)
  • The 1289 nt (160 amino acids) CpomGOBP2 and the novel 702 nt (169 amino acids) CpomGOBP3 transcripts are mainly expressed in antennae, mouthparts and female abdomen tips. (usda.gov)
  • Cumulatively, the CASS4 transcripts are most highly expressed in spleen and lung among normal tissues, and are highly expressed in ovarian and leukemia cell lines. (wikipedia.org)
  • receptor
  • Three transcription units were identified within the candidate interval, including Toll-like receptor 4 ( Tlr4 ), part of a protein family with members that have been implicated in LPS-induced cell signaling. (rupress.org)
  • Probable G-protein coupled receptor 160 is a protein that in humans is encoded by the GPR160 gene. (wikipedia.org)
  • Importantly, a number of adipocyte-abundant genes, including GLUT4, hormone sensitive lipase, long-chain fatty acyl-CoA synthase, adipocyte complement-related protein of 30 kDa, and transcription factors CCAAT/enhancer binding protein-α, receptor retinoid X receptor-α, and peroxisome profilerator-activated receptor γ were significantly downregulated by TNF-α treatment. (diabetesjournals.org)
  • Correspondingly, 24-h exposure of 3T3-L1 adipocytes to TNF-α resulted in reduced protein levels of GLUT4 and several insulin signaling proteins, including the insulin receptor, insulin receptor substrate 1 (IRS-1), and protein kinase B (AKT). (diabetesjournals.org)
  • In their papers they described the sequence and expression profile of five new members of GPC receptor family. (wikipedia.org)
  • The product of this gene, P2Y14 belongs to the family of G-protein coupled receptors, which contains several receptor subtypes with different pharmacological selectivity for various adenosine and uridine nucleotides. (wikipedia.org)
  • This receptor is a P2Y purinergic receptor for UDP-glucose and other UDP-sugars coupled to G-proteins. (wikipedia.org)
  • Succinate receptor 1 is a protein that in humans is encoded by the SUCNR1 gene. (wikipedia.org)
  • molecular
  • Wheat grains accumulate a variety of low molecular weight proteins that are inhibitors of alpha-amylases and proteases and play an important protective role in the grain. (biomedcentral.com)
  • Wheat kernels accumulate a variety of low molecular weight proteins that inhibit amylases and/or proteases from different sources. (biomedcentral.com)
  • This protein has a calculated molecular mass of 47.219kDa. (biochemj.org)
  • A better knowledge of the molecular mechanisms of S. nonagrioides olfaction will contribute to the development of new tools for the control of this species since inhibition/artificial activation of the proteins implicated in the olfactory process would lead to the disruption of its chemical communication. (ijbs.com)
  • The molecular mechanisms of olfaction in insects are complex and rely on the intervention of a diversity of proteins expressed in the chemosensory sensilla that cover the sensory organs. (ijbs.com)
  • In molecular biology, the protein domain SUI1 is a translation initiation factor often found in the fungus, Saccharomyces cerevisiae (Baker's yeast) but it is also found in other eukaryotes and prokaryotes as well as archaea. (wikipedia.org)
  • evolutionary
  • The major problem is that finding the most accurate alignment parameters in progressive and consistency based alignment approaches is difficult due to the incomplete knowledge of the evolutionary history of sequences and/or heterogeneous processes along sequences [ 6 ]. (pubmedcentralcanada.ca)
  • SUI1 is also evolutionary related to hypothetical proteins from Escherichia coli (yciH), Haemophilus influenzae (HI1225) and Methanococcus vannielii. (wikipedia.org)
  • Evolutionary divergence of the CAS proteins family members is discussed by Singh et al. (wikipedia.org)
  • transiently expressed
  • When transiently expressed, MyoB, MyoC, as well as the common motor domain lacking the tail did not distribute evenly between daughter cells, suggesting some impairment in proper segregation. (rupress.org)
  • ORFs
  • In eukaryotic genes with multiple exons, ORFs span intron/exon regions, which may be spliced together after transcription of the ORF to yield the final mRNA for protein translation. (wikipedia.org)
  • Long ORFs are often used, along with other evidence, to initially identify candidate protein-coding regions or functional RNA-coding regions in a DNA sequence. (wikipedia.org)
  • On the other hand, it has been proven that some short ORFs (sORFs) that lack the classical hallmarks of protein-coding genes (both from ncRNAs and mRNAs) can produce functional peptides. (wikipedia.org)
  • The tool efficiently finds the ORFs for corresponding amino acid sequences and converts them into their single letter amino acid code, and provides their locations in the sequence. (wikipedia.org)
  • Genomics
  • Many aspects of structural gene prediction are based on current understanding of underlying biochemical processes in the cell such as gene transcription, translation, protein-protein interactions and regulation processes, which are subject of active research in the various omics fields such as transcriptomics, proteomics, metabolomics, and more generally structural and functional genomics. (wikipedia.org)
  • receptors
  • We identified a total of 68 candidate chemosensory genes in this species, including 31 candidate binding-proteins and 23 chemosensory receptors. (ijbs.com)
  • Focusing on the pheromonal communication system, we identified a new pheromone-binding protein in this species, four candidate pheromone receptors and 12 carboxylesterases as candidate acetate degrading enzymes. (ijbs.com)
  • These soluble proteins are proposed to bind odorant molecules and to transport them to membrane bound receptors [ 6 , 7 ]. (ijbs.com)
  • The product of this gene, P2Y13, belongs to the family of G-protein coupled receptors. (wikipedia.org)
  • helminth
  • A helminth protein, or helminthic antigen, is a protein derived from a parasitic worm that causes an immune reaction. (wikipedia.org)
  • Helminth proteins can result in a deregulated response to infection, and are implicated in reduced reactivity to other antigens. (wikipedia.org)
  • Other helminth proteins promote parasite survival in other ways, particularly since parasites must depend on hosts for the supply of essential nutrients. (wikipedia.org)
  • Despite their pathogenic properties, helminth proteins have potential to be co-opted to treat a number of other human diseases. (wikipedia.org)
  • Helminth proteins modulate the immune response of their hosts, but do not suppress it entirely. (wikipedia.org)
  • Additionally, the database can also be used to develop protein targets for new drugs to treat helminth infections. (wikipedia.org)
  • Given the modulatory properties of helminth proteins, it has been suggested that they may be co-opted to successfully treat other human diseases, particularly those associated with auto-immunity disorders. (wikipedia.org)
  • antigens
  • results have recently been obtained using membrane-spanning protein antigens from the tegument of Schistosoma mansoni . (labome.org)
  • species
  • Since this is the first description of GRPs in this species, particular emphasis has been given to the expression pattern of these genes by analyzing their abundance and prevalence in the different cDNA-libraries of the Eucalyptus Genome Sequencing Project Consortium (ForEST). (ufrgs.br)
  • These resources were a key component leading to selection of cattle as a species for complete genome sequencing, which now has opened the door for much more detailed analysis of genome function and evolution among mammals. (usda.gov)
  • Gene finding is one of the first and most important steps in understanding the genome of a species once it has been sequenced. (wikipedia.org)
  • partial
  • Once candidate DNA sequences have been determined, it is a relatively straightforward algorithmic problem to efficiently search a target genome for matches, complete or partial, and exact or inexact. (wikipedia.org)
  • parasite
  • Detailed study of the merozoite stage will support the identification of proteins important to key biological processes in the parasite including host invasion, replication, pathogenicity and the stimulation of host immunity. (biomedcentral.com)
  • When secreted, these proteins may modify the host's immune response in order to promote longevity of the parasite. (wikipedia.org)
  • chromosome
  • Hgpr84 locates to chromosome 12q13.13, and its coding sequence is not interrupted by introns. (wikipedia.org)
  • The protein encoded by this gene forms a heterotrimeric motor complex with kinesin family member 3A and KAP3 (kinesin accessory protein 3) to drive intra-flagellar transport and possibly to aid in chromosome movement during mitosis and meiosis. (wikipedia.org)
  • identification
  • Improvement of the established method for identification of thioredoxin-targeted proteins based on two-dimensional electrophoresis and fluorescence labelling of thiol groups was achieved by application of a highly sensitive Cy5 maleimide dye and large-format two-dimensional gels, resulting in a 10-fold increase in the observed number of labelled protein spots. (biochemj.org)
  • Prolactin-induced protein has also been used for identification and detection of disseminated breast cancer cells. (wikipedia.org)
  • abundant
  • Simple sequence repeat (SSR) analysis identified CAG as the most abundant trinucleotide motif, while codon usage analysis revealed that the ten most infrequently used codons in E. tenella are UAU, UGU, GUA, CAU, AUA, CGA, UUA, CUA, CGU and AGU. (biomedcentral.com)
  • physiological
  • The protein has a physiological function in regulation of water transport mainly in apocrine glands in the axilla, vulva, eyelid and ear canal, serous cells of the submandibular salivary gland, serous cells of the submucosal glands of the bronchi, and accessory lacrimal glands as well as cutaneous eccrine glands. (wikipedia.org)
  • randomly
  • For example, in a randomly generated DNA sequence with an equal percentage of each nucleotide, a stop-codon would be expected once every 21 codons. (wikipedia.org)