Rice Univesrity Logo
    • FAQ
    • Deposit your work
    • Login
    View Item 
    •   Rice Scholarship Home
    • Rice University Graduate Electronic Theses and Dissertations
    • Rice University Electronic Theses and Dissertations
    • View Item
    •   Rice Scholarship Home
    • Rice University Graduate Electronic Theses and Dissertations
    • Rice University Electronic Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    A novel computational platform for sensitive, accurate, and efficient screening of nucleic acids

    Thumbnail
    Name:
    ALBIN-DOCUMENT-2020.pdf
    Size:
    2.974Mb
    Format:
    PDF
    View/Open
    Author
    Albin, Dreycey Don
    Date
    2020-04-24
    Advisor
    Treaangen, Todd
    Degree
    Master of Science
    Abstract
    Recent advances in the field of synthetic biology and nucleic acid synthesis, coupled with increasing concerns about its intentional or accidental misuse, require more sophisticated screening tools to identify genes of interest within short sequence fragments. One major limitation in predicting DNA sequences of concern is the inadequacy of current computational tools and ontologies to describe the specific biological processes of pathogenic proteins. In the first part of this thesis, we design and implement a novel computational platform, SeqScreen, that sensitively assigns taxonomic classifications, functional annotations, and biological processes of interest to short nucleotide sequences of unknown origin (50bp-1,000bp). The overarching goal is to perform sensitive characterization of short sequences and highlight specific pathogenic biological processes of interest (BPoIs). The SeqScreen software executes these tasks in analytical workflows and outputs results in a tab-delimited report. In the second part, we perform a deep computational dive into the area of taxonomic classification, specifically focusing on biases caused by differences in sequences they contain, which radically change over time and differ significantly from repository to repository. To mitigate these drawbacks, the Database Query Tool (DQT) is presented as an effective, easy-to-use, method to investigate the taxonomic composition of databases commonly used in metagenomics. It outputs the databases and related versions that contain a given input NCBI taxonomic ID, allowing for a user to decide what database to use for a given sample, as well as a method for post-analysis. In summary, we provide two novel computational tools for sensitive and accurate characterization of nucleic acid sequences.
    Keyword
    Synthetic Biology; Bioinformatics; Metagenomics; Computational Biology
    Citation
    Albin, Dreycey Don. "A novel computational platform for sensitive, accurate, and efficient screening of nucleic acids." (2020) Master’s Thesis, Rice University. https://hdl.handle.net/1911/108636.
    Metadata
    Show full item record
    Collections
    • Rice University Electronic Theses and Dissertations [13783]

    Home | FAQ | Contact Us | Privacy Notice | Accessibility Statement
    Managed by the Digital Scholarship Services at Fondren Library, Rice University
    Physical Address: 6100 Main Street, Houston, Texas 77005
    Mailing Address: MS-44, P.O.BOX 1892, Houston, Texas 77251-1892
    Site Map

     

    Searching scope

    Browse

    Entire ArchiveCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsTypeThis CollectionBy Issue DateAuthorsTitlesSubjectsType

    My Account

    Login

    Statistics

    View Usage Statistics

    Home | FAQ | Contact Us | Privacy Notice | Accessibility Statement
    Managed by the Digital Scholarship Services at Fondren Library, Rice University
    Physical Address: 6100 Main Street, Houston, Texas 77005
    Mailing Address: MS-44, P.O.BOX 1892, Houston, Texas 77251-1892
    Site Map