Computational Methods for Predicting Protein-Nucleic Acids Interaction

Cheng, Wen

dc.contributor.author	Cheng, Wen
dc.description.abstract	Since the inception of various proteomic projects, protein structures with unknown functions have been discovered at a rapid pace. Proteins regulate many important biological processes by interacting with nucleic acids, which include DNA and RNA. Traditional wet-lab methods for protein function discovery are too slow to keep up with this rapid increase in data. Therefore, there is a need for computational methods that can predict the interaction between proteins and nucleic acids. There are two related problems in predicting protein-nucleic acid interactions. One is to identify nucleic acid-binding sites on protein structures; the other is to predict the 3-D structure of the complex that the protein and nucleic acid form during interaction. The second problem can be further divided into two steps. The first step is to generate potential structures (poses) for the protein-nucleic acid complex. The second step is to assign scores to the poses generated in the first step. This dissertation presents two computational methods that we developed to predict protein-nucleic acid interactions. The first method is a scoring function that can discriminate native structures of protein-DNA complexes from non-native poses, which are also known as docking decoys. We analyze the distribution of protein atoms around each structural component of the DNA and develop spatial-specific scoring matrices (SSSMs) based on the observed distribution. We show that the SSSMs can be used as a knowledge-based energy function to discriminate native protein-DNA structures from various decoys. Our second method discovers sub-graphs that are enriched on protein-nucleic acid interfaces and then uses these sub-graphs to predict RNA-binding sites on protein structures and to assign scores to protein-RNA poses. First, the interface area of each RNA-binding protein is represented as a graph, where each node represents an interface residue.
Then, common sub-graphs that are abundant in these graphs are identified. The method is able to identify RNA-binding sites on the protein surface with high accuracy. We also demonstrate that the common sub-graphs can be used as a scoring function to rank protein-RNA poses. Our method is computationally simple, and its results are easy to interpret in biological contexts.	en_US
dc.publisher	North Dakota State University	en_US
dc.rights	NDSU Policy 190.6.2
dc.title	Computational Methods for Predicting Protein-Nucleic Acids Interaction	en_US
dc.type	Dissertation	en_US
dc.type	Video	en_US
dc.date.accessioned	2015-06-30T17:39:22Z
dc.date.available	2015-06-30T17:39:22Z
dc.date.issued	2015
dc.identifier.uri	http://hdl.handle.net/10365/25192
dc.rights.uri	https://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degree	Doctor of Philosophy (PhD)	en_US
ndsu.college	Engineering	en_US
ndsu.department	Computer Science	en_US
ndsu.program	Computer Science	en_US
ndsu.advisor	Yan, Changhui

Files in this item

Name: Wen Cheng video.mov
Size: 83.95Mb
Format: QuickTime video

Name: Computational Methods for ...
Size: 975.8Kb
Format: PDF
Description: Computational Methods for ...

This item appears in the following Collection(s)

Computer Science Doctoral Work