Web-based Tools for Protein Classification

互联网2014-02-13

534

Current proteomics technologies generate large number of data among which the investigator has to identify the promising diagnostic/prognostic biomarkers as well as potential therapeutic targets. For the latter, classification of proteins into meaningful families is needed. Current databases, featuring a high level of interconnectivity (cross referencing), provide the tools necessary to bring various data together, facilitating protein classification and elucidation of protein function and interoperativity. This chapter provides guidelines to explore the informationally rich peptide sequences generated by the application of the proteomics methodologies by the use of web-based tools, with the objective to predict potential protein function. After proper preprocessing (e.g., for internal repeats) of a query protein sequence, known domains can be identified, which aid in dividing the query into smaller meaningful parts. Any unclassified remainder of the protein provides the material for low-level comparative analysis for the discovery of distant homologues or candidate novel domain types to be verified experimentally.