丁香实验_LOGO
登录
提问
提问
我要登录
|免费注册
点赞
收藏
wx-share
分享

High-Performance Gene Expression Module Analysis Tool and Its Application to Chemical Toxicity Data

互联网

302
Gene clustering is one of the main themes of data mining approaches in bioinformatics. Although it has the power to analyze gene function, interpretation of the results becomes increasingly difficult when the number of experiments (samples) exceeds hundreds or more. A new type of clustering called “biclustering,” where genes and experiments are coclustered in a large-scale of gene expression data, has been extensively studied in the last decade. We have developed “SAMURAI,” an original program that detects all the biclusters or “gene modules” whose genes have similar expression patterns to query profile using the ultrafast data mining algorithm called Linear-time Closed itemset Miner (LCM). Using chemical toxicity dataset from J&J rat liver experiments, we compiled an exhaustive dictionary of gene modules by searching datasets of gene modules with each chemical exposure experiment as query. Through the module analysis, we found that our program can detect up/down-regulated gene sets that significantly represent particular GO functions or KEGG pathways, thereby unraveling reactions and mechanisms common to different toxicochemical treatments of hepatocytes.
提问
扫一扫
丁香实验小程序二维码
实验小助手
丁香实验公众号二维码
关注公众号
反馈
TOP
打开小程序