Bayesian Integrative Analysis of Heterogeneous Data in Breast Cancer Prognosis

Advanced Search

Basic Search

Help

AND OR NOT

Add a row

Reset

Limit results by

Books+ Search Results

Author

Zhao, Yinjun.

Title

Bayesian Integrative Analysis of Heterogeneous Data in Breast Cancer Prognosis [electronic resource].

ISBN

9780355777352

Published

Ann Arbor : ProQuest Dissertations & Theses, 2017.

Physical Description

1 online resource (52 p.)

Local Notes

Access is available to the Yale community.

Notes

Source: Masters Abstracts International, Volume: 57-05.

Adviser: Shuangge Ma.

Access and use

Access restricted by licensing agreement.

Summary

Jointly profiling the expressions of thousands of genes and searching for their associations with cancer development and prognosis play important roles in providing preventive and prognostic information to cancer treatment management. The availability of large-scale gene profiling datasets has largely stimulated multi-dataset approaches, of which integrative analysis that synthesizes information from various data sources based on raw data has better performance in separating signal from noise, getting more efficient estimates and improving reproducibility than single dataset analysis. Multiple methods have been developed to accommodate heterogeneous datasets including penalization, thresholding, and boosting, but few studies are in the Bayesian framework.

This paper is motivated by the importance of gene marker identification to cancer development and prognosis, the effectiveness of integrative analysis in improving marker selection and model prediction, and the limited integrative approaches in the Bayesian framework. The goal of this paper is to identify a set of gene markers accurately using a novel Bayesian integrative approach and to fill the gap of Bayesian variable selection methods in integrative analysis of heterogeneous datasets.

This paper proposes a new integrative Bayesian variable selection method in the linear and the accelerated failure-time (AFT) models under heterogeneous data structure. A weighted approach is introduced to accommodate the user-defined weighted observations in the linear model and the censoring issue in the AFT model. The simulation studies show the proposed approach has satisfactory performance in terms of closeness to the true effects, true positive rate & false positive rate and computational cost. This paper analyzes three cancer prognosis studies with gene expression measurements and identifies associated genes using the proposed method.

Format

Books / Online / Dissertations & Theses

Language

English

Added to Catalog

July 30, 2018

Thesis note

Thesis (M.P.H.)--Yale University, 2017.

Subjects

Biostatistics.

Public health.

Also listed under