Université de Liège Réseau des Bibliothèques

BICTEL/e - ULg
Institutional repository of doctoral theses



Abstract page for ULgetd-07062009-150801

Author: Journée, Michel
URN: ULgetd-07062009-150801
Language: English
Title: Geometric algorithms for component analysis with a view to gene expression data analysis
Degree: Doctorate in Engineering Sciences
Department: FSA - Department of Electrical Engineering, Electronics and Computer Science
Committee:
Name                   Role
ABSIL, P-A             Committee Member
SEPULCHRE, Rodolphe    Committee Member
SUYKENS, Johan         Committee Member
THEIS, Fabian          Committee Member
VAN STEEN, Kristel     Committee Member
ZINOVYEV, Andrei       Committee Member
WEHENKEL, Louis        Committee Chair
Keywords:
  • sparsity
  • statistical independence
  • low-rank positive semidefinite matrices
  • gene expression data analysis
  • optimization on matrix manifolds
  • dimension reduction
  • component analysis
Defense date: 2009-06-04
Access type: Public/Internet
Abstract:

The research reported in this thesis addresses the problem of component analysis, which aims at reducing large data sets to lower dimensions in order to reveal their essential structure. This problem is encountered in almost all areas of science - from physics and biology to finance, economics and psychometrics - where large data sets need to be analyzed.

Several paradigms for component analysis are considered, e.g., principal component analysis, independent component analysis and sparse principal component analysis. Each is naturally formulated as an optimization problem subject to constraints that endow the problem with a well-characterized matrix manifold structure. Component analysis is thus cast in the realm of optimization on matrix manifolds. Algorithms for component analysis are subsequently derived that take advantage of the geometrical structure of the problem.
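As an illustration of such a formulation, principal component analysis can be written as a trace maximization over matrices with orthonormal columns. The abstract gives no formulas, so the notation here is an assumption made for this sketch: A denotes an n-by-n symmetric sample covariance matrix, X an n-by-p matrix of loading vectors, and I_p the p-by-p identity.

```latex
\max_{X \in \mathbb{R}^{n \times p}} \; \operatorname{tr}\!\left( X^{\top} A X \right)
\quad \text{subject to} \quad X^{\top} X = I_p
```

The feasible set, the n-by-p matrices with orthonormal columns, is the Stiefel manifold. A maximizer is any orthonormal basis of the subspace spanned by the p dominant eigenvectors of A.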

When formalizing component analysis into an optimization framework, three main classes of problems are encountered, for which methods are proposed. We first consider the problem of optimizing a smooth function on the set of n-by-p real matrices with orthonormal columns. Then, a method is proposed to maximize a convex function on a compact manifold, which generalizes to this context the well-known power method that computes the dominant eigenvector of a matrix. Finally, we address the issue of solving problems defined in terms of large positive semidefinite matrices in a numerically efficient manner by using low-rank approximations of such matrices.
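The link between the power method and the maximization of a convex function on a compact set can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the algorithm of the thesis: the compact set is taken to be the unit sphere, each step maximizes the linearization of the function over the sphere (which amounts to normalizing the gradient), and the names `sphere_ascent`, `grad_quadratic` and the test matrix `A` are hypothetical. For the convex quadratic f(x) = x^T A x with A symmetric positive semidefinite, this iteration reduces to the classical power method.

```python
import math


def normalize(v):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0.0:
        raise ValueError("cannot normalize the zero vector")
    return [c / norm for c in v]


def matvec(A, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * c for a, c in zip(row, x)) for row in A]


def sphere_ascent(grad, x0, iters=200, tol=1e-12):
    """Ascent on the unit sphere: x <- grad(x) / ||grad(x)||.

    Each step maximizes the linearization of the objective at x
    over the sphere (assumption of this sketch).
    """
    x = normalize(x0)
    for _ in range(iters):
        x_new = normalize(grad(x))
        if max(abs(a - b) for a, b in zip(x_new, x)) < tol:
            return x_new
        x = x_new
    return x


# Hypothetical example: f(x) = x^T A x, whose gradient is 2 A x.
A = [[2.0, 1.0], [1.0, 2.0]]


def grad_quadratic(x):
    return [2.0 * c for c in matvec(A, x)]


v = sphere_ascent(grad_quadratic, [1.0, 0.0])
# Rayleigh quotient of the unit vector v, an estimate of the dominant eigenvalue.
rayleigh = sum(a * b for a, b in zip(v, matvec(A, v)))
```

Passing the gradient as a function keeps the iteration independent of the objective, so another convex function can be substituted without changing `sphere_ascent`. The sphere is the only compact set handled here.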

The efficiency of the proposed algorithms for component analysis is evaluated on the analysis of gene expression data related to breast cancer, which encode the expression levels of thousands of genes measured in experiments on hundreds of cancerous cells. Such data provide a snapshot of the biological processes that occur in tumor cells and offer huge opportunities for an improved understanding of cancer. Thanks to an original framework for evaluating the biological significance of a set of components, both well-established and novel knowledge is inferred about the biological processes that underlie breast cancer.

Hence, to summarize the thesis in one sentence: We adopt a geometric point of view to propose optimization algorithms for component analysis which, applied to large gene expression data sets, reveal novel biological knowledge.

Other version: http://www.montefiore.ulg.ac.be/services/stochastic/pubs/2009/Jou09/
Files:
File name: Thesis_MJournee_toprint.pdf [Public/Internet]
Size: 928.53 Kb
Estimated download time (HH:MI:SS): 00:02:12 (56K modem), 00:00:04 (ADSL)

Although every effort has been made to respect the rights of rights holders, any rights holder who finds that a work over which they hold rights has been used in BICTEL/e ULg without their explicit authorization is invited to contact the Direction du Réseau des Bibliothèques as soon as possible.




© Réseau des Bibliothèques de l'ULg, Grande traverse, 12 B37 4000 LIEGE