“During the last decade, the advent of microarray datasets stimulated a new line of research in bioinformatics. A microarray database is a repository containing microarray gene expression data. Microarray data pose a great challenge for computational techniques, due to their large dimensionality (up to several tens of thousands of genes) and their small sample sizes. Furthermore, additional experimental complications like noise and variability render the analysis of microarray data an exciting domain” [Saeys et al. 2007, Bioinformatics].
In light of this excerpt, which pattern recognition tools can you apply to microarray data to identify the genes responsible for diseases such as cancer? Explain how.