| Preface | p. vii |
| Introduction | p. 1 |
| The Microarray: Key to Functional Genomics and Systems Biology | p. 1 |
| Applications of Microarray | p. 2 |
| Gene Expression Profiles in Different Tissues | p. 3 |
| Developmental Genetics | p. 3 |
| Gene Expression Patterns in Model Systems | p. 3 |
| Differential Gene Expression Patterns in Diseases | p. 4 |
| Gene Expression Patterns in Pathogens | p. 5 |
| Gene Expression in Response to Drug Treatments | p. 6 |
| Genotypic Analysis | p. 7 |
| Mutation Screening of Disease Genes | p. 7 |
| Framework of Microarray Data Analysis | p. 8 |
| Summary | p. 11 |
| Basic Concepts of Molecular Biology | p. 13 |
| Introduction | p. 13 |
| Cells | p. 13 |
| Proteins | p. 15 |
| Nucleic Acids | p. 19 |
| DNA | p. 19 |
| RNA | p. 22 |
| Central Dogma of Molecular Biology | p. 22 |
| Genes and the Genetic Code | p. 23 |
| Transcription and Gene Expression | p. 25 |
| Translation and Protein Synthesis | p. 26 |
| Genotype and Phenotype | p. 27 |
| Summary | p. 30 |
| Overview of Microarray Experiments | p. 31 |
| Introduction | p. 31 |
| Microarray Chip Manufacture | p. 32 |
| Deposition-Based Manufacture | p. 33 |
| In Situ Manufacture | p. 34 |
| The Affymetrix GeneChip | p. 35 |
| Steps of Microarray Experiments | p. 36 |
| Sample Preparation and Labeling | p. 36 |
| Hybridization | p. 39 |
| Image Scanning | p. 39 |
| Image Processing | p. 40 |
| Microarray Data Cleaning and Preprocessing | p. 42 |
| Data Transformation | p. 42 |
| Missing Value Estimation | p. 43 |
| Data Normalization | p. 45 |
| Global Normalization Approaches | p. 46 |
| Standardization | p. 46 |
| Iterative linear regression | p. 46 |
| Intensity-Dependent Normalization | p. 47 |
| LOWESS: Locally weighted linear regression | p. 47 |
| Distribution normalization | p. 49 |
| Summary | p. 49 |
| Analysis of Differentially-Expressed Genes | p. 51 |
| Introduction | p. 51 |
| Basic Concepts in Statistics | p. 53 |
| Statistical Inference | p. 53 |
| Hypothesis Test | p. 54 |
| Fold Change Methods | p. 56 |
| k-fold Change | p. 56 |
| Unusual Ratios | p. 57 |
| Model-Based Methods | p. 60 |
| Parametric Tests | p. 62 |
| Paired t-Test | p. 62 |
| Unpaired t-Test | p. 63 |
| Variants of t-Test | p. 64 |
| Non-Parametric Tests | p. 65 |
| Classical Non-Parametric Statistics | p. 65 |
| Other Non-Parametric Statistics | p. 66 |
| Bootstrap Analysis | p. 67 |
| Multiple Testing | p. 69 |
| Family-Wise Error Rate | p. 70 |
| Sidak correction and Bonferroni correction | p. 70 |
| Holm's step-wise correction | p. 71 |
| False Discovery Rate | p. 71 |
| Permutation Correction | p. 72 |
| SAM: Significance Analysis of Microarrays | p. 73 |
| ANOVA: Analysis of Variance | p. 77 |
| One-Way ANOVA | p. 79 |
| Two-Way ANOVA | p. 80 |
| Summary | p. 82 |
| Gene-Based Analysis | p. 83 |
| Introduction | p. 83 |
| Proximity Measurement for Gene Expression Data | p. 85 |
| Euclidean Distance | p. 85 |
| Correlation Coefficient | p. 86 |
| Pearson's correlation coefficient | p. 86 |
| Jackknife correlation | p. 88 |
| Spearman's rank-order correlation | p. 88 |
| Kullback-Leibler Divergence | p. 88 |
| Partition-Based Approaches | p. 90 |
| K-means and its Variations | p. 90 |
| SOM and its Extensions | p. 92 |
| Graph-Theoretical Approaches | p. 94 |
| HCS and CLICK | p. 94 |
| CAST: Cluster affinity search technique | p. 96 |
| Model-Based Clustering | p. 98 |
| Hierarchical Approaches | p. 99 |
| Agglomerative Algorithms | p. 99 |
| Divisive Algorithms | p. 102 |
| DAA: Deterministic annealing algorithm | p. 102 |
| SPC: Super-paramagnetic clustering | p. 103 |
| Density-Based Approaches | p. 104 |
| DBSCAN | p. 105 |
| OPTICS | p. 106 |
| DENCLUE | p. 107 |
| GPX: Gene Pattern eXplorer | p. 110 |
| The Attraction Tree | p. 115 |
| The distance measure | p. 115 |
| The density definition | p. 116 |
| The attraction tree | p. 118 |
| An example of attraction tree | p. 120 |
| Interactive Exploration of Coherent Patterns | p. 122 |
| Generating the index list | p. 123 |
| The coherent pattern index and its graph | p. 125 |
| Drilling down to subgroups | p. 126 |
| Experimental Results | p. 128 |
| Interactive exploration of Iyer's data and Spellman's data | p. 129 |
| Comparison with other algorithms | p. 129 |
| Efficiency and Scalability | p. 134 |
| Cluster Validation | p. 135 |
| Homogeneity and Separation | p. 136 |
| Agreement with Reference Partition | p. 137 |
| Reliability of Clusters | p. 138 |
| P-value of a cluster | p. 138 |
| Prediction strength | p. 139 |
| Summary | p. 139 |
| Sample-Based Analysis | p. 141 |
| Introduction | p. 141 |
| Selection of Informative Genes | p. 144 |
| Supervised Approaches | p. 145 |
| Differentially expressed genes | p. 145 |
| Gene pairs | p. 146 |
| Virtual genes | p. 148 |
| Genetic algorithms | p. 150 |
| Unsupervised Approaches | p. 152 |
| PCA: Principal component analysis | p. 152 |
| Gene shaving | p. 154 |
| Class Prediction | p. 155 |
| Linear Discriminant Analysis | p. 155 |
| Instance-Based Classification | p. 158 |
| KNN: k-Nearest Neighbor | p. 158 |
| Weighted voting | p. 159 |
| Decision Trees | p. 160 |
| Support Vector Machines | p. 162 |
| Class Discovery | p. 163 |
| Problem statement | p. 165 |
| CLIFF: CLustering via Iterative Feature Filtering | p. 165 |
| The sample-partition process | p. 166 |
| The gene-filtering process | p. 167 |
| ESPD: Empirical Sample Pattern Detection | p. 168 |
| Measurements for phenotype structure detection | p. 168 |
| Algorithms | p. 173 |
| Experimental results | p. 184 |
| Classification Validation | p. 190 |
| Prediction Accuracy | p. 190 |
| Prediction Reliability | p. 191 |
| Summary | p. 192 |
| Pattern-Based Analysis | p. 195 |
| Introduction | p. 195 |
| Mining Association Rules | p. 197 |
| Concepts of Association-Rule Mining | p. 198 |
| The Apriori Algorithm | p. 200 |
| The FP-Growth Algorithm | p. 201 |
| The CARPENTER Algorithm | p. 202 |
| Generating Association Rules in Microarray Data | p. 204 |
| Rule filtering | p. 205 |
| Rule grouping | p. 206 |
| Mining Pattern-Based Clusters in Microarray Data | p. 207 |
| Heuristic Approaches | p. 208 |
| Coupled two-way clustering (CTWC) | p. 208 |
| Plaid model | p. 209 |
| Biclustering and 5-Clusters | p. 210 |
| Deterministic Approaches | p. 211 |
| [delta]-pCluster | p. 211 |
| OP-Cluster | p. 213 |
| Mining Gene-Sample-Time Microarray Data | p. 214 |
| Three-dimensional Microarray Data | p. 214 |
| Coherent Gene Clusters | p. 215 |
| Problem description | p. 217 |
| Maximal coherent sample sets | p. 219 |
| The mining algorithms | p. 222 |
| Experimental results | p. 227 |
| Tri-Clusters | p. 232 |
| The tri-cluster model | p. 232 |
| Properties of tri-clusters | p. 234 |
| Mining tri-clusters | p. 235 |
| Summary | p. 238 |
| Visualization of Microarray Data | p. 239 |
| Introduction | p. 239 |
| Single-Array Visualization | p. 241 |
| Box Plot | p. 242 |
| Histogram | p. 243 |
| Scatter Plot | p. 244 |
| Gene Pies | p. 246 |
| Multi-Array Visualization | p. 247 |
| Global Visualizations | p. 247 |
| Optimal Visualizations | p. 249 |
| Projection Visualization | p. 250 |
| VizStruct | p. 251 |
| Fourier Harmonic Projections | p. 253 |
| Discrete-time signal paradigm | p. 253 |
| The Fourier harmonic projection algorithm | p. 254 |
| Properties of FHPs | p. 257 |
| Basic properties | p. 257 |
| Advanced properties | p. 258 |
| Harmonic equivalency | p. 260 |
| Effects of harmonic twiddle power index | p. 261 |
| Enhancements of Fourier Harmonic Projections | p. 263 |
| Exploratory Visualization of Gene Profiling | p. 265 |
| Microarray data sets for visualization | p. 265 |
| Identification of informative genes | p. 265 |
| Classifier construction and evaluation | p. 265 |
| Dimension arrangement | p. 267 |
| Visualization of various data sets | p. 270 |
| Comparison of FFHP to Sammon's mapping | p. 275 |
| Confirmative Visualization of Gene Time-series | p. 277 |
| Data sets for visualization | p. 277 |
| The harmonic projection approach | p. 278 |
| Rat kidney data set | p. 278 |
| Yeast-A data set | p. 279 |
| Yeast-B data set | p. 282 |
| Summary | p. 282 |
| New Trends in Mining Gene Expression Microarray Data | p. 285 |
| Introduction | p. 285 |
| Meta-Analysis of Microarray Data | p. 285 |
| Meta-Analysis of Differential Genes | p. 286 |
| Meta-Analysis of Co-Expressed Genes | p. 287 |
| Semi-Supervised Clustering | p. 288 |
| General Semi-Supervised Clustering Algorithms | p. 289 |
| A Seed-Generation Approach | p. 291 |
| Seed-generation methods | p. 291 |
| Pattern-selection rules | p. 292 |
| The framework for the seed-generation approach | p. 295 |
| Integration of Gene Expression Data with Other Data | p. 296 |
| A Probabilistic Model for Joint Mining | p. 299 |
| A Graph-Based Model for Joint Mining | p. 300 |
| Summary | p. 304 |
| Conclusion | p. 305 |
| Bibliography | p. 307 |
| Index | p. 331 |
| Table of Contents provided by Ingram. All Rights Reserved. |