| Preface | p. xi |
| Seeing Your Life in Data | p. 1 |
| Personal Environmental Impact Report (PEIR) | p. 2 |
| your.flowingdata (YFD) | p. 3 |
| Personal Data Collection | p. 3 |
| Data Storage | p. 5 |
| Data Processing | p. 6 |
| Data Visualization | p. 7 |
| The Point | p. 14 |
| How to Participate | p. 15 |
| The Beautiful People: Keeping Users in Mind When Designing Data Collection Methods | p. 17 |
| Introduction: User Empathy Is the New Black | p. 17 |
| The Project: Surveying Customers About a New Luxury Product | p. 19 |
| Specific Challenges to Data Collection | p. 19 |
| Designing Our Solution | p. 21 |
| Results and Reflection | p. 31 |
| Embedded Image Data Processing on Mars | p. 35 |
| Abstract | p. 35 |
| Introduction | p. 35 |
| Some Background | p. 37 |
| To Pack or Not to Pack | p. 40 |
| The Three Tasks | p. 42 |
| Slotting the Images | p. 43 |
| Passing the Image: Communication Among the Three Tasks | p. 46 |
| Getting the Picture: Image Download and Processing | p. 48 |
| Image Compression | p. 50 |
| Downlink, or, It's All Downhill from Here | p. 52 |
| Conclusion | p. 52 |
| Cloud Storage Design in a Pnutshell | p. 55 |
| Introduction | p. 55 |
| Updating Data | p. 57 |
| Complex Queries | p. 64 |
| Comparison with Other Systems | p. 68 |
| Conclusion | p. 71 |
| Information Platforms and the Rise of the Data Scientist | p. 73 |
| Libraries and Brains | p. 73 |
| Facebook Becomes Self-Aware | p. 74 |
| A Business Intelligence System | p. 75 |
| The Death and Rebirth of a Data Warehouse | p. 77 |
| Beyond the Data Warehouse | p. 78 |
| The Cheetah and the Elephant | p. 79 |
| The Unreasonable Effectiveness of Data | p. 80 |
| New Tools and Applied Research | p. 81 |
| MAD Skills and Cosmos | p. 82 |
| Information Platforms As Dataspaces | p. 83 |
| The Data Scientist | p. 83 |
| Conclusion | p. 84 |
| The Geographic Beauty of a Photographic Archive | p. 85 |
| Beauty in Data: Geograph | p. 86 |
| Visualization, Beauty, and Treemaps | p. 89 |
| A Geographic Perspective on Geograph Term Use | p. 91 |
| Beauty in Discovery | p. 98 |
| Reflection and Conclusion | p. 101 |
| Data Finds Data | p. 105 |
| Introduction | p. 105 |
| The Benefits of Just-in-Time Discovery | p. 106 |
| Corruption at the Roulette Wheel | p. 107 |
| Enterprise Discoverability | p. 111 |
| Federated Search Ain't All That | p. 111 |
| Directories: Priceless | p. 113 |
| Relevance: What Matters and to Whom? | p. 115 |
| Components and Special Considerations | p. 115 |
| Privacy Considerations | p. 118 |
| Conclusion | p. 118 |
| Portable Data In Real Time | p. 119 |
| Introduction | p. 119 |
| The State of the Art | p. 120 |
| Social Data Normalization | p. 128 |
| Conclusion: Mediation via Gnip | p. 131 |
| Surfacing the Deep Web | p. 133 |
| What Is the Deep Web? | p. 133 |
| Alternatives to Offering Deep-Web Access | p. 135 |
| Conclusion and Future Work | p. 147 |
| Building Radiohead's House of Cards | p. 149 |
| How It All Started | p. 149 |
| The Data Capture Equipment | p. 150 |
| The Advantages of Two Data Capture Systems | p. 154 |
| The Data | p. 154 |
| Capturing the Data, aka "The Shoot" | p. 155 |
| Processing the Data | p. 160 |
| Post-Processing the Data | p. 160 |
| Launching the Video | p. 161 |
| Conclusion | p. 164 |
| Visualizing Urban Data | p. 167 |
| Introduction | p. 167 |
| Background | p. 168 |
| Cracking the Nut | p. 169 |
| Making It Public | p. 174 |
| Revisiting | p. 178 |
| Conclusion | p. 181 |
| The design of sense.us | p. 183 |
| Visualization and Social Data Analysis | p. 184 |
| Data | p. 186 |
| Visualization | p. 188 |
| Collaboration | p. 194 |
| Voyagers and Voyeurs | p. 199 |
| Conclusion | p. 203 |
| What Data Doesn't do | p. 205 |
| When Doesn't Data Drive? | p. 208 |
| Conclusion | p. 217 |
| Natural Language Corpus Data | p. 219 |
| Word Segmentation | p. 221 |
| Secret Codes | p. 228 |
| Spelling Correction | p. 234 |
| Other Tasks | p. 239 |
| Discussion and Conclusion | p. 240 |
| Life in Data: The Story of DNA | p. 243 |
| DNA As a Data Store | p. 243 |
| DNA As a Data Source | p. 250 |
| Fighting the Data Deluge | p. 253 |
| The Future of DNA | p. 257 |
| Beautifying Data in the Real World | p. 259 |
| The Problem with Real Data | p. 259 |
| Providing the Raw Data Back to the Notebook | p. 260 |
| Validating Crowdsourced Data | p. 262 |
| Representing the Data Online | p. 263 |
| Closing the Loop: Visualizations to Suggest New Experiments | p. 271 |
| New Experiments | p. 271 |
| Building a Data Web from Open Data and Free Services | p. 274 |
| Superficial Data Analysis: Exploring Millions of Social Stereotypes | p. 279 |
| Introduction | p. 279 |
| Preprocessing the Data | p. 280 |
| Exploring the Data | p. 282 |
| Age, Attractiveness, and Gender | p. 285 |
| Looking at Tags | p. 290 |
| Which Words Are Gendered? | p. 294 |
| Clustering | p. 295 |
| Conclusion | p. 300 |
| Bay Area Blues: The Effect of the Housing Crisis | p. 303 |
| Introduction | p. 303 |
| How Did We Get the Data? | p. 304 |
| Geocoding | p. 305 |
| Data Checking | p. 305 |
| Analysis | p. 306 |
| The Influence of Inflation | p. 307 |
| The Rich Get Richer and the Poor Get Poorer | p. 308 |
| Geographic Differences | p. 311 |
| Census Information | p. 314 |
| Exploring San Francisco | p. 318 |
| Conclusion | p. 319 |
| Beautiful Political Data | p. 323 |
| Example 1: Redistricting and Partisan Bias | p. 324 |
| Example 2: Time Series of Estimates | p. 326 |
| Example 3: Age and Voting | p. 328 |
| Example 4: Public Opinion and Senate Voting on Supreme Court Nominees | p. 328 |
| Example 5: Localized Partisanship in Pennsylvania | p. 330 |
| Conclusion | p. 332 |
| Connecting Data | p. 335 |
| What Public Data Is There, Really? | p. 336 |
| The Possibilities of Connected Data | p. 337 |
| Within Companies | p. 338 |
| Impediments to Connecting Data | p. 339 |
| Possible Solutions | p. 343 |
| Conclusion | p. 348 |
| Contributors | p. 349 |
| Index | p. 357 |
| Table of Contents provided by Ingram. All Rights Reserved. |