Enlaces de Tratamiento de Datos y Visualización

How To Get Experience Working With Large Datasets: The trouble with most sources of data is that they are owned and the data is copyrighted or proprietary. You can scrape websites and, if you fly under the radar, you will get a dataset. Although, if you want a large dataset, then it will take a lot of scraping. Instead, you should look for data that you can acquire more efficiently and, hopefully, legally. If nothing else, collecting the data in a legal manner will help you sleep better at night and you have a chance at going on to use that data to build something useful.

Google Visualization API Data Sources and Tools Gallery. Example, Gauges: This gallery lists data source implementations and development tools built for the Google Visualization API. Some of these have been written by Google, and some have been written by third parties. Links below point to instructions for and code of each library or tool. (Links to third-party tools will take you off the Google site). To learn how to implement a data source, read Implementing a Data Source.


Mapping Excel Data with Google Mapping Tools: Google provides two mapping tools that let you navigate from a global view of the earth down to country, state, city, neighborhood, or individuals house levels. The table below shows comparable views prepared with Google Earth and Google Maps.

Google Refine (formerly Freebase Gridworks) is a power tool for working with messy data, cleaning it up, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase.

Data visualizations for a changing world: The Google Public Data Explorer makes large datasets easy to explore, visualize and communicate. As the charts and maps animate over time, the changes in the world become easier to understand. You don't have to be a data expert to navigate between different views, make your own comparisons, and share your findings.

Books Ngram Viewer es la herramienta de Google Labs que nos permite visualizar la aparición de uno o varios términos en los libros catalogados por Google Books.

Kagle, a platform for data prediction competitions that allows organizations to post their data and have it scrutinized by the world's best data scientists.

Information is Beutiful: Visualizing Bloodtests: Our goal wasn’t just a polish job. We worked hard on the information too. So there was context for all the facts and figures. Ideally, anyone, of any educational background, could get the gist and plan their next move. 

Dremel – Google's tool for analyzing trillion-row tables in seconds [pdf] 

Mining of Massive Datasets

Free Icon Set for Web Developers: Coded

Ban Comic Sans