Play with the Machine » data http://www.machinelake.com Sat, 03 Sep 2011 16:08:33 +0000 en hourly 1 Your Hedge Fund Howto http://www.machinelake.com/2011/04/04/your-hedge-fund-howto/ http://www.machinelake.com/2011/04/04/your-hedge-fund-howto/#comments Tue, 05 Apr 2011 05:06:58 +0000 gavin http://www.machinelake.com/?p=87596812 “Fresh out of college, with no background in finance, I learned high frequency trading (HFT) first hand while bootstrapping my startup from nothing to trading tens of millions of shares daily for a billion-dollar hedge fund. I’m starting this blog to discuss the technical challenges related to building world class HFT system, including developing the trading algorithms, handling the data feeds, building the high performance data structures, structuring the threading model, and designing the system for maximum reliability.”

Is it any surprise the similarity between a business focused on making money by watching money and any random business focused on making money by watching, I don’t know, tweets? The money watchers invest a lot of time & energy in building real time systems for delivering answers. The tweet watchers don’t care so much about real time currently. They’re ok with answers eventually rather than now. How much longer will this remain the case? Is there a future where real money is riding on immediate answers derived from tweets? Or from any random unit of “social media” from the fire hose? Regardless, it looks like the world of finance has a lot of the answers and WK’s High Frequency Trading Blog is a great behind the scenes look at how it could come together.

]]>
http://www.machinelake.com/2011/04/04/your-hedge-fund-howto/feed/ 0
Cleaning & transforming your data with Stanford’s Wrangler http://www.machinelake.com/2011/02/03/cleaning-transforming-your-data-with-stanfords-wrangler/ http://www.machinelake.com/2011/02/03/cleaning-transforming-your-data-with-stanfords-wrangler/#comments Thu, 03 Feb 2011 16:44:29 +0000 gavin http://www.machinelake.com/?p=87596799

Wrangler Demo Video from Stanford Visualization Group on Vimeo.

“Too much time is spent manipulating data just to get analysis and visualization tools to read it. Wrangler is designed to accelerate this process: spend less time fighting with your data and more time learning from it. Wrangler allows interactive transformation of messy, real-world data into the data tables analysis tools expect. Export data for use in Excel, R, Tableau, Protovis, …”

Seeing how this is from Stanford University, there’s a nice paper as well, “Wrangler: Interactive Visual Specification of Data Transformation Scripts”. There’s a bit of an overlap between FreeBase’s Gridworks (now Google’s Refine tool) but really, more differences than similarity. But they both work in your browser–try Wrangler right now.

]]>
http://www.machinelake.com/2011/02/03/cleaning-transforming-your-data-with-stanfords-wrangler/feed/ 0