[ml] Hadoop going forward

Vikram Oberoi voberoi at gmail.com
Sun May 23 17:43:08 UTC 2010

There's a 'Download Source' button at the top right of the page when
you're browsing the repository.


On May 23, 2010, at 12:07 AM, Andreas von Hessling <vonhessling at gmail.com> wrote:

> Vikram,
> could you easily create a single download containing all the files in
> your GitHub project, please? As it stands, it seems we'd need to
> download each file individually in the browser. (I've never used GitHub
> before, so please excuse my ignorance.)
> The setup instructions work great for me as well. I'm now trying to get
> a better sense of what your functions do and how to use them.
> Thanks,
> Andy
> On Wed, May 19, 2010 at 11:38 PM, Vikram Oberoi <voberoi at gmail.com> wrote:
>> Hey folks,
>> For those of you that came out tonight, I hope the code I walked through
>> and the initial (albeit rough) overview of MapReduce helped. If you have
>> any questions or requests, the best way to ask would be to:
>> a) send an email to me via ml at lists.noisebridge.net, or
>> b) open an issue on the GitHub project:
>>    http://github.com/voberoi/hadoop-mrutils
>> Either way, someone else might be able to answer first, and everyone
>> will benefit from the answer, since there's a high probability that
>> others will have the same questions.
>> For next week, I'm going to write a script that transforms the KDD
>> dataset in... some useful way. Your input on what exactly I should do
>> here is most welcome. The transformation should be involved enough that
>> the code can serve as an example for scripts you all might implement
>> later.
>> I'll also be taking a look at Apache Mahout (a library containing Hadoop
>> MapReduce implementations of numerous machine learning algorithms) and
>> writing up an example of how to use it. If you have a particular
>> algorithm that you want to apply to the dataset, check whether it's in
>> the Mahout library and let me know.
>> Finally, is any brainstorming/discussion about what we're doing happening
>> anywhere other than the meetups? I'd be happy to meet again some time
>> before next Wednesday to hash out some ideas and run with them, as
>> in-person conversation bandwidth is *so* much higher. Alternatively, we
>> could throw out ideas on the list and brainstorm over email threads. It
>> doesn't seem like there's a whole lot of action on the wiki other than
>> links to resources and TODOs. Or is there?
>> Vikram
>> _______________________________________________
>> ml mailing list
>> ml at lists.noisebridge.net
>> https://www.noisebridge.net/mailman/listinfo/ml
