[ml] KDD cup submission status
mike at mindmech.com
Fri Jun 4 02:48:58 UTC 2010
Sounds like you're making great progress! I'll be working on the
graph clustering algorithm for the skill set tonight and will keep
you posted on how things are going.
On Thu, Jun 3, 2010 at 6:17 PM, Andreas von Hessling
<vonhessling at gmail.com>wrote:
> Doing a few basic tricks, I catapulted the submission into the 50th
> percentile. That is not even running any ML algorithm.
> I'm planning on running the NaiveBayesUpdateable classifier
> (http://weka.wikispaces.com/Classifying+large+datasets) over
> discretized IQ/IQ strength/Chance/Chance strength from the command
> line to evaluate performance. Another attempt would be to load all
> data into memory (<3GB, even for full Bridge Train) and run SVMlib
> over it.
> If someone wants to try MOA
> this would be
> helpful also in the long run (at least a tutorial how to set it up and
> The reduced datasets plus the IQ values are linked on the wiki: Features
> ...> row INT,
> ...> studentid VARCHAR(30),
> ...> problemhierarchy TEXT,
> ...> problemname TEXT,
> ...> problemview INT,
> ...> problemstepname TEXT,
> ...> cfa INT,
> ...> iq REAL
> IQ strength (number of attempts per student) should be available soon.
> (perhaps add'l features will become available as well)
> I'm still hoping somebody could cluster Erin's normalized skills data
> and provide a row -> cluster id mapping for algebra and bridge train
> and test sets (I don't have the data any more).
> ml mailing list
> ml at lists.noisebridge.net
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the ml