[ml] Hi

Adam Skory a at skory.us
Mon Jul 18 22:19:49 UTC 2011


2011/7/18 Josh Myer <josh at joshisanerd.com>:
> Unsolicited kibitzing from a grumpy old practitioner:
> As far as algorithmic detection of "is tennis court?": you shouldn't have
> too hard of a time manually finding tennis courts...
> You'll want to find a bunch of
> confusing not-tennis-courts for your training set

> Always remember: 95% of ML is scut work, 5% is sexy algorithmic fun, and 50%
> is refactoring your code to account for the stupid ugly data gotchas you
> missed in the original scope.

When you go about building your dataset, setting up image labeling
tasks on services like Mechanical Turk or Crowdflower will take you
much less time than doing it yourselves.

-Skory



More information about the ml mailing list