Robotics and Artificial Intelligence Enthusiasts Forum


Data, not algorithms, is key to machine learning success

Posted on 2016-1-7 09:31:42
By boris, January 06, 2016

There has been an explosion in machine learning activity, and Shivon Zilis recently mapped out the current machine intelligence ecosystem as we enter 2016. This is one of the key areas that we’ll be following this year.

While the opportunities here are tremendous, the exuberance surrounding machine learning distracts startups from a key hurdle: it’s data, not algorithms, that will dictate who wins in this space. Algorithms have largely been commoditized by now, so a machine learning company built around publicly accessible data isn’t defensible.
Access to unique data isn’t a problem for incumbents like Google, Facebook, and Amazon. But startups face a serious chicken-and-egg problem: they have to convince people to give them data, but the machine intelligence service won’t be useful until people (and a lot of them) are actually using the service and sharing their data. Matt Turck recently wrote about this “cold start” problem in a great post, “The Power of Data Network Effects.”
I’ve seen recommendation services struggle with this challenge. It takes too long to train the bots and users drop off the platform before the service gets valuable.
Does this mean that startups can’t successfully play in the machine learning space? Not necessarily. First of all, there’s plenty of data out there. Likewise, there are plenty of opportunities to leverage machine intelligence to solve a business problem or improve day-to-day life. The key is to build a use case strong enough to compel people to give up their data before the benefits of machine learning and data network effects really kick in.
Turck talks about the “data trap” strategy, where startups build fun and free side apps to start gathering data. A great example is Clarifai, a deep learning and image recognition company. Clarifai’s free consumer app, Forevery, offers instant value for its users by making it easier to organize and find photos. With each new Forevery user, Clarifai has a bigger data pool to refine its image recognition technology.
Mint.com is another example where users gave access to very valuable and unique data, even though Mint didn’t do much on the machine learning front in the beginning.
The first step is to develop some kind of data acquisition strategy. The next step is to effectively communicate the value proposition. The focus has to be on tangible benefits for the end users, rather than the coolness of the technology platform. In Zilis’ summary of today’s machine learning ecosystem, she noted that “many machine intelligence companies have figured out that they need to speak the language of solving a business problem.” That’s a great sign.
The bottom line is that if startups want to succeed in machine learning, their top priority should be building proprietary data sets. Creating a strong use case is the most effective strategy to get that data set. It’s not easy, but the good news is that it’s incredibly hard to unseat a service once data network effects kick in.

