<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Resys China</title>
	<atom:link href="http://www.resyschina.com/feed" rel="self" type="application/rss+xml" />
	<link>http://www.resyschina.com</link>
	<description>all about recommender systems</description>
	<lastBuildDate>Sat, 19 May 2012 13:14:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>尝试pagerank做围脖局部影响力计算</title>
		<link>http://www.resyschina.com/2012/05/%e5%b0%9d%e8%af%95pagerank%e5%81%9a%e5%9b%b4%e8%84%96%e5%b1%80%e9%83%a8%e5%bd%b1%e5%93%8d%e5%8a%9b%e8%ae%a1%e7%ae%97.html</link>
		<comments>http://www.resyschina.com/2012/05/%e5%b0%9d%e8%af%95pagerank%e5%81%9a%e5%9b%b4%e8%84%96%e5%b1%80%e9%83%a8%e5%bd%b1%e5%93%8d%e5%8a%9b%e8%ae%a1%e7%ae%97.html#comments</comments>
		<pubDate>Sat, 19 May 2012 13:04:54 +0000</pubDate>
		<dc:creator>mcgrady164</dc:creator>
				<category><![CDATA[算法学术]]></category>
		<category><![CDATA[pagerank]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=665</guid>
		<description><![CDATA[最近用pagerank的思路对围脖局部影响力做了个简单的计算，最后的效果一般。 思路如下： （1）以若干数量的优质用户为种子用户开始算法。 （2）获取种子用户的关注列表和转发的围脖内容。 （3）利用得到的data构建图，如果用户A关注B，则添加一条边A-&#62;B； 如果用户A转发B的围脖，同样添加一条边A-&#62;B； 如果用户A既关注B，也转发了B的围脖，权重会变化； 因为一个用户很有可能转发同一个用户的多条围脖，所以权重和转发的次数也有关系； 由于关注和转发这两种行为对于影响力的影响也是不同的，所以两种方式的权重也是需要好好想想的。 （4）通过每个种子用户的关注列表以及由转发关系可以发现新的用户，添加这些新的并且不同于已有的用户进入种子用户列表中，重复（2）和（3）的工作。 （5）结束条件： 1）种子用户列表中的用户数量超过一定数量。 2）发现新用户的速度下降到某个阈值。 这样我们可以构建一个为原始优质种子用户中心的社交网络图谱，大多数的转发行为都以他们为中心。 然后利用pagerank的思路，对这个graph进行计算，得到每个用户的pagerank值，从而进行影响力排序。 这里有几个比较重要的地方： （1）数据量要有一定规模，可以保证这个社交网络的完整性。不然的话，在做pagerank计算的时候会发现除了一些优质的种子用户所在的行有足够的数据可以计算，其他绝大多数的行都是0. （2）关于转发和关注的权值分配对于结果的影响比较大。由于围脖是一个靠转发来传播信息的社交网络，所以转发占有绝对的权重，并且会根据转发次数的不同来修改这些权重。这个权值需要用一些别的更好的方法来确定，如果是拍脑袋去定的话效果会不好。 （3）关于计算。因为pagerank要对矩阵做运算，所以数据量的规模越大，对计算资源的要求也就越高。这也是我遇到最头疼的一个问题。（因为我是用个人电脑做计算的）关于这个问题，建议可以学习parallel pagerank。 （4）关于实现。因为在这个过程中需要大量的用到围脖的api，会导致获取数据的时候遇到一些麻烦。比如你长时间访问api，超过了时间限制，就会被forbidden，这个时候需要换appkey，甚至换个user，严重的话得换ip。所以在调围脖api的时候，要注意控制好速度，并且同时做好一些异常的捕获和预防措施。否则收集到的信息就可能会缺东西。 个人觉得这个思路还是比较靠谱的，但有些地方做的不够好，所以用了少量的data做了个结果不是很尽如人意。 希望看到这篇blog的高手前来拍砖和支招儿。 另：不少童鞋询问爬围脖数据的方法。 大概的思路有两种： （1）模拟登录，然后调用围脖的api。这样的好处是，可以获得结构化的json数据。不好的地方在于，要时刻提防挨批速率限制，用户限制，ip限制等等问题。我采用的就是这种。 （2）模拟登录，然后用ajax爬网页数据。这种方式我没有用过，不过网上有介绍的文章。（去看看） 最后附一模拟登录的python源码（点我下载）<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="围脖局部影响力计算example" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/04/21/21668404.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">围脖局部影响力计算example</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="社交媒体流量：Facebook给力？Stumbleupon更给力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fstumbleupon-facebook.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1815225.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">社交媒体流量：Facebook给力？Stumbleupon更给力</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐解释可以增加用户的信任" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Freason-increase-user-trust.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802352.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐解释可以增加用户的信任</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="机器学习在互联网应用面临的 10 大挑战" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/05/16/26444553.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">机器学习在互联网应用面临的 10 大挑战</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="简网发布“指阅”，打造中国版Zite" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/10/26/9859639.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">简网发布“指阅”，打造中国版Zite</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="谷歌或将网页加载速度纳入PageRank影响因素" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F11630.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">谷歌或将网页加载速度纳入PageRank影响因素 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="云计算的架构-从技术角度剖析云计算" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F22003.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/08/5261491.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">云计算的架构-从技术角度剖析云计算 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="从《三鹿奶粉事件》看企业价值与网络口碑营销影响力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F1188.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2010/12/30/1662986.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">从《三鹿奶粉事件》看企业价值与网络口碑营销影响力 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="云里雾里云计算：云计算系列文章" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F18770.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2010/12/29/1636502.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">云里雾里云计算：云计算系列文章 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="真正的框计算" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F9298.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/07/1841763.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">真正的框计算 (@alibuybuy)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<div>
<div>
<p>最近用pagerank的思路对围脖局部影响力做了个简单的计算，最后的效果一般。</p>
<p>思路如下：</p>
<p>（1）以若干数量的优质用户为种子用户开始算法。</p>
<p>（2）获取种子用户的关注列表和转发的围脖内容。</p>
<p>（3）利用得到的data构建图，如果用户A关注B，则添加一条边A-&gt;B；</p>
<p>如果用户A转发B的围脖，同样添加一条边A-&gt;B；</p>
<p>如果用户A既关注B，也转发了B的围脖，权重会变化；</p>
<p>因为一个用户很有可能转发同一个用户的多条围脖，所以权重和转发的次数也有关系；</p>
<p>由于关注和转发这两种行为对于影响力的影响也是不同的，所以两种方式的权重也是需要好好想想的。</p>
<p>（4）通过每个种子用户的关注列表以及由转发关系可以发现新的用户，添加这些新的并且不同于已有的用户进入种子用户列表中，重复（2）和（3）的工作。</p>
<p>（5）结束条件：<br />
1）种子用户列表中的用户数量超过一定数量。</p>
<p>2）发现新用户的速度下降到某个阈值。</p>
<div>这样我们可以构建一个为原始优质种子用户中心的社交网络图谱，大多数的转发行为都以他们为中心。</div>
<div>然后利用pagerank的思路，对这个graph进行计算，得到每个用户的pagerank值，从而进行影响力排序。</div>
<div>这里有几个比较重要的地方：</div>
<div>（1）数据量要有一定规模，可以保证这个社交网络的完整性。不然的话，在做pagerank计算的时候会发现除了一些优质的种子用户所在的行有足够的数据可以计算，其他绝大多数的行都是0.</div>
<div>（2）关于转发和关注的权值分配对于结果的影响比较大。由于围脖是一个靠转发来传播信息的社交网络，所以转发占有绝对的权重，并且会根据转发次数的不同来修改这些权重。这个权值需要用一些别的更好的方法来确定，如果是拍脑袋去定的话效果会不好。</div>
<p>（3）关于计算。因为pagerank要对矩阵做运算，所以数据量的规模越大，对计算资源的要求也就越高。这也是我遇到最头疼的一个问题。（因为我是用个人电脑做计算的）关于这个问题，建议可以学习parallel pagerank。</p>
<p>（4）关于实现。因为在这个过程中需要大量的用到围脖的api，会导致获取数据的时候遇到一些麻烦。比如你长时间访问api，超过了时间限制，就会被forbidden，这个时候需要换appkey，甚至换个user，严重的话得换ip。所以在调围脖api的时候，要注意控制好速度，并且同时做好一些异常的捕获和预防措施。否则收集到的信息就可能会缺东西。</p>
<div>
<div>个人觉得这个思路还是比较靠谱的，但有些地方做的不够好，所以用了少量的data做了个结果不是很尽如人意。</div>
<div>希望看到这篇blog的高手前来拍砖和支招儿。</div>
<div></div>
<div>另：不少童鞋询问爬围脖数据的方法。</div>
<div>大概的思路有两种：</div>
<div>（1）模拟登录，然后调用围脖的api。这样的好处是，可以获得结构化的json数据。不好的地方在于，要时刻提防挨批速率限制，用户限制，ip限制等等问题。我采用的就是这种。</div>
<div>（2）模拟登录，然后用ajax爬网页数据。这种方式我没有用过，不过网上有介绍的文章。（<a href="http://hi.baidu.com/cwyalpha/blog/item/0ef657dacd8d87c4b6fd48fc.html" target="_blank">去看看</a>）</div>
<div>最后附一模拟登录的python源码（<a href="http://vdisk.weibo.com/s/5CGDW/1337432319" target="_blank">点我下载</a>）</div>
</div>
</div>
</div>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="围脖局部影响力计算example" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/04/21/21668404.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">围脖局部影响力计算example</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="社交媒体流量：Facebook给力？Stumbleupon更给力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fstumbleupon-facebook.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1815225.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">社交媒体流量：Facebook给力？Stumbleupon更给力</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="PayPal创始人：兴趣图谱模式将造就下一个互联网巨头" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Finterest-graph.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/16/23099235.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">PayPal创始人：兴趣图谱模式将造就下一个互联网巨头</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="为什么关于社会化推荐系统的创业想法接连失败？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fwhy-does-the-startup-idea-of-social-recommendations-consistently-fail.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802213.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">为什么关于社会化推荐系统的创业想法接连失败？</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="StumbleUpon + Hulu = ？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fstumbleupon-hulu.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802373.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">StumbleUpon + Hulu = ？</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="谷歌或将网页加载速度纳入PageRank影响因素" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F11630.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">谷歌或将网页加载速度纳入PageRank影响因素 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="云计算的架构-从技术角度剖析云计算" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F22003.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/08/5261491.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">云计算的架构-从技术角度剖析云计算 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="从《三鹿奶粉事件》看企业价值与网络口碑营销影响力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F1188.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2010/12/30/1662986.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">从《三鹿奶粉事件》看企业价值与网络口碑营销影响力 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="云里雾里云计算：云计算系列文章" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F18770.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2010/12/29/1636502.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">云里雾里云计算：云计算系列文章 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="真正的框计算" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F9298.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/07/1841763.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">真正的框计算 (@alibuybuy)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/05/%e5%b0%9d%e8%af%95pagerank%e5%81%9a%e5%9b%b4%e8%84%96%e5%b1%80%e9%83%a8%e5%bd%b1%e5%93%8d%e5%8a%9b%e8%ae%a1%e7%ae%97.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>机器学习在互联网应用面临的 10 大挑战</title>
		<link>http://www.resyschina.com/2012/05/machine_learning_10_challenges.html</link>
		<comments>http://www.resyschina.com/2012/05/machine_learning_10_challenges.html#comments</comments>
		<pubDate>Tue, 15 May 2012 16:58:01 +0000</pubDate>
		<dc:creator>clickstone</dc:creator>
				<category><![CDATA[算法学术]]></category>
		<category><![CDATA[精华]]></category>
		<category><![CDATA[algorithm]]></category>
		<category><![CDATA[machine learning]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=662</guid>
		<description><![CDATA[原内容出自：@张栋_机器学习 http://weibo.com/1686830902/xAkL3EV4p 1： “数据稀释性”：训练一个模型，需要大量（标注）数据，但是数据往往比较稀疏。比如，我们想训练一个模型表征某人 “购物兴趣”，但是这个人在网站上浏览行为很少，购物历史很少，很难训练出一个 &#8220;meaningful model&#8221; 来预测应该给这个人推荐什么商品等 &#8230; 2：“不平稳随机过程产生的数据”：机器学习模型往往假设数据的产生是 “平稳随机过程”。但是有些互联网数据（比如 spam 邮件，spam 网站等）的产生是动态的，不平稳随机过程 … 3～4：高数量和高质量 “标定数据” 是机器学习效果的保障，但是获取标定数据需要耗费大量人力和财力。而且，人会出错，人有主观性。如何获取高数量和高质量标定数据，或者用机器学习方法只标定 “关键” 数据 (active learning) 值得深入研究 … 5：&#8221;Scalability&#8221; 是互联网的核心问题之一。搜索引擎索引的重要网页超过 100 亿: 如果1台机器每秒处理1000 网页，需要至少100天。所以出现了 MapReduce, MPI, Spark, Pegasus, Pregel, Hama &#8230; 等分布式计算构架。选择什么样的计算平台，和算法设计紧密相关 … 6：“速度” 是互联网核心的用户体验。线下模型训练可以花费很长时间：比如，Google 某个模型更新一次需要几千台机器，大约训练半年时间。但是，线上使用模型的时候 要求一定要 “快，实时 (real-time)” … 7： 互联网 每时每刻 都在产生大量新数据，要求模型随之不停更新，所以 &#8220;online learning&#8221; 是机器学习的一个重要研究方向。人也是一样: Life is an [...]<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Hunch推荐系统——能帮你更好地挑选礼物吗？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fhunch-algorithm.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802348.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Hunch推荐系统——能帮你更好地挑选礼物吗？</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="算法学术 | Resys China" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2Fcategory%2Falgorithm&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">算法学术 | Resys China</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="YouTube转向Amazon的推荐算法" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Fyoutube_uses_amazon_algorithm.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">YouTube转向Amazon的推荐算法</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="PayPal创始人：兴趣图谱模式将造就下一个互联网巨头" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Finterest-graph.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/16/23099235.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">PayPal创始人：兴趣图谱模式将造就下一个互联网巨头</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="StumbleUpon新功能：从内容推荐到应用推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fstumbleupon-app-recsys.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/28/26050677.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">StumbleUpon新功能：从内容推荐到应用推荐</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Social Media Algorithm: Hacker News" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2009%2Fsocial_media_algorithm_hackernews.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1801986.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Social Media Algorithm: Hacker News (@guwendong)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Social Media Algorithm: StumbleUpon" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2009%2Fsocial_media_algorithm_stumbleupon.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/20/23627877.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Social Media Algorithm: StumbleUpon (@guwendong)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="互联网应用周刊：卡小包、狠狠推、爱美图、你拍我猜等" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F73159.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/05/14/26253353.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">互联网应用周刊：卡小包、狠狠推、爱美图、你拍我猜等 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="互联网应用周刊：畅无线、黄金时代、爱班网、板报等" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F73018.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/05/06/25014432.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">互联网应用周刊：畅无线、黄金时代、爱班网、板报等 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐系统：关联规则(3) —— FP-Growth 算法" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2008%2Ffpgrowth_algorithm.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802009.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐系统：关联规则(3) —— FP-Growth 算法 (@guwendong)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p>原内容出自：@张栋_机器学习 <a id="internal-source-marker_0.9130577763102338" href="http://weibo.com/1686830902/xAkL3EV4p">http://weibo.com/1686830902/xAkL3EV4p</a></p>
<p><a href="http://weibo.com/1686830902/xAkL3EV4p"><img class="alignnone" title="机器学习在互联网应用面临的 10 大挑战" src="http://ww2.sinaimg.cn/large/648aff36gw1dnpma0avpmj.jpg" alt="" width="852" height="616" /></a></p>
<p>1： “数据稀释性”：训练一个模型，需要大量（标注）数据，但是数据往往比较稀疏。比如，我们想训练一个模型表征某人  “购物兴趣”，但是这个人在网站上浏览行为很少，购物历史很少，很难训练出一个 &#8220;meaningful model&#8221;  来预测应该给这个人推荐什么商品等 &#8230;</p>
<p>2：“不平稳随机过程产生的数据”：机器学习模型往往假设数据的产生是 “平稳随机过程”。但是有些互联网数据（比如 spam 邮件，spam 网站等）的产生是动态的，不平稳随机过程 …</p>
<p>3～4：高数量和高质量 “标定数据” 是机器学习效果的保障，但是获取标定数据需要耗费大量人力和财力。而且，人会出错，人有主观性。如何获取高数量和高质量标定数据，或者用机器学习方法只标定 “关键” 数据 (active learning) 值得深入研究 …</p>
<p>5：&#8221;Scalability&#8221;  是互联网的核心问题之一。搜索引擎索引的重要网页超过 100 亿: 如果1台机器每秒处理1000 网页，需要至少100天。所以出现了  MapReduce, MPI, Spark, Pegasus, Pregel, Hama &#8230;  等分布式计算构架。选择什么样的计算平台，和算法设计紧密相关 …</p>
<p>6：“速度” 是互联网核心的用户体验。线下模型训练可以花费很长时间：比如，Google 某个模型更新一次需要几千台机器，大约训练半年时间。但是，线上使用模型的时候 要求一定要 “快，实时 (real-time)” …</p>
<p>7： 互联网 每时每刻 都在产生大量新数据，要求模型随之不停更新，所以 &#8220;online learning&#8221; 是机器学习的一个重要研究方向。人也是一样:  Life is an online learning process &#8230; &#8220;online learning&#8221; 也是人最重要的能力之一</p>
<p>8：“Cold- Start” (冷启动)  是互联网应用的典型问题：一个好互联网产品，用的人多，得到的数据多；得到的数据越多，模型训练的越好，产品会变得更好用，用的人就会更多 &#8230; 进入  “良性循环”。对于一个新产品，在初期，要面临数据不足的 “冷启动” 问题 &#8230;</p>
<p>9： 机器学习之美在于对于不同的问题需要不同建模方法。我去给讲座的时候，经常听有人说：“SVM 做过，Naive Bayes 做过 &#8230;  但效果不好”；做过很简单，但能否 “做到极致”? 这个世界你能想到别人想不到的事情概率极低，只有一件你花十倍努力做到极致的事情！</p>
<p>10：“Human  + Machine” 机器学习应用在一个特定领域，需要特定领域的专家知识：机器学习应用在 “医学”，需要一个经验丰富的医生；应用在  “股票”，需要一个有10年以上经验的操盘手；应用在 “互联网”，需要一个上网时间超过 1万小时的 PM &#8230;</p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Hunch推荐系统——能帮你更好地挑选礼物吗？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fhunch-algorithm.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802348.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Hunch推荐系统——能帮你更好地挑选礼物吗？</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="算法学术 | Resys China" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2Fcategory%2Falgorithm&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">算法学术 | Resys China</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="YouTube转向Amazon的推荐算法" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Fyoutube_uses_amazon_algorithm.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">YouTube转向Amazon的推荐算法</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="PayPal创始人：兴趣图谱模式将造就下一个互联网巨头" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Finterest-graph.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/16/23099235.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">PayPal创始人：兴趣图谱模式将造就下一个互联网巨头</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="StumbleUpon新功能：从内容推荐到应用推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fstumbleupon-app-recsys.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/28/26050677.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">StumbleUpon新功能：从内容推荐到应用推荐</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Social Media Algorithm: Hacker News" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2009%2Fsocial_media_algorithm_hackernews.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1801986.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Social Media Algorithm: Hacker News (@guwendong)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Social Media Algorithm: StumbleUpon" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2009%2Fsocial_media_algorithm_stumbleupon.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/20/23627877.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Social Media Algorithm: StumbleUpon (@guwendong)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="互联网应用周刊：卡小包、狠狠推、爱美图、你拍我猜等" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F73159.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/05/14/26253353.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">互联网应用周刊：卡小包、狠狠推、爱美图、你拍我猜等 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="互联网应用周刊：畅无线、黄金时代、爱班网、板报等" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F73018.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/05/06/25014432.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">互联网应用周刊：畅无线、黄金时代、爱班网、板报等 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐系统：关联规则(3) —— FP-Growth 算法" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2008%2Ffpgrowth_algorithm.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fmachine_learning_10_challenges.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802009.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐系统：关联规则(3) —— FP-Growth 算法 (@guwendong)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/05/machine_learning_10_challenges.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>围脖局部影响力计算example</title>
		<link>http://www.resyschina.com/2012/05/weibo_spear.html</link>
		<comments>http://www.resyschina.com/2012/05/weibo_spear.html#comments</comments>
		<pubDate>Fri, 04 May 2012 08:43:54 +0000</pubDate>
		<dc:creator>mcgrady164</dc:creator>
				<category><![CDATA[算法学术]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=658</guid>
		<description><![CDATA[最近看了一篇SIGIR 09年的paper，介绍的是如何区分一个用户是expert还是spammer，其实也就是对用户的expertise进行排名。 文中讲到的算法spear，是基于HITS（Hypertext Induced Topic Search）算法改进的，是SPamming-resistant Expertise Analysis and Ranking的缩写。文中的实验数据是用delicious的数据。算法中做了两个基本的假设是 （1）Mutual reinforcement of user expertise and document quality 专家级的用户更加倾向于产生高质量的document，而高质量的document常常由高水平的用户tag （2）Discoverers v.s. followers 专家级的用户往往是一个“发现者”，也就是说他们更加倾向于第一个bookmark和tag高质量的document，然后将这些好的document带入到用户社区中。这个好比高水平的学者。 spear算法将会给早期发现新信息的discovers更多的credit，具体多少credit将会由一个credit function 计算得到。 这幅图介绍了spear算法的具体实现，首先是根据数据集生成一个带权重邻接矩阵和两个分数向量，邻接矩阵中的元素a(i,j)表示第i个用户在第j个document上的credit score。两个向量分别表示用户的expertise score 和 document的quality score。这里score function 是一个可以自己定义的函数，要保证discovers的分数要高于后面followers的分数，但分数的差距又不能差太多。（具体的参数可以参考后面的文献） 然后通过几百次的迭代计算，就可以得到一个收敛的E和Q，也就对user expertise进行了排名。 由于这篇paper是为了证明他们的算法可行，所以不但用了从delicious.com爬下来的真实数据，并且根据user behavior做了六类仿真用户混在真实数据中，以检验算法的效果。 用在我们这里，计算围脖局部影响力的时候，就不必设置仿真用户了。只需要将同一个圈子里的若干users的围脖信息考虑进来做计算就可以了。在选择种子用户的时候，参考了@clickstone的这个结果（http://weibo.com/1641544424/eB2mq72b8ai）。从这个user列表中爬他们关注的user形成整个user圈子，同时爬下他们发布的围脖信息，（围脖的id和timestamp，这里如果是用户原创的围脖则记录该围脖id和该条围脖发布的timestamp；如果这条围脖是用户转发的，则记录被转发的这条原始围脖的id以及该用户转发这条围脖时的timestamp）这样的话，我拿到的数据格式就是（timestamp,userid,weiboid），这里围脖对应着paper中的webpage。然后用spear算法计算得到了下面的结果： http://expertise.sinaapp.com/show.php 这里是top50的排名。大家可以对比一下一年前@clickstone的计算结果 http://weibo.com/1641544424/eB2mq72b8ai 参考文献（点我下载） 1、Telling Experts from Spammers:Expertise Ranking in Folksonomies 2、SPEAR:Spamming-Resistant Expertise Analysis and Ranking [...]<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="尝试pagerank做围脖局部影响力计算" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">尝试pagerank做围脖局部影响力计算</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="社交媒体流量：Facebook给力？Stumbleupon更给力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fstumbleupon-facebook.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1815225.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">社交媒体流量：Facebook给力？Stumbleupon更给力</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐解释可以增加用户的信任" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Freason-increase-user-trust.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802352.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐解释可以增加用户的信任</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="转载：探析数字音乐个性化推荐系统" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fpersonal-music-resys.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802354.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">转载：探析数字音乐个性化推荐系统</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p>最近看了一篇SIGIR 09年的paper，介绍的是如何区分一个用户是expert还是spammer，其实也就是对用户的expertise进行排名。</p>
<p>文中讲到的算法spear，是基于HITS（Hypertext Induced Topic Search）算法改进的，是SPamming-resistant Expertise Analysis and Ranking的缩写。文中的实验数据是用delicious的数据。算法中做了两个基本的假设是</p>
<p>（1）Mutual reinforcement of user expertise and document quality</p>
<p>专家级的用户更加倾向于产生高质量的document，而高质量的document常常由高水平的用户tag</p>
<p>（2）Discoverers v.s. followers</p>
<p>专家级的用户往往是一个“发现者”，也就是说他们更加倾向于第一个bookmark和tag高质量的document，然后将这些好的document带入到用户社区中。这个好比高水平的学者。</p>
<p><img class="aligncenter" src="http://www.michael-noll.com/blog/uploads/SPEAR-algorithm-discoverers-followers-600x258.png" alt="" width="600" height="258" />spear算法将会给早期发现新信息的discovers更多的credit，具体多少credit将会由一个credit function 计算得到。</p>
<p><img class="aligncenter" src="http://www.michael-noll.com/blog/uploads/SPEAR-algorithm-computation-600x386.png" alt="" width="600" height="386" /></p>
<p>这幅图介绍了spear算法的具体实现，首先是根据数据集生成一个带权重邻接矩阵和两个分数向量，邻接矩阵中的元素a(i,j)表示第i个用户在第j个document上的credit score。两个向量分别表示用户的expertise score 和 document的quality score。这里score function 是一个可以自己定义的函数，要保证discovers的分数要高于后面followers的分数，但分数的差距又不能差太多。（具体的参数可以参考后面的文献）</p>
<p>然后通过几百次的迭代计算，就可以得到一个收敛的E和Q，也就对user expertise进行了排名。</p>
<p>由于这篇paper是为了证明他们的算法可行，所以不但用了从delicious.com爬下来的真实数据，并且根据user behavior做了六类仿真用户混在真实数据中，以检验算法的效果。</p>
<p>用在我们这里，计算围脖局部影响力的时候，就不必设置仿真用户了。只需要将同一个圈子里的若干users的围脖信息考虑进来做计算就可以了。在选择种子用户的时候，参考了@clickstone的这个结果（<a href="http://weibo.com/1641544424/eB2mq72b8ai">http://weibo.com/1641544424/eB2mq72b8ai</a>）。从这个user列表中爬他们关注的user形成整个user圈子，同时爬下他们发布的围脖信息，（围脖的id和timestamp，这里如果是用户原创的围脖则记录该围脖id和该条围脖发布的timestamp；如果这条围脖是用户转发的，则记录被转发的这条原始围脖的id以及该用户转发这条围脖时的timestamp）这样的话，我拿到的数据格式就是（timestamp,userid,weiboid），这里围脖对应着paper中的webpage。然后用spear算法计算得到了下面的结果：</p>
<p><a href="http://expertise.sinaapp.com/show.php">http://expertise.sinaapp.com/show.php</a></p>
<p>这里是top50的排名。大家可以对比一下一年前@clickstone的计算结果</p>
<p><a href="http://weibo.com/1641544424/eB2mq72b8ai">http://weibo.com/1641544424/eB2mq72b8ai</a></p>
<p>参考文献（<a href="http://vdisk.weibo.com/s/4psnN/1335001090" target="_blank">点我下载</a>）</p>
<p>1、Telling Experts from Spammers:Expertise Ranking in Folksonomies</p>
<p>2、SPEAR:Spamming-Resistant Expertise Analysis and Ranking in Collaborative Tagging Systems</p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="尝试pagerank做围脖局部影响力计算" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%25B0%259D%25E8%25AF%2595pagerank%25E5%2581%259A%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">尝试pagerank做围脖局部影响力计算</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="社交媒体流量：Facebook给力？Stumbleupon更给力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fstumbleupon-facebook.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1815225.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">社交媒体流量：Facebook给力？Stumbleupon更给力</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="提问的智慧：利用决策树进行推荐系统新用户引导" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F03%2Fusing_decision_tree_for_bootstrapping.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/12/5492733.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">提问的智慧：利用决策树进行推荐系统新用户引导</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="StumbleUpon新功能：从内容推荐到应用推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fstumbleupon-app-recsys.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/08/28/26050677.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">StumbleUpon新功能：从内容推荐到应用推荐</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Solving the Crisis of Choice Online-解决在线选择的危机" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fsolving-the-crisis-of-choice-online.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802248.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Solving the Crisis of Choice Online-解决在线选择的危机</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/05/weibo_spear.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Early Amazon: Inventory Cache</title>
		<link>http://www.resyschina.com/2012/03/early-amazon-inventory-cache.html</link>
		<comments>http://www.resyschina.com/2012/03/early-amazon-inventory-cache.html#comments</comments>
		<pubDate>Thu, 29 Mar 2012 02:49:31 +0000</pubDate>
		<dc:creator>raully7</dc:creator>
				<category><![CDATA[亚马逊]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=634</guid>
		<description><![CDATA[原文链接：http://glinden.blogspot.com/2006/01/early-amazon-inventory-cache.html 原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。 著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。 其 Blog – Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。 类似谷歌的20%时间，亚马逊的员工有时也会做一些不如日常事务那么紧要的额外工作。 我当时负责了好几个项目，但我想跳出来，以掌握更多部分的代码。只是读代码过于无聊了，我需要做一些特别的事情，让我更深的理解他们。 所以空闲时候，我开始试着做性能优化。主要关注于那些高流量的页面——首页、书籍详情页、搜索结果页。我问自己，obidos（注1：亚马逊的页面渲染引擎）把时间耗在什么鬼地方了？ 很快我发现了些有趣的东西。第一个上手改造的是购物车。 当你走进一个超市，你做的第一件事很可能就是推一个购物车。同样当人们访问亚马逊时，系统做的第一件事情也是给他们分配一个购物车——在数据库里开辟部分空间以存储购买的商品。 然而，超市并不需要应付大量机器访问或多窗口购物的情况；如果需要，他们也得有更多的购物车，而且几乎每个都是空的。 考虑这些并不实际购买的情况，延后到第一次购买物品时再分配购物车就变得很有意义了。把所有购物车的花销加起来，这个小改动带来的帮助比想像的要大的多。 另一个更大的问题是实时库存检查。当你查找一本书时，亚马逊检查仓库货架内是否有存货。如果没找到，他会再查找过多久才能订购这本书。所有这些都是实时完成的。 这是书籍详情页中最耗时的操作。库存检查是个丑陋的业务逻辑。 但我们真的需要实时的信息吗？也许用几分钟前的结果就够了。恩，是的，可以缓存数据，稍有延迟并不是什么大问题。 因为是用业余时间做这件事情，我从一些并不太大的改动开始。我认为，有多大压力是由网站业务决定的，我要做的是把加锁减到最少。同时我认为能通过缓存预加载，使得用户在缓存刷新时也没有任何延迟。 我鼓捣出的东西似乎运作良好。在测试中，访问延迟从一个很大值降低到了接近零。我开始同其他人讨论这个原型，听取他们的改进意见。 正巧，他们正在对亚马逊进行重新设计，包括彻底的改造和增加新功能。有人找到我希望能在搜索结果页也显示书的库存情况，然而没有缓存这是完全不可能实现的。除非能很快把我的原型实现并弄上线。我们也确实这么做了。 当然，这些如今看来是过时的。当我实现这个库存缓存时，它被设计为一个西雅图的小仓库工作，运行在单台大铁壳（honkin&#8217; iron）服务器上。如今海量的货物散步在若干个巨大的配送中心中——其中一些甚至大的能装下13个足球场——通过切换到一个商品服务器集群能在几秒内获得结果，这最终使得旧缓存不再适用。他运转良好到超出了他的时代，长到年轻时的功绩躺在自己的衰老下被遗忘。（It lasted well beyond its time, so long that the heroics of its youth lay forgotten under the problems of its senility.） [...]<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Gravity“兴趣图”能否实现互联网个性化？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fgravity-interest-graph.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802345.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Gravity“兴趣图”能否实现互联网个性化？</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="转载：少数人的智慧（The wisdom of the Few）" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fthe-wisdom-of-the-few.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/03/10/3351344.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">转载：少数人的智慧（The wisdom of the Few）</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="围脖局部影响力计算example" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2F%25E5%259B%25B4%25E8%2584%2596%25E5%25B1%2580%25E9%2583%25A8%25E5%25BD%25B1%25E5%2593%258D%25E5%258A%259B%25E8%25AE%25A1%25E7%25AE%2597example.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/04/21/21668404.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">围脖局部影响力计算example</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="也吐下新浪微博“你可能感兴趣的人”改版的槽" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2F%25E4%25B9%259F%25E5%2590%2590%25E4%25B8%258B%25E6%2596%25B0%25E6%25B5%25AA%25E5%25BE%25AE%25E5%258D%259A%25E2%2580%259C%25E4%25BD%25A0%25E5%258F%25AF%25E8%2583%25BD%25E6%2584%259F%25E5%2585%25B4%25E8%25B6%25A3%25E7%259A%2584%25E4%25BA%25BA%25E2%2580%259D%25E6%2594%25B9%25E7%2589%2588%25E7%259A%2584%25E6%25A7%25BD.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802364.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">也吐下新浪微博“你可能感兴趣的人”改版的槽</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Reculike : 开源论文推荐系统" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2Freculike.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/03/27412342.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Reculike : 开源论文推荐系统</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p><span>原文链接：<a href="http://glinden.blogspot.com/2006/01/early-amazon-inventory-cache.html">http://glinden.blogspot.com/2006/01/early-amazon-inventory-cache.html</a></span></p>
<blockquote><p>原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。<br />
著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。<br />
其 Blog – Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。</p></blockquote>
<p><span>类似谷歌的20%时间，亚马逊的员工有时也会做一些不如日常事务那么紧要的额外工作。</span></p>
<p><span>我当时负责了好几个项目，但我想跳出来，以掌握更多部分的代码。只是读代码过于无聊了，我需要做一些特别的事情，让我更深的理解他们。</span></p>
<p><span>所以空闲时候，我开始试着做性能优化。主要关注于那些高流量的页面——首页、书籍详情页、搜索结果页。我问自己，<a href="http://www.guwendong.com/post/2010/early_amazon_1.html">obidos</a>（注1：亚马逊的页面渲染引擎）把时间耗在什么鬼地方了？</span></p>
<p><span><span>很快我发现了些有趣的东西。第一个上手改造的是购物车。</span></span></p>
<p><span><span><span>当你走进一个超市，你做的第一件事很可能就是推一个购物车。同样当人们访问亚马逊时，系统做的第一件事情也是给他们分配一个购物车——在数据库里开辟部分空间以存储购买的商品。</span></span></span></p>
<p>然而，超市并不需要应付大量机器访问或多窗口购物的情况；如果需要，他们也得有更多的购物车，而且几乎每个都是空的。</p>
<p>考虑这些并不实际购买的情况，延后到第一次购买物品时再分配购物车就变得很有意义了。把所有购物车的花销加起来，这个小改动带来的帮助比想像的要大的多。</p>
<p>另一个更大的问题是实时库存检查。当你查找一本书时，亚马逊检查仓库货架内是否有存货。如果没找到，他会再查找过多久才能订购这本书。所有这些都是实时完成的。</p>
<p>这是书籍详情页中最耗时的操作。库存检查是个丑陋的业务逻辑。</p>
<p>但我们真的需要实时的信息吗？也许用几分钟前的结果就够了。恩，是的，可以缓存数据，稍有延迟并不是什么大问题。</p>
<p>因为是用业余时间做这件事情，我从一些并不太大的改动开始。我认为，有多大压力是由网站业务决定的，我要做的是把加锁减到最少。同时我认为能通过缓存预加载，使得用户在缓存刷新时也没有任何延迟。</p>
<p>我鼓捣出的东西似乎运作良好。在测试中，访问延迟从一个很大值降低到了接近零。我开始同其他人讨论这个原型，听取他们的改进意见。</p>
<p>正巧，他们正在对亚马逊进行重新设计，包括彻底的改造和增加新功能。有人找到我希望能在搜索结果页也显示书的库存情况，然而没有缓存这是完全不可能实现的。除非能很快把我的原型实现并弄上线。我们也确实这么做了。</p>
<p>当然，这些如今看来是过时的。当我实现这个库存缓存时，它被设计为一个西雅图的小仓库工作，运行在单台大铁壳（honkin&#8217; iron）服务器上。如今海量的货物散步在若干个巨大的配送中心中——其中一些甚至大的能装下13个足球场——通过切换到一个商品服务器集群能在几秒内获得结果，这最终使得旧缓存不再适用。他运转良好到超出了他的时代，长到年轻时的功绩躺在自己的衰老下被遗忘。（It lasted well beyond its time, so long that the heroics of its youth lay forgotten under the problems of its senility.）</p>
<p>这里我将库存缓存作为个人自由时间带来收益的若干例子中的一个。 20％时间产生了远远超出其比例的价值。</p>
<p>注1：obidos是amazon早期的页面渲染引擎，一直被使用到06年8月底。因临近亚马逊河的小镇<a href="http://en.wikipedia.org/wiki/%C3%93bidos,_Par%C3%A1">Óbidos</a>而得名。<a href="http://en.wikipedia.org/wiki/Obidos_(software)">http://en.wikipedia.org/wiki/Obidos_(software)</a></p>
<hr />Greg的这系列文章，很多都是回忆他20%时间里鼓捣的事情。也侧面说明了真激情确实可以一辈子。</p>
<div>月初James Whittaker离开谷歌时，也专门写了一篇<a href="http://blogs.msdn.com/b/jw_on_tech/archive/2012/03/13/why-i-left-google.aspx">吐槽文</a>。其中一段回忆到昔日公司的创新气氛：</div>
<blockquote>
<div>In such an environment you don’t have to be part of some executive’s inner circle to succeed. You don’t have to get lucky and land on a sexy project to have a great career. Anyone with ideas or the skills to contribute could get involved.</div>
</blockquote>
<p>20%时间，一直我们这些国内工程师艳羡和乐于谈论的，似乎总是可望不可及。环境不论，很多事情其实可以从自己做起。</p>
<p><a href="http://www.resyschina.com/wp-content/uploads/2012/03/20-time1.jpg"><img class="size-full wp-image-649 alignleft" src="http://www.resyschina.com/wp-content/uploads/2012/03/20-time1.jpg" alt="" width="500" height="357" /></a></p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="提问的智慧：利用决策树进行推荐系统新用户引导" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F03%2Fusing_decision_tree_for_bootstrapping.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/12/5492733.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">提问的智慧：利用决策树进行推荐系统新用户引导</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="浅析新闻推荐及个性化推荐的领域相关性" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fnews_recommendation.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802322.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">浅析新闻推荐及个性化推荐的领域相关性</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="社交媒体流量：Facebook给力？Stumbleupon更给力" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fstumbleupon-facebook.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/05/1815225.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">社交媒体流量：Facebook给力？Stumbleupon更给力</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="信息过载：最佳过滤器是相关性还是人气？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Frelavence-or-hotrank.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/30/2474442.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">信息过载：最佳过滤器是相关性还是人气？</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="围脖局部影响力计算example" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F05%2Fweibo_spear.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-inventory-cache.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/04/21/21668404.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">围脖局部影响力计算example</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/03/early-amazon-inventory-cache.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Early Amazon: boy-am-i-hard-to-please</title>
		<link>http://www.resyschina.com/2012/03/early-amazon-boy-am-i-hard-to-please.html</link>
		<comments>http://www.resyschina.com/2012/03/early-amazon-boy-am-i-hard-to-please.html#comments</comments>
		<pubDate>Tue, 20 Mar 2012 00:25:01 +0000</pubDate>
		<dc:creator>clickstone</dc:creator>
				<category><![CDATA[亚马逊]]></category>
		<category><![CDATA[精华]]></category>
		<category><![CDATA[amazon]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=630</guid>
		<description><![CDATA[原文链接：http://glinden.blogspot.jp/2006/01/early-amazon-boy-am-i-hard-to-please.html 原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。 著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。 其 Blog &#8211; Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。 亚马逊早期的代码中，有相当一部分是由最早的两位员工编写的。他们的痕迹在 obidos 系统及其相关工具中随处可见。 可想而知他们的工作很多，但令人惊讶的是，起码我很难想象是如何办到的，他们居然还有时间进行其他一些有趣的边缘项目。我最感兴趣的一个叫做「Eyes」。 Eyes 曾经非常有用。它允许读者登记一个 Email 地址，设定一个查询条件，当有某本新书满足这个查询条件的时候，Eyes 会自动给读者发送一封 Email 通知。这种获取新书消息的方式非常棒，尤其对于写实文学来讲。 介绍 Eyes 的文案很有意思。它是这么写的： Eyes 是你的全自动搜索器，它很神奇。告诉它你喜欢哪些作者，喜欢哪些主题，它就会自动匹配这些设定，跟踪每一本你感兴趣的新书。注册 Eyes 吧，当那些你关心的书籍上市的时候，我们就会自动发送 Email 通知到你。 好吧，如果你认为这项免费服务不够酷或者没用的话，请发送邮件到 boy-am-I-hard-to-please@amazon.com，告诉我们为什么。 Eyes 服务了许多年，直到后来被 Amazon Alerts 所取代，Amazon Alerts 不可谓不是一个好服务，但与 Eyes 相比，总好像是少了些什么。 通常，停留在记忆里面的就是这些好玩儿的事情。在我创建 Findory 的最早期，一个客服邮件地址就是 boy-am-i-hard-to-please@findory.com。 要有爱，要做让自己快乐的事情。<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Door desks" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/02/22/15874032.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Door desks</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: BookMatcher" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/03/16434590.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: BookMatcher</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p>原文链接：<a href="http://glinden.blogspot.jp/2006/01/early-amazon-boy-am-i-hard-to-please.html" target="_blank">http://glinden.blogspot.jp/2006/01/early-amazon-boy-am-i-hard-to-please.html</a></p>
<blockquote><p>原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。<br />
著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。<br />
其 Blog &#8211; Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。</p></blockquote>
<p>亚马逊早期的代码中，有相当一部分是由最早的两位员工编写的。他们的痕迹在 <a href="http://www.guwendong.com/post/2010/early_amazon_1.html" target="_self">obidos</a> 系统及其相关工具中随处可见。</p>
<p>可想而知他们的工作很多，但令人惊讶的是，起码我很难想象是如何办到的，他们居然还有时间进行其他一些有趣的边缘项目。我最感兴趣的一个叫做「Eyes」。</p>
<p>Eyes 曾经非常有用。它允许读者登记一个 Email 地址，设定一个查询条件，当有某本新书满足这个查询条件的时候，Eyes 会自动给读者发送一封 Email 通知。这种获取新书消息的方式非常棒，尤其对于写实文学来讲。</p>
<p>介绍 Eyes 的文案很有意思。它是这么写的：</p>
<blockquote><p>Eyes 是你的全自动搜索器，它很神奇。告诉它你喜欢哪些作者，喜欢哪些主题，它就会自动匹配这些设定，跟踪每一本你感兴趣的新书。注册 Eyes 吧，当那些你关心的书籍上市的时候，我们就会自动发送 Email 通知到你。</p>
<p>好吧，如果你认为这项免费服务不够酷或者没用的话，请发送邮件到 boy-am-I-hard-to-please@amazon.com，告诉我们为什么。</p></blockquote>
<p>Eyes 服务了许多年，直到后来被 <a href="http://www.amazon.com/exec/obidos/tg/browse/-/569160" target="_blank">Amazon Alerts</a> 所取代，Amazon Alerts 不可谓不是一个好服务，但与 Eyes 相比，总好像是少了些什么。</p>
<p>通常，停留在记忆里面的就是这些好玩儿的事情。在我创建 Findory 的最早期，一个客服邮件地址就是 boy-am-i-hard-to-please@findory.com。</p>
<p><a href="http://www.guwendong.com/wp-content/uploads/2012/03/You-cant-please-anyone.jpg"><img title="You-cant-please-anyone" src="http://www.guwendong.com/wp-content/uploads/2012/03/You-cant-please-anyone.jpg" alt="" width="500" height="343" /></a></p>
<p>要有爱，要做让自己快乐的事情。</p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Door desks" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/02/22/15874032.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Door desks</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: BookMatcher" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/03/16434590.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: BookMatcher</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/03/early-amazon-boy-am-i-hard-to-please.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Early Amazon: BookMatcher</title>
		<link>http://www.resyschina.com/2012/03/early-amazon-bookmatcher.html</link>
		<comments>http://www.resyschina.com/2012/03/early-amazon-bookmatcher.html#comments</comments>
		<pubDate>Fri, 02 Mar 2012 20:35:02 +0000</pubDate>
		<dc:creator>clickstone</dc:creator>
				<category><![CDATA[亚马逊]]></category>
		<category><![CDATA[精华]]></category>
		<category><![CDATA[amazon]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=624</guid>
		<description><![CDATA[原文链接：http://glinden.blogspot.com/2006/01/early-amazon-bookmatcher.html 原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。 著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。 其 Blog &#8211; Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。 在参加亚马逊面试的时候，我提出了很多想法。我列举了一坨我认为可以改进亚马逊网站的方法，其中就包括书籍推荐系统。 我喜欢读书。很中意书籍推荐系统这个想法。发现未知的有趣的书籍，这是多么好玩儿的一件事情啊。我想亲手去实现它。 在亚马逊的第一个星期，我很失望，因为我发现已经有人在做书籍推荐系统了。亚马逊网站的早期用户应该会记得，那个玩意儿叫做“BookMatcher”。 BookMatcher 首先要求用户给 20～30 本书打分，然后才会做出推荐。想必你可以猜得到，几乎没人用它。必须给 20+ 本书打分，对大多数用户来说太麻烦了。 BookMatcher 系统由一个外协公司在提供技术服务。杯具的是，它根本不 work。需要给 20+ 本书打分这样的进入门槛只是问题之一。整个推荐方法偏向于热卖的产品而忽略了长尾。系统还很慢。总而言之，这个系统既不可靠，一有压力还老是宕机。 这不是亚马逊需要的系统。亚马逊的书籍推荐应该基于稀疏数据，少量的打分或者购买记录。它需要快速给出结果。它应该能够扩展到大规模用户群以及数目巨大的品类。它应该能够帮助用户发现那些湮没在类目深处的他们无法找到的书籍。 一定有更好的选择！ 补充： 今天有一条讨论很热烈的微博，和这篇文章里面描述的状况有点儿像， @范凯robbin：昨天一位运维人员入职，我们进行了运维系统介绍和培训，以及接下来逐步清理和改造运维的计划和步骤。今天早上他来找我，说我们公司网站服务器运维现状太乱不是他喜欢的环境，辞职了。我理解他的选择，不过亦为他惋惜。一套完全运转良好的运维系统是无法给有进取心的人任何发挥的空间的。 如今国内的创业公司想找到愿意冒些风险迎接挑战的靠谱的人，难如登天。这事儿倒也自然，人各有志，莫强求，看缘份。不过我很同意 @Fenng 的说法，“正所谓，公司的问题，就是员工的机会”。你看，Greg Linden 也是这么走过来的。 去年，我一个很给力的大学同学，被国内某家名称以数字开头的互联网巨头搞去秘密研发手机操作系统，结果怎么样我就不说了，反正现在转去搞 Web App 了，号称下一代移动应用开发框架。。。好吧，其实我想说明的是，大公司也不一定就有多靠谱，还不如我们死磕一件事，从头开始稳扎稳打，只要事情是你喜欢的，百炼成钢。来吧，一定有更好的选择！<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: boy-am-i-hard-to-please" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/20/17678174.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: boy-am-i-hard-to-please</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Door desks" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/02/22/15874032.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Door desks</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon与数据导向" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon与数据导向</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p>原文链接：<a href="http://glinden.blogspot.com/2006/01/early-amazon-bookmatcher.html" target="_blank">http://glinden.blogspot.com/2006/01/early-amazon-bookmatcher.html</a></p>
<blockquote><p>原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。<br />
著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。<br />
其 Blog &#8211; Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。</p></blockquote>
<p>在参加亚马逊面试的时候，我提出了很多想法。我列举了一坨我认为可以改进亚马逊网站的方法，其中就包括书籍推荐系统。</p>
<p>我喜欢读书。很中意书籍推荐系统这个想法。发现未知的有趣的书籍，这是多么好玩儿的一件事情啊。我想亲手去实现它。</p>
<p>在亚马逊的第一个星期，我很失望，因为我发现已经有人在做书籍推荐系统了。亚马逊网站的早期用户应该会记得，那个玩意儿叫做“BookMatcher”。</p>
<p>BookMatcher 首先要求用户给 20～30 本书打分，然后才会做出推荐。想必你可以猜得到，几乎没人用它。必须给 20+ 本书打分，对大多数用户来说太麻烦了。</p>
<p>BookMatcher 系统由一个外协公司在提供技术服务。杯具的是，它根本不 work。需要给 20+ 本书打分这样的进入门槛只是问题之一。整个推荐方法偏向于热卖的产品而忽略了长尾。系统还很慢。总而言之，这个系统既不可靠，一有压力还老是宕机。</p>
<p>这不是亚马逊需要的系统。亚马逊的书籍推荐应该基于稀疏数据，少量的打分或者购买记录。它需要快速给出结果。它应该能够扩展到大规模用户群以及数目巨大的品类。它应该能够帮助用户发现那些湮没在类目深处的他们无法找到的书籍。</p>
<p>一定有更好的选择！</p>
<p><a href="http://www.guwendong.com/wp-content/uploads/2012/03/choice.preview.jpg"><img title="choice.preview" src="http://www.guwendong.com/wp-content/uploads/2012/03/choice.preview.jpg" alt="" width="576" height="427" /></a></p>
<p>补充：<br />
今天有一条讨论很热烈的微博，和这篇文章里面描述的状况有点儿像，</p>
<blockquote><p><a href="http://weibo.com/1654762921/y7z9npNTO" target="_blank">@范凯robbin</a>：昨天一位运维人员入职，我们进行了运维系统介绍和培训，以及接下来逐步清理和改造运维的计划和步骤。今天早上他来找我，说我们公司网站服务器运维现状太乱不是他喜欢的环境，辞职了。我理解他的选择，不过亦为他惋惜。一套完全运转良好的运维系统是无法给有进取心的人任何发挥的空间的。</p></blockquote>
<p>如今国内的创业公司想找到愿意冒些风险迎接挑战的靠谱的人，难如登天。这事儿倒也自然，人各有志，莫强求，看缘份。不过我很同意 <a href="http://weibo.com/1577826897/y82o7d1Zg" target="_blank">@Fenng</a> 的说法，“正所谓，公司的问题，就是员工的机会”。你看，Greg Linden 也是这么走过来的。</p>
<p>去年，我一个很给力的大学同学，被国内某家名称以数字开头的互联网巨头搞去秘密研发手机操作系统，结果怎么样我就不说了，反正现在转去搞 Web App 了，号称下一代移动应用开发框架。。。好吧，其实我想说明的是，大公司也不一定就有多靠谱，还不如我们死磕一件事，从头开始稳扎稳打，只要事情是你喜欢的，百炼成钢。来吧，一定有<a href="http://www.zhiyue.me/team.html">更好的选择</a>！</p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: boy-am-i-hard-to-please" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/20/17678174.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: boy-am-i-hard-to-please</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Door desks" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/02/22/15874032.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Door desks</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon与数据导向" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon与数据导向</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/03/early-amazon-bookmatcher.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Early Amazon: Door desks</title>
		<link>http://www.resyschina.com/2012/02/early-amazon-door-desks.html</link>
		<comments>http://www.resyschina.com/2012/02/early-amazon-door-desks.html#comments</comments>
		<pubDate>Wed, 22 Feb 2012 15:22:10 +0000</pubDate>
		<dc:creator>clickstone</dc:creator>
				<category><![CDATA[亚马逊]]></category>
		<category><![CDATA[精华]]></category>
		<category><![CDATA[amazon]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=619</guid>
		<description><![CDATA[原文链接：http://glinden.blogspot.com/2006/01/early-amazon-door-desks.html 原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。 著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。 其 Blog &#8211; Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。 译者：谷文栋 (clickstone) 亚马逊经常被视为互联网大潮中的弄潮儿，但是与其他拿着 VC 的钱创业的公司不同，它从来没有提供过 Aeron 牌的座椅或者是免费按摩之类的福利。 亚马逊的文化提倡节俭。在我加入亚马逊的时候，它提供的健康计划是大打折扣的，几乎没有什么作用。薪水也非常低，是我印象里我接受过的最低的薪水。 最能代表亚马逊节俭的，就是「木门桌」（door desk）。 哥们儿，你说的是「木门桌」吗？好吧，我现在有一扇木门。那么我如何才能把它搞成一张桌子呢？ 这事儿交给杰夫·贝索斯就好了。买一扇木门，最好是空心木门而且没有预先打孔的那种。整两对 4&#8243; x 4&#8243; x 6&#8242; 的方柱子，再用丑陋的角铁托架把这些柱子固定在门上。最后把它推到一名程序员面前。搞掂，这就是木门桌。 木门桌不仅价钱便宜，而且还可以提供很大的使用面积。放一个显示器在上面看起来根本就没占什么地方。要不是亚马逊实在太抠门的话，你在上面再放三个显示器也没问题。有任何东西你就尽管往上面堆，全部 hold 住。 从人体工程学的角度讲，木门桌有太多可吐槽的地方。键盘的位置太高了，敲的时间长了很不舒服。那些角铁的边缘很尖，一不小心剐在肉上就会很杯具，现在想起来简直是不堪回首啊。。。 尽管如此，木门桌后来仍然成为了亚马逊节俭文化的一个符号。它们演绎着自己的传说。多年后，在 2001 年，西雅图发生了一起 6.8 级的地震。那时我们已经搬到了另外的一幢建筑里——位于西雅图市中心灯塔山上面的风景秀丽的 PacMed 大楼——但是依旧用着木门桌。彼时彼刻，当整栋大楼在震中摇曳的时候，我们躲在木门桌的下面，庆幸之情简直是难以言表啊～ 补充： 1998 年，在接受 Seattle Times 访问的时候，杰夫·贝索斯曾经说过， 木门桌是亚马逊节俭文化的象征，它代表着一种思考问题的方式。对亚马逊非常重要的是，我们必须确认我们把钱花在了提升购物用户体验的事情上。我们鼓励自力更生的文化。（使用这些简易的木桌）&#8230; 我们可以省下很多钱。 [...]<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: BookMatcher" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/03/16434590.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: BookMatcher</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: boy-am-i-hard-to-please" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/20/17678174.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: boy-am-i-hard-to-please</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p>原文链接：<a href="http://glinden.blogspot.com/2006/01/early-amazon-door-desks.html" target="_blank">http://glinden.blogspot.com/2006/01/early-amazon-door-desks.html</a></p>
<blockquote><p>原文作者 Greg Linden 毕业于华盛顿大学计算机学院，1997 年加入 Amazon，开发了享誉业界的 Amazon 推荐引擎。<br />
著名的 Item-based 推荐算法的提出者之一；Findory.com 创始人。<br />
其 Blog &#8211; Geeking with Greg 是个性化推荐领域最有影响力的博客（没有之一）。</p></blockquote>
<p>译者：<a href="http://guwendong.com/" target="_blank">谷文栋 (clickstone)</a></p>
<p>亚马逊经常被视为互联网大潮中的弄潮儿，但是与其他拿着 VC 的钱创业的公司不同，它从来没有提供过 Aeron 牌的座椅或者是免费按摩之类的福利。</p>
<p>亚马逊的文化提倡节俭。在我加入亚马逊的时候，它提供的健康计划是大打折扣的，几乎没有什么作用。薪水也非常低，是我印象里我接受过的最低的薪水。</p>
<p>最能代表亚马逊节俭的，就是「木门桌」（door desk）。</p>
<p>哥们儿，你说的是「木门桌」吗？好吧，我现在有一扇木门。那么我如何才能把它搞成一张桌子呢？</p>
<p>这事儿交给杰夫·贝索斯就好了。买一扇木门，最好是空心木门而且没有预先打孔的那种。整两对 4&#8243; x 4&#8243; x 6&#8242; 的方柱子，再用丑陋的角铁托架把这些柱子固定在门上。最后把它推到一名程序员面前。搞掂，这就是木门桌。</p>
<p>木门桌不仅价钱便宜，而且还可以提供很大的使用面积。放一个显示器在上面看起来根本就没占什么地方。要不是亚马逊实在太抠门的话，你在上面再放三个显示器也没问题。有任何东西你就尽管往上面堆，全部 hold 住。</p>
<p>从人体工程学的角度讲，木门桌有太多可吐槽的地方。键盘的位置太高了，敲的时间长了很不舒服。那些角铁的边缘很尖，一不小心剐在肉上就会很杯具，现在想起来简直是不堪回首啊。。。</p>
<p>尽管如此，木门桌后来仍然成为了亚马逊节俭文化的一个符号。它们演绎着自己的传说。多年后，在 2001 年，西雅图发生了一起 6.8 级的地震。那时我们已经搬到了另外的一幢建筑里——位于西雅图市中心灯塔山上面的风景秀丽的 PacMed 大楼——但是依旧用着木门桌。彼时彼刻，当整栋大楼在震中摇曳的时候，我们躲在木门桌的下面，庆幸之情简直是难以言表啊～</p>
<p>补充：</p>
<p>1998 年，在接受 <a href="http://community.seattletimes.nwsource.com/archive/?date=19980726&amp;slug=2763230" target="_blank">Seattle Times</a> 访问的时候，杰夫·贝索斯曾经说过，</p>
<blockquote><p>木门桌是亚马逊节俭文化的象征，它代表着一种思考问题的方式。对亚马逊非常重要的是，我们必须确认我们把钱花在了提升购物用户体验的事情上。我们鼓励自力更生的文化。（使用这些简易的木桌）&#8230; 我们可以省下很多钱。</p></blockquote>
<p><a href="http://www.guwendong.com/wp-content/uploads/2012/02/door-desk.jpg"><img title="door-desk" src="http://www.guwendong.com/wp-content/uploads/2012/02/door-desk.jpg" alt="" width="500" height="333" /></a><br />
图片来源：<a href="https://secure.flickr.com/photos/wonderlane/5042548829/" target="_blank">https://secure.flickr.com/photos/wonderlane/5042548829/</a></p>
<p>从 <a href="http://www.dbanotes.net/jobs/howto_choose_company_style.html">Fenng 的文章</a>里看到王建硕的一句话，「小公司总有一天会淘汰现在的大公司，自己也会变成大公司，否则的话，我们现在看到的都是那些恐龙级的企业了」。十四年前用破木门当办公桌的这家公司，如今成为了全世界最富有创新力最受人尊敬的公司之一。对不确定性的恐惧是大多数人的天性。一年多之前，和 Facebook 的朋友聊天，他们的危机感与努力程度让我吃惊。牛逼如 Facebook 尚且如此，不要奢望哪家创业公司能许诺你一个未来，这要靠大家一起去打拼。人这一辈子至少要工作二十年，抽出两三年到一家创业公司去尝试一下<a href="http://www.zhiyue.me/team.html" target="_blank">别样的体验</a>，这是人生的财富。</p>
<p>另外，在十四年前的 1998 年，贝索斯说过这样一段话，我不翻译了，原汁原味。</p>
<blockquote><p>&#8220;Not only do you have to avoid the bad ideas, but you have to avoid many of the good ideas for reasons of focus,&#8221; said Bezos. &#8220;Ideas are important, but they are relatively easy. What&#8217;s hard is taking that list of hundred ideas and ranking them and picking the three that we&#8217;re actually gonna do. That&#8217;s intellectually one of the most challenging things that happens every day in a company growing this fast, that sort of brutal triage of ideas.&#8221;</p></blockquote>
<p>创业以来，我深刻地体会到了什么叫做「知易行难」。道理永远是这么简单，但拼尽全力，才能把握一二。</p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: BookMatcher" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/03/16434590.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: BookMatcher</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: boy-am-i-hard-to-please" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Fearly-amazon-door-desks.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/20/17678174.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: boy-am-i-hard-to-please</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/02/early-amazon-door-desks.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Amazon与数据导向</title>
		<link>http://www.resyschina.com/2012/02/amazon-data-oriented.html</link>
		<comments>http://www.resyschina.com/2012/02/amazon-data-oriented.html#comments</comments>
		<pubDate>Mon, 06 Feb 2012 02:25:08 +0000</pubDate>
		<dc:creator>clickstone</dc:creator>
				<category><![CDATA[亚马逊]]></category>
		<category><![CDATA[产品评论]]></category>
		<category><![CDATA[amazon]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=615</guid>
		<description><![CDATA[我对个性化推荐技术的关注，是从 Amazon 开始的，因此，但凡是与 Amazon 个性化推荐相关的内容，我都比较有兴趣。知乎上有一个问题，《为什么 Amazon 的首页很少有促销？》，聚集了不少有益的回答。下面这个是我的回答， 世界的进步靠的是异类，美国线下商场也搞促销，Amazon 也一样可以依靠促销策略。在还没有改变世界之前就真心相信自己可以改变世界的人，要么是傻逼要么是真牛逼。Amazon 的 CEO 是杰夫·贝索斯，换个人不一定这样。keso 说的牛逼推荐算法是结果，长尾理论的兴起也是结果，在别人都不明白怎么回事儿的情况下，贝索斯一直坚持在做，这个是 Amazon 成为推荐系统之王以及可以引领云计算潮流的原因。 另外，大环境及所处的阶段也很重要。开始的时候，其实美国人也不看好 Amazon，亏了那么多年钱，所以相对的竞争不像现在国内电商这么惨烈，凡夫俗子们给了贝索斯足够的时间去建立竞争壁垒。年初的时候，淘宝赞助 resyschina 搞推荐系统大会， 和淘宝朋友的交流发现，淘宝不可谓不重视技术。但如果你看看去年光棍节淘宝大促当日的成交金额，你会发现，你投入几百个工程师花很长时间搞出来的东西，在 这些面前不值一晒，更何况，竞争对手中还有大票的普通青年一个劲儿地在搞秒杀和限时抢购，你会怎么选择？长远收益在眼前利益与竞争压力之前，没几个人可以扛得住。 最后，回答这个问题，“为什么 Amazon 的首页很少有促销活动”？贝索斯曾经说，“我们不是通过卖东西赚钱，而是通过帮助消费者做出更好的购买决策而赚钱”。Amazon，它和我们不在一个时代。 我没在 Amazon 工作过，上面的回答全凭揣测，说是 YY 也行。。。上周，前 Amazon 推荐引擎小组成员，目前个性化阅读社区无觅网的创始人 @林承仁 ，“从内部人员的角度补充了两点”： 1. 首页的空间这么有限，每个下面的部门都想把自己做的内容放上去（一旦霸占了首页的一块可能整年的目标就达到了），那么到底该显示哪些内容呢？Amazon 里面有一套机制叫King of The Hill Campaign，粗略的意思就是你们谁都有机会，但谁带来的价值越大展示的机会就越多。放到首页的内容当然也包括促销活动（比如Gold Box等），但其表现往往比推荐引擎差得多。 2. 我们在内部开发推荐引擎的目标是在serendipity（惊喜、发现），不单是让用户买更多的商品，而是让用户发现更多他感兴趣但他不知道的商品。这两者有很大的区别。 结论：为什么Amazon的首页很少有促销活动？数据导向。 后续我们在微博上又做了简单的进一步沟通，@林承仁：“Jeff怎么想还真不知道，只是他那种以客户为导向的坚持基本上已经灌输到每个员工心上去了。对长期利益的追求跟坚持到后面都慢慢体现出来了，公司内部有个很出名的 process：start from the customer, work backward&#8230;” 另外，关于首页上“促销和推荐”的部分，前当当网技术总监 @wanght1979 也透露了一个情况：“当当图书首页的推荐模块位于第二屏，但点击量是该页面最高的”。 平衡短期利益与长期利益，数据导向，这两点我相信绝大多数管理者都知道，但这些真正的是知易行难。 ResysChina 上有一个 [...]<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: BookMatcher" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/03/16434590.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: BookMatcher</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: boy-am-i-hard-to-please" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/20/17678174.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: boy-am-i-hard-to-please</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Amazon：关于这个网络零售巨头您所不知道的10个秘密" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F62854.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/07/24/18778681.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon：关于这个网络零售巨头您所不知道的10个秘密 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="2011年Amazon的13个里程碑事件" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F68685.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/12/21/12816987.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">2011年Amazon的13个里程碑事件 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="案例：Amazon是怎样用一项小策略创造了27亿美元的？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F4146.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/08/5243608.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">案例：Amazon是怎样用一项小策略创造了27亿美元的？ (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐系统：Amazon、Pandora 和 Del.icio.us" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2007%2Famazon_pandora_delicious.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐系统：Amazon、Pandora 和 Del.icio.us (@guwendong)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon 的早与晚" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F20249.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/08/5241662.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon 的早与晚 (@alibuybuy)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<div>
<p>我对个性化推荐技术的关注，是从 Amazon 开始的，因此，但凡是与 Amazon 个性化推荐相关的内容，我都比较有兴趣。知乎上有一个问题，《<a href="http://www.zhihu.com/question/19899603">为什么 Amazon 的首页很少有促销？</a>》，聚集了不少有益的回答。下面这个是<a href="http://www.zhihu.com/question/19899603/answer/13296772">我的回答</a>，</p>
<blockquote><p>世界的进步靠的是异类，美国线下商场也搞促销，Amazon 也一样可以依靠促销策略。在还没有改变世界之前就真心相信自己可以改变世界的人，要么是傻逼要么是真牛逼。Amazon 的 CEO 是杰夫·贝索斯，换个人不一定这样。keso 说的牛逼推荐算法是结果，长尾理论的兴起也是结果，在别人都不明白怎么回事儿的情况下，贝索斯一直坚持在做，这个是 Amazon 成为推荐系统之王以及可以引领云计算潮流的原因。</p>
<p>另外，大环境及所处的阶段也很重要。开始的时候，其实美国人也不看好 Amazon，亏了那么多年钱，所以相对的竞争不像现在国内电商这么惨烈，凡夫俗子们给了贝索斯足够的时间去建立竞争壁垒。年初的时候，淘宝赞助 resyschina 搞<a href="http://www.guwendong.com/post/2011/resysforum-org.html">推荐系统大会</a>， 和淘宝朋友的交流发现，淘宝不可谓不重视技术。但如果你看看去年光棍节淘宝大促当日的成交金额，你会发现，你投入几百个工程师花很长时间搞出来的东西，在 这些面前不值一晒，更何况，竞争对手中还有大票的普通青年一个劲儿地在搞秒杀和限时抢购，你会怎么选择？长远收益在眼前利益与竞争压力之前，没几个人可以扛得住。</p>
<p>最后，回答这个问题，“为什么 Amazon 的首页很少有促销活动”？贝索斯曾经说，“我们不是通过卖东西赚钱，而是通过帮助消费者做出更好的购买决策而赚钱”。Amazon，它和我们不在一个时代。</p></blockquote>
<p>我没在 Amazon 工作过，上面的回答全凭揣测，说是 YY 也行。。。上周，前 Amazon 推荐引擎小组成员，目前个性化阅读社区<a href="http://www.wumii.com/">无觅网</a>的创始人 <a href="http://weibo.com/trumanlam">@林承仁</a> ，“从内部人员的角度补充了<a href="http://www.zhihu.com/question/19899603/answer/13783949">两点</a>”：</p>
<blockquote><p>1. 首页的空间这么有限，每个下面的部门都想把自己做的内容放上去（一旦霸占了首页的一块可能整年的目标就达到了），那么到底该显示哪些内容呢？Amazon 里面有一套机制叫King of The Hill Campaign，粗略的意思就是你们谁都有机会，但谁带来的价值越大展示的机会就越多。放到首页的内容当然也包括促销活动（比如Gold Box等），但其表现往往比推荐引擎差得多。</p>
<p>2. 我们在内部开发推荐引擎的目标是在serendipity（惊喜、发现），不单是让用户买更多的商品，而是让用户发现更多他感兴趣但他不知道的商品。这两者有很大的区别。</p>
<p>结论：为什么Amazon的首页很少有促销活动？<strong>数据导向</strong>。</p></blockquote>
<p>后续我们在微博上又做了简单的进一步沟通，@林承仁：“Jeff怎么想还真不知道，只是他那种以客户为导向的坚持基本上已经灌输到每个员工心上去了。对长期利益的追求跟坚持到后面都慢慢体现出来了，公司内部有个很出名的 process：start from the customer, work backward&#8230;”</p>
<p>另外，关于首页上“促销和推荐”的部分，前当当网技术总监 <a href="http://weibo.com/wanght1979">@wanght1979</a> 也透露了<a href="http://weibo.com/1641544424/y3w0e8Zt7">一个情况</a>：“当当图书首页的推荐模块位于第二屏，但点击量是该页面最高的”。</p>
<p>平衡短期利益与长期利益，数据导向，这两点我相信绝大多数管理者都知道，但这些真正的是知易行难。</p>
<p>ResysChina 上有一个<a href="http://www.resyschina.com/category/amazon"> Amazon 专栏</a>，其中有些是 Greg Linden 的《<a href="http://glinden.blogspot.com/2006/05/early-amazon-end.html">Early Amazon</a>》系列文章的中文翻译，我起了个头翻译了前两篇，后面还有十几篇没有完成。Greg Linden 这系列文章写得非常精彩，我在这里邀请有<a href="http://book.douban.com/subject/7007666/">盈余</a>的朋友和我一起，把剩下的部分翻译完成。有兴趣的朋友可以把你想翻译的那一篇发个邮件给我 <a href="http://wendell.gu#gmail.com">wendell.gu#gmail.com</a> ，我来统一协调。非常感谢！</p>
</div>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Early Amazon: Group discounts" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F10%2Fearly-amazon-group-discounts.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802327.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: Group discounts</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon的个性化推荐" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Famazon-recsys-functions.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/14/12249021.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon的个性化推荐</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: BookMatcher" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-bookmatcher.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/03/16434590.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: BookMatcher</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: boy-am-i-hard-to-please" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2012%2F03%2Fearly-amazon-boy-am-i-hard-to-please.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2012/03/20/17678174.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: boy-am-i-hard-to-please</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Amazon：关于这个网络零售巨头您所不知道的10个秘密" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F62854.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/07/24/18778681.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon：关于这个网络零售巨头您所不知道的10个秘密 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="2011年Amazon的13个里程碑事件" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F68685.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/12/21/12816987.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">2011年Amazon的13个里程碑事件 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="案例：Amazon是怎样用一项小策略创造了27亿美元的？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F4146.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/08/5243608.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">案例：Amazon是怎样用一项小策略创造了27亿美元的？ (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐系统：Amazon、Pandora 和 Del.icio.us" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.guwendong.com%2Fpost%2F2007%2Famazon_pandora_delicious.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐系统：Amazon、Pandora 和 Del.icio.us (@guwendong)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Amazon 的早与晚" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F20249.html&from=http%3A%2F%2Fwww.resyschina.com%2F2012%2F02%2Famazon-data-oriented.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/08/5241662.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Amazon 的早与晚 (@alibuybuy)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2012/02/amazon-data-oriented.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>简网发布“指阅”，打造中国版Zite</title>
		<link>http://www.resyschina.com/2011/10/zhiyue.html</link>
		<comments>http://www.resyschina.com/2011/10/zhiyue.html#comments</comments>
		<pubDate>Wed, 26 Oct 2011 13:58:59 +0000</pubDate>
		<dc:creator>xlvector</dc:creator>
				<category><![CDATA[产品评论]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=609</guid>
		<description><![CDATA[个性化阅读类的应用一直是一个火热的创业方向，这个领域已经先后出现了 Taptu、Pulse、Zite 等红极一时的应用。两个月前，一直火热的Zite 才被CNN 以2000万美元的价格收购了。北京创业公司简网（Cutt.com）也很看好这个方向，于10月10日在App Store（下载地址）发布了类似Zite的个性化阅读应用“指阅”。 指阅的目标是新一代的资讯发现与阅读应用，致力于打造懂你口味的阅读体验。指阅将互联网上的繁多资讯收集梳理，以『话题』的形式重新组织成“微媒体”，帮助你方便快速地发现你感兴趣的内容。 首次打开指阅后，应用会邀请绑定你的微博帐号，经过你的授权，指阅的服务器后台就会分析你近期发布的微博内容，得到你可能感兴趣的主题。同时，绑定微博之后你可以把你发现的好内容分享给你的好友。 在进入文章阅读后，你可以随时完善文章来源和主题，应用会记录你在阅读过程中的每个操作，喜欢与否、来源选择、话题偏好、分享互动。你的这些反馈，可以让指阅更加完善地了解你的阅读喜好。这些体验流程和 Zite 都基本类似。 通过语义分析与个性化推荐技术，指阅试图为你打造个人专属的信息过滤与发现引擎。这些服务器端的后台技术，简网团队内部称之为“信息基因技术”，此前已经推出的产品 cutt.com、机器维护的微博帐号以及指阅都采用了相应的技术。 简网创始人丁钧，1997年进入微软工作直到2006年，离开前是微软顾问咨询部的总监。目前公司还没有开始盈利，不过对于未来的商业模式，简网团队并不担心。他们希望通过对人们兴趣信息的捕捉分析，甚至是某一区域、某一年龄、某一族落的人群的精准定位，为用户提供一系列的个性化服务。不过目前阶段最重要的事情，还是专心打造好一款对用户有用的产品，欢迎大家下载试用（下载地址）。 首发：http://www.36kr.com/p/52181.html<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="2011推荐系统论坛·官网发布" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F03%2Fresysforum-org.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">2011推荐系统论坛·官网发布</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Hulu发布新的推荐系统页面" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Fhulu_recommendation_hub.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Hulu发布新的推荐系统页面</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Twitter联合创始人：个性化服务是最大挑战" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fpersonalized-twitter.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802342.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Twitter联合创始人：个性化服务是最大挑战</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="电影推荐引擎Jinni获500万美金投资" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fjinni-500w.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/06/1830945.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">电影推荐引擎Jinni获500万美金投资</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="推荐结果是不是应该变一变？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Ftemporal_diversity.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/12/5485123.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">推荐结果是不是应该变一变？</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="中国移动手机冲浪浏览器正式发布" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F58234.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/11/5426536.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">中国移动手机冲浪浏览器正式发布 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="什么是灰度发布？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F67127.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">什么是灰度发布？ (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="“.中国”域名全国公测启动 附公测地址" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2F5475.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">“.中国”域名全国公测启动 附公测地址 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Comsenz 核心产品 Discuz!7.0.0 测试版发布" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F2444.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Comsenz 核心产品 Discuz!7.0.0 测试版发布 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="谷歌中国低调发布购物搜索 测试版全面上线" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F6887.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">谷歌中国低调发布购物搜索 测试版全面上线 (@alibuybuy)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p>个性化阅读类的应用一直是一个火热的创业方向，这个领域已经先后出现了 Taptu、Pulse、Zite 等红极一时的应用。两个月前，一直火热的<a href="http://www.36kr.com/p/43980.html">Zite 才被CNN 以2000万美元的价格收购</a>了。北京创业公司简网（Cutt.com）也很看好这个方向，于10月10日在App Store（<a href="http://itunes.apple.com/us/app/id450737500?mt=8">下载地址</a>）发布了类似Zite的个性化阅读应用“指阅”。</p>
<p><a href="http://www.resyschina.com/wp-content/uploads/2011/10/mzl.rynefles.480x480-75.jpg"><img class="alignnone size-full wp-image-610" title="mzl.rynefles.480x480-75" src="http://www.resyschina.com/wp-content/uploads/2011/10/mzl.rynefles.480x480-75.jpg" alt="" width="480" height="360" /></a></p>
<p>指阅的目标是新一代的资讯发现与阅读应用，致力于打造懂你口味的阅读体验。指阅将互联网上的繁多资讯收集梳理，以『话题』的形式重新组织成“微媒体”，帮助你方便快速地发现你感兴趣的内容。</p>
<p><a href="http://www.resyschina.com/wp-content/uploads/2011/10/100APPLE_IMG_0247.png"><img class="alignnone size-full wp-image-611" title="100APPLE_IMG_0247" src="http://www.resyschina.com/wp-content/uploads/2011/10/100APPLE_IMG_0247.png" alt="" width="717" height="538" /></a></p>
<p>首次打开指阅后，应用会邀请绑定你的微博帐号，经过你的授权，指阅的服务器后台就会分析你近期发布的微博内容，得到你可能感兴趣的主题。同时，绑定微博之后你可以把你发现的好内容分享给你的好友。</p>
<p>在进入文章阅读后，你可以随时完善文章来源和主题，应用会记录你在阅读过程中的每个操作，喜欢与否、来源选择、话题偏好、分享互动。你的这些反馈，可以让指阅更加完善地了解你的阅读喜好。这些体验流程和 Zite 都基本类似。</p>
<p><a href="http://www.resyschina.com/wp-content/uploads/2011/10/mzl.aqieyrqt.480x480-75.jpg"><img class="alignnone size-full wp-image-612" title="mzl.aqieyrqt.480x480-75" src="http://www.resyschina.com/wp-content/uploads/2011/10/mzl.aqieyrqt.480x480-75.jpg" alt="" width="480" height="360" /></a></p>
<p>通过语义分析与个性化推荐技术，指阅试图为你打造个人专属的信息过滤与发现引擎。这些服务器端的后台技术，简网团队内部称之为“信息基因技术”，此前已经推出的产品 <a href="http://cutt.com/">cutt.com</a>、机器维护的微博帐号以及指阅都采用了相应的技术。</p>
<p>简网创始人<a href="http://weibo.com/1650243841">丁钧</a>，1997年进入微软工作直到2006年，离开前是微软顾问咨询部的总监。目前公司还没有开始盈利，不过对于未来的商业模式，简网团队并不担心。他们希望通过对人们兴趣信息的捕捉分析，甚至是某一区域、某一年龄、某一族落的人群的精准定位，为用户提供一系列的个性化服务。不过目前阶段最重要的事情，还是专心打造好一款对用户有用的产品，欢迎大家下载试用（<a href="http://itunes.apple.com/us/app/id450737500?mt=8">下载地址</a>）。</p>
<p>首发：<a href="http://www.36kr.com/p/52181.html">http://www.36kr.com/p/52181.html</a></p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="2011推荐系统论坛·官网发布" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F03%2Fresysforum-org.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">2011推荐系统论坛·官网发布</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Hulu发布新的推荐系统页面" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F02%2Fhulu_recommendation_hub.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Hulu发布新的推荐系统页面</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Early Amazon: The first week" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F03%2Fearly-amazon-1.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/12/30380540.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Early Amazon: The first week</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Twitter联合创始人：个性化服务是最大挑战" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fpersonalized-twitter.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802342.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Twitter联合创始人：个性化服务是最大挑战</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="[转载] APP汇：基于新浪微博的App分享推荐平台" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F06%2F%25E8%25BD%25AC%25E8%25BD%25BD-app%25E6%25B1%2587%25EF%25BC%259A%25E5%259F%25BA%25E4%25BA%258E%25E6%2596%25B0%25E6%25B5%25AA%25E5%25BE%25AE%25E5%258D%259A%25E7%259A%2584app%25E5%2588%2586%25E4%25BA%25AB%25E6%258E%25A8%25E8%258D%2590%25E5%25B9%25B3%25E5%258F%25B0.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/06/02/10705779.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">[转载] APP汇：基于新浪微博的App分享推荐平台</font>
                    </a>
                </td>
        </tr>
        <br/>
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">来自无觅网络的相关文章：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="中国移动手机冲浪浏览器正式发布" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F58234.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/11/5426536.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">中国移动手机冲浪浏览器正式发布 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="什么是灰度发布？" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F67127.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">什么是灰度发布？ (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="“.中国”域名全国公测启动 附公测地址" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2F5475.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">“.中国”域名全国公测启动 附公测地址 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="Comsenz 核心产品 Discuz!7.0.0 测试版发布" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F2444.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Comsenz 核心产品 Discuz!7.0.0 测试版发布 (@alibuybuy)</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="谷歌中国低调发布购物搜索 测试版全面上线" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.alibuybuy.com%2Fposts%2F6887.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F10%2Fzhiyue.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">谷歌中国低调发布购物搜索 测试版全面上线 (@alibuybuy)</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2011/10/zhiyue.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>推荐系统开源工具 – SVDFeature</title>
		<link>http://www.resyschina.com/2011/09/%e6%8e%a8%e8%8d%90%e7%b3%bb%e7%bb%9f%e5%bc%80%e6%ba%90%e5%b7%a5%e5%85%b7-%e2%80%93-svdfeature.html</link>
		<comments>http://www.resyschina.com/2011/09/%e6%8e%a8%e8%8d%90%e7%b3%bb%e7%bb%9f%e5%bc%80%e6%ba%90%e5%b7%a5%e5%85%b7-%e2%80%93-svdfeature.html#comments</comments>
		<pubDate>Thu, 22 Sep 2011 17:52:36 +0000</pubDate>
		<dc:creator>wnzhang</dc:creator>
				<category><![CDATA[产品评论]]></category>

		<guid isPermaLink="false">http://www.resyschina.com/?p=596</guid>
		<description><![CDATA[SVDFeature是我们（上海交大Apex实验室）在参加KDDCUP 2011期间开发的。通过这个工具，我们和港科大（HKUST）的联合小组InnerPeace在KDDCUP 2011中获得Track 1第三名，并创造单模型最好成绩。在此分享给大家，并希望和大家有更多的交流。工具总体感觉是： （1）基于feature的可扩展性 —— SVDFeature实现了我们的基础模型feature-based matrix factorization。有了这个工具之后，CF算法部分被封装，设计新模型只需要关注如何生成特征，不需要修改任何训练代码。团队成员能够集中精力在各种信息的利用上面。 （2）大规模数据的可适应性 —— SVDFeature可以使用较少的内存去做大规模的实验。这主要是我们不需要将所有训练数据都读入内存。同时为了提高效率，我们使用了两个独立线程分别负责从硬盘读取训练数据和更新权重。最终我们能用2G内存花一天不到的时间在KDDCUP 2011 Track 1上跑出RMSE=22.16的单模型结果。 推荐系统不需要用户输入查询词，而是根据用户的历史行为判断用户的喜好，进而推荐出用户可能喜欢的物品。这是它和和搜索引擎的一大区别。那么既然没有明确的用户喜好，推荐系统就需要抓取尽可能多的，有价值的信息来捕捉用户的喜好。 近年来，一些推荐系统的比赛开始将重点转至信息的挖掘与利用上面。Yahoo! Music Recommendation (KDDCUP 2011)包括了用户收听音乐的时间，音乐的专辑，艺术家，类别。Context-aware Movie Recommendation (CAMRa 2010-2011)在此基础之上更强调了context信息，例如用户所在的家庭，收看电影时的心情。从比赛的结果来看，充分利用这些信息能使推荐系统的效果得到很大的提高。 这里就会有一个问题了：面对不同的Task，不同的数据集，我们需要抽取不同的信息将其加入到推荐系统之中。例如Yahoo! Music Recommendation可以加入音乐的专辑，艺术家，类别；Context-aware Movie Recommendation可以加入用户的家庭，心情等。相信实现过多个推荐系统的朋友都知道这是一件麻烦的事情。有没有一个很好的框架能概括这种信息融入的形式呢？ 我们找到了feature-based matrix factorization这个模型。它在经典的matrix factorization的基础之上，融入了feature的概念。预测公式可以写成如下形式： ﻿﻿ 简单地解释下，等式右边前两项就是feature产生的bias，其中包括了global features (g), user featuers (u)和item features (i)。最后一项是矩阵分解项，不过user/item latent factor也是通过user/item feature进行选取并合成的。这样，share相同user/item feature的users/items就会share对应的user/item latent factor以及feature bias。为了更好的解释，这里加一幅示意图： 通过feature-based matrix factorization，我们就能通过将我们需要的信息转化成feature的形式加入到推荐系统中，去帮助系统更好的学习出用户的喜好。我们写了SVDFeature这样一个toolkit来专门做feature-based matrix factorization。这样一来，开发人员只需要将手里拥有的数据转化成预先定好的格式，其他的事情就是Feature-based [...]<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Reculike : 开源论文推荐系统" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2Freculike.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/03/27412342.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Reculike : 开源论文推荐系统</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="转载：推荐系统应用研究：音乐电台" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2F%25E8%25BD%25AC%25E8%25BD%25BD%25EF%25BC%259A%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BA%2594%25E7%2594%25A8%25E7%25A0%2594%25E7%25A9%25B6%25EF%25BC%259A%25E9%259F%25B3%25E4%25B9%2590%25E7%2594%25B5%25E5%258F%25B0.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/22/8584401.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">转载：推荐系统应用研究：音乐电台</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="提问的智慧：利用决策树进行推荐系统新用户引导" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F03%2Fusing_decision_tree_for_bootstrapping.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/12/5492733.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">提问的智慧：利用决策树进行推荐系统新用户引导</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="互联网趋势与推荐系统的机会（2）" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fweb-trend-and-recsys-2.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">互联网趋势与推荐系统的机会（2）</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="转载：探析数字音乐个性化推荐系统" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fpersonal-music-resys.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802354.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">转载：探析数字音乐个性化推荐系统</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></description>
			<content:encoded><![CDATA[<p><a href="http://apex.sjtu.edu.cn/apex_wiki/svdfeature">SVDFeature</a>是我们（上海交大Apex实验室）在参加KDDCUP 2011期间开发的。通过这个工具，我们和港科大（HKUST）的联合小组InnerPeace在KDDCUP 2011中获得Track 1第三名，并创造单模型最好成绩。在此分享给大家，并希望和大家有更多的交流。工具总体感觉是：</p>
<p>（1）基于feature的可扩展性 —— SVDFeature实现了我们的基础模型feature-based matrix factorization。有了这个工具之后，CF算法部分被封装，设计新模型只需要关注如何生成特征，不需要修改任何训练代码。团队成员能够集中精力在各种信息的利用上面。</p>
<p>（2）大规模数据的可适应性 —— SVDFeature可以使用较少的内存去做大规模的实验。这主要是我们不需要将所有训练数据都读入内存。同时为了提高效率，我们使用了两个独立线程分别负责从硬盘读取训练数据和更新权重。最终我们能用2G内存花一天不到的时间在KDDCUP 2011 Track 1上跑出RMSE=22.16的单模型结果。</p>
<p>推荐系统不需要用户输入查询词，而是根据用户的历史行为判断用户的喜好，进而推荐出用户可能喜欢的物品。这是它和和搜索引擎的一大区别。那么既然没有明确的用户喜好，推荐系统就需要抓取尽可能多的，有价值的信息来捕捉用户的喜好。</p>
<p>近年来，一些推荐系统的比赛开始将重点转至信息的挖掘与利用上面。Yahoo! Music Recommendation (KDDCUP 2011)包括了用户收听音乐的时间，音乐的专辑，艺术家，类别。Context-aware Movie Recommendation (CAMRa 2010-2011)在此基础之上更强调了context信息，例如用户所在的家庭，收看电影时的心情。从比赛的结果来看，充分利用这些信息能使推荐系统的效果得到很大的提高。</p>
<p>这里就会有一个问题了：面对不同的Task，不同的数据集，我们需要抽取不同的信息将其加入到推荐系统之中。例如Yahoo! Music Recommendation可以加入音乐的专辑，艺术家，类别；Context-aware Movie Recommendation可以加入用户的家庭，心情等。相信实现过多个推荐系统的朋友都知道这是一件麻烦的事情。有没有一个很好的框架能概括这种信息融入的形式呢？</p>
<p>我们找到了feature-based matrix factorization这个模型。它在经典的matrix factorization的基础之上，融入了feature的概念。预测公式可以写成如下形式：</p>
<p>﻿﻿<a href="http://www.resyschina.com/wp-content/uploads/2011/09/12.png"><img class="alignnone size-full wp-image-602" src="http://www.resyschina.com/wp-content/uploads/2011/09/12.png" alt="" width="724" height="77" /></a></p>
<p>简单地解释下，等式右边前两项就是feature产生的bias，其中包括了global features (g), user featuers (u)和item features (i)。最后一项是矩阵分解项，不过user/item latent factor也是通过user/item feature进行选取并合成的。这样，share相同user/item feature的users/items就会share对应的user/item latent factor以及feature bias。为了更好的解释，这里加一幅示意图：</p>
<p><a href="http://www.resyschina.com/wp-content/uploads/2011/09/2.png"><img class="alignnone size-full wp-image-598" src="http://www.resyschina.com/wp-content/uploads/2011/09/2.png" alt="" width="691" height="248" /></a></p>
<p>通过feature-based matrix factorization，我们就能通过将我们需要的信息转化成feature的形式加入到推荐系统中，去帮助系统更好的学习出用户的喜好。我们写了SVDFeature这样一个toolkit来专门做feature-based matrix factorization。这样一来，开发人员只需要将手里拥有的数据转化成预先定好的格式，其他的事情就是Feature-based matrix factorization了。这就好比用 做文本分类一样。</p>
<p>通过设置user/item/global features，SVDFeature可以实现当今CF领域的大多数有名的算法：temporal dynamics，neighborhood，hierarchy information，implicit/explicit feedback (SVD++)。另外，值得一提的是SVDFeature能够很方便的做pairwise ranking，即是以排序为目标进行的训练，最终推荐给每个user一个list的item。由于有了这一点，我们可以算是没有改一行代码，就又参加了CAMRa 2011 （推荐一个list的movie给每个家庭）。更多详细的信息请见我们的technical report以及toolkit manual。<a href="http://apex.sjtu.edu.cn/apex_wiki/svdfeature">http://apex.sjtu.edu.cn/apex_wiki/svdfeature</a></p>
<p>希望这个开源工具能对大家的研究有所帮助。欢迎大家提供宝贵的意见。我们会继续将SVDFeature做的更好:)</p>
<table class="wumii-related-items" cellspacing="0" cellpadding="3" border="0"  style="clear: both;">
    
    <tr>
        <td colspan="5"><b><font size="-1"  style="display: block !important; padding: 20px 0 5px !important;">您可能也喜欢：</font></b></td>
    </tr>
    
        <tr>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important;">
                    <a target="_blank" title="Reculike : 开源论文推荐系统" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2Freculike.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/03/27412342.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">Reculike : 开源论文推荐系统</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="转载：推荐系统应用研究：音乐电台" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2F%25E8%25BD%25AC%25E8%25BD%25BD%25EF%25BC%259A%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BA%2594%25E7%2594%25A8%25E7%25A0%2594%25E7%25A9%25B6%25EF%25BC%259A%25E9%259F%25B3%25E4%25B9%2590%25E7%2594%25B5%25E5%258F%25B0.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/09/22/8584401.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">转载：推荐系统应用研究：音乐电台</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="提问的智慧：利用决策树进行推荐系统新用户引导" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F03%2Fusing_decision_tree_for_bootstrapping.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/04/12/5492733.png" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">提问的智慧：利用决策树进行推荐系统新用户引导</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="互联网趋势与推荐系统的机会（2）" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2011%2F01%2Fweb-trend-and-recsys-2.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/images/blogWidget/wordpress_default.gif" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">互联网趋势与推荐系统的机会（2）</font>
                    </a>
                </td>
                <td width="102" valign="top" style="padding: 5px !important; margin: 0 !important; border-left: 1px solid #DDDDDD !important;">
                    <a target="_blank" title="转载：探析数字音乐个性化推荐系统" style="text-decoration: none !important; cursor: pointer !important;" href="http://app.wumii.com/ext/redirect?url=http%3A%2F%2Fwww.resyschina.com%2F2010%2F11%2Fpersonal-music-resys.html&from=http%3A%2F%2Fwww.resyschina.com%2F2011%2F09%2F%25E6%258E%25A8%25E8%258D%2590%25E7%25B3%25BB%25E7%25BB%259F%25E5%25BC%2580%25E6%25BA%2590%25E5%25B7%25A5%25E5%2585%25B7-%25E2%2580%2593-svdfeature.html">
                        <img style="margin: 0 !important; padding: 2px !important; border: 1px solid #DDDDDD !important; width: 96px !important; height: 96px !important;" src="http://static.wumii.com/site_images/2011/01/04/1802354.jpg" width="96px" height="96px" /><br />
                        <font size="-1" color="#333333" style="display: block !important; line-height: 15px !important; width: 102px !important; font: 12px/15px arial !important; height: 60px !important; margin: 3px 0 0 0 !important; padding: 0 !important; overflow: hidden !important;">转载：探析数字音乐个性化推荐系统</font>
                    </a>
                </td>
        </tr>
    
    <tr>
        <td colspan="5" align="right">
            <a style="text-decoration: none !important;" href="http://www.wumii.com/widget/relatedItems" target="_blank" title="无觅相关文章插件">
                <font size="-1" color="#bbbbbb" style="display: block !important; font-family: arial !important; padding: 5px 0 !important; font-size: 12px !important; color: #bbb !important;">无觅</font>
            </a>
        </td>
    </tr>
</table>]]></content:encoded>
			<wfw:commentRss>http://www.resyschina.com/2011/09/%e6%8e%a8%e8%8d%90%e7%b3%bb%e7%bb%9f%e5%bc%80%e6%ba%90%e5%b7%a5%e5%85%b7-%e2%80%93-svdfeature.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

