[Figure: mapping between web pages (webpage1, webpage2, …, webpageN) and features (feature1, feature2, …, featureN)]

WebGather: technologies in retriever subsystem (3/4)
• Traditional IR (VSM)
• Query cache, hot click
• Cut words (word segmentation)
• Anchor text, link popularity

WebGather: technologies in user behavior subsystem (4/4)
• Link popularity
• Replica popularity
• User popularity

Conclusion:
• Search engines are becoming more and more important.
• The Web is a good experimental object; we can do a lot of R&D on it.
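As an illustration of the page/feature mapping and the traditional IR (VSM) item on the retriever subsystem slide, here is a minimal sketch of a vector space model over a tiny inverted index. It is not WebGather's implementation: the function names (`build_index`, `cosine`, `search`), the raw term-count weighting, the whitespace tokenizer, and the sample pages are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def build_index(pages):
    """Build a forward index (page -> term counts) and an inverted index (term -> pages)."""
    # Assumption: whitespace tokenization; a real system would use word segmentation.
    forward = {name: Counter(text.lower().split()) for name, text in pages.items()}
    inverted = defaultdict(set)
    for name, counts in forward.items():
        for term in counts:
            inverted[term].add(name)
    return forward, inverted


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, forward, inverted):
    """Rank the pages that share at least one term with the query."""
    q = Counter(query.lower().split())
    candidates = set()
    for term in q:
        # The inverted index limits scoring to pages containing a query term.
        candidates |= inverted.get(term, set())
    scored = [(cosine(q, forward[name]), name) for name in candidates]
    # Highest score first; ties broken by page name for a stable order.
    return [name for score, name in sorted(scored, key=lambda s: (-s[0], s[1]))]


# Hypothetical sample pages, used only to exercise the sketch.
pages = {
    "webpage1": "search engine index",
    "webpage2": "web crawler gathers pages",
    "webpage3": "search engine ranking uses link popularity",
}
forward, inverted = build_index(pages)
results = search("search engine", forward, inverted)
print(results)
```

Raw term counts keep the sketch short. Signals such as link popularity or user popularity, listed on the slides, could be combined with the cosine score, but how WebGather weights them is not stated here.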