1. 论坛系统升级为Xenforo,欢迎大家测试!
    Dismiss Notice

百度搜索引擎中文分词的三点原理

Discussion in 'SEO 专区' started by 萧萧服务, Mar 21, 2011.

  1. 萧萧服务

    萧萧服务 New Member

    Joined:
    Mar 21, 2011
    Messages:
    12
    Likes Received:
    0
    百度中文分词算法:指搜索引擎为了更好的辨别用户的需求,并且为了快速提供给用户需求性信息而使用的算法。
      搜索引擎要在单位时间内处理千万亿级的页面数据量,因此搜索引擎拥有一个中文词库。比如百度现在大约有9万个中文词,那么搜索引擎就可以对千亿级的页面进行分析,按照中文词库进行了分类。

      百度分词基本有三种分法

      1、基于理解:傻瓜式匹配,小于等于3个中文字符百度是不进行切词的,比如搜索“大学堂”。
     
  2. 0309a

    0309a New Member

    Joined:
    Mar 9, 2011
    Messages:
    521
    Likes Received:
    0
    学习一下,谢谢分享
     
  3. 3906933

    3906933 New Member

    Joined:
    Mar 14, 2011
    Messages:
    66
    Likes Received:
    0
    这个很好啊!学习了!
     
  4. rqblmy

    rqblmy New Member

    Joined:
    Oct 2, 2010
    Messages:
    502
    Likes Received:
    0
    进来看一下。
     
  5. q84816977

    q84816977 New Member

    Joined:
    Mar 10, 2011
    Messages:
    23
    Likes Received:
    0
    这个很好阿。 支持
     
  6. 丫头

    丫头 New Member

    Joined:
    Mar 22, 2011
    Messages:
    40
    Likes Received:
    0
    学习了....
     
  7. 173782322

    173782322 New Member

    Joined:
    Sep 4, 2010
    Messages:
    271
    Likes Received:
    0
    学习一下,
     
  8. qdsjzh

    qdsjzh New Member

    Joined:
    Mar 22, 2011
    Messages:
    51
    Likes Received:
    0
    学习了~感谢分享