Distribution of Words on the World-Wide Web

  • The diverse words of all kinds of language are added into the World-Wide Web in an extremely complex and arbitrary manner. Behind the apparent arbitrariness topology, as we show here, there is an order hidden in the word network. By making use of Google search engine, we find that the distributions of the basic English words and Chinese characters on the web follow a universal power law. The power law exponent of rank-ordered frequency distribution is α ~ 0.99 for basic English words and α ~ 0.98 for Chinese characters. The Zipf law and page size distribution on the Web are used to explain the phenomena.
  • Article Text

  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return