Abstract: Words from languages of all kinds are added to the World-Wide Web in an extremely complex and seemingly arbitrary manner. Behind this apparently arbitrary topology, as we show here, there is an order hidden in the word network. Using the Google search engine, we find that the distributions of basic English words and Chinese characters on the web follow a universal power law. The power-law exponent of the rank-ordered frequency distribution is α ~ 0.99 for basic English words and α ~ 0.98 for Chinese characters. Zipf's law and the page-size distribution on the Web are used to explain this phenomenon.
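As an illustration of what a rank-ordered power-law exponent means, the following minimal sketch estimates α in f(r) ∝ r^(-α) by an ordinary least-squares fit of log frequency against log rank. The abstract does not state which fitting procedure the authors used, so the method here is an assumption; the function name `zipf_exponent` is hypothetical, and the input counts are synthetic rather than real search-engine hit counts.

```python
import math


def zipf_exponent(frequencies):
    """Estimate alpha in f(r) ~ r^(-alpha) from a list of frequencies.

    Assumption: a plain least-squares line through (log rank, log frequency).
    This is a common quick estimate, not necessarily the paper's method.
    """
    # Rank the positive frequencies from most to least frequent.
    ranked = sorted((f for f in frequencies if f > 0), reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two positive frequencies")

    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(freq) for freq in ranked]

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Slope of the least-squares line in log-log space.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx

    # The power-law exponent is the negative of the log-log slope.
    return -slope


# Synthetic counts that follow f(r) = C / r exactly (alpha = 1 by construction).
synthetic_counts = [1_000_000 / rank for rank in range(1, 501)]
alpha = zipf_exponent(synthetic_counts)
print(f"estimated alpha = {alpha:.3f}")
```

With real data, the list of counts would be replaced by one measured frequency per word or character. A simple log-log regression can be biased by noisy low-frequency tails, so the fitted value should be treated as a rough estimate.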
WEI Fang-Ping; LI Sheng; MA Hong-Ru. Distribution of Words on the World-Wide Web[J]. Chinese Physics Letters, 2005, 22(3): 762-764.
WEI Fang-Ping, LI Sheng, MA Hong-Ru. Distribution of Words on the World-Wide Web. Chin. Phys. Lett., 2005, 22(3): 762-764.