Statistics per Wikibook

      

SW - Swahili

Wikibooks rankings and chapter lists

The algorithm used for deriving book and section titles from chapter (article) titles is described below.

11 books ordered by size in bytes
100 kNdugu
41 kWafransisko
30 kWaklara
17 kHistoria
9.4 kKlara
6.8 kMtaguso
430Mwanzo
292Kanisa
145Alfu
 Sala

11 books ordered by number of edits
14Mwanzo
2Wafransisko
2Mtaguso
1Ndugu
1Waklara
1Historia
1Klara
1Kanisa
1Alfu
1Sala

11 books ordered by number of registered authors
7Mwanzo
1Alfu
 Wafransisko
 Mtaguso
 Ndugu
 Waklara
 Historia
 Klara
 Kanisa
 Sala

11 books ordered by number of chapters
2Wafransisko
2Mtaguso
1Mwanzo
1Alfu
1Ndugu
1Waklara
1Historia
1Klara
1Kanisa
1Sala





Legend

Section title
[Aaaa] = book section when article with same title does exist
[Aaaa] = book section when article with same title does not exist

Chapter size
Chapter size in bytes: Xaaa > 2000 ≥ Yaaa ≥ 500 > Zaaa will be shown as: Xaaa / Yaaa / Zaaa / Xaaa / Yaaa / Zaaa

Choose from three display modes (click below at 'Select' to change display mode, changing may take a few seconds on large files)

Select mode "Xaaa / Yaaa / Zaaa"    => font color varies, large chapters are shown in bold type
Select mode   "Xaaa / Yaaa / Zaaa"     => font color and size vary
Select mode   "Xaaa / Yaaa / Zaaa"  => font color, size and weight vary


Wafransisko

2 chapters, 2 edits, size 40 kB, 6410 words, 0 registered authors

Wasekulari


Mtaguso

2 chapters, 2 edits, size 5.7 kB, 1110 words, 0 registered authors

II wa Vatikano



Remainder

Books that seemingly are not divided into chapters
 
A  Alfu  
H  Historia  
K  Kanisa / Klara  
M  Mwanzo  
N  Ndugu  
S  Sala  
W  Waklara



Legend

Section title
[Aaaa] = book section when article with same title does exist
[Aaaa] = book section when article with same title does not exist

Chapter size
Chapter size in bytes: Xaaa > 2000 ≥ Yaaa ≥ 500 > Zaaa will be shown as: Xaaa / Yaaa / Zaaa / Xaaa / Yaaa / Zaaa

Choose from three display modes (click below at 'Select' to change display mode, changing may take a few seconds on large files)

Select mode "Xaaa / Yaaa / Zaaa"    => font color varies, large chapters are shown in bold type
Select mode   "Xaaa / Yaaa / Zaaa"     => font color and size vary
Select mode   "Xaaa / Yaaa / Zaaa"  => font color, size and weight vary



Algorithm

The algorithm used to detect book titles is roughly this:

On a first pass through the input, article titles are scanned for candidate book and chapter names as follows:
Find the first colon, forward slash and hyphen. Whichever of these comes first determines division between book and chapter title.
If none of these are found treat text between brackets as chapter title, rest as book title.
If no brackets are found and the article title ends with one or more digits, assume this is a numbered chapter.
On a second pass book titles that occurred less than three times are marked as possible false positives
Article titles which match exactly a candidate book name that occurs more than twice, are added to the selection
(these are introductory pages, first chapters, or how you want to call them)
Counts are now collected.

Before writing the report, books are further divided into 'subbooks' and single chapters, using almost the same algorithm as above,
except that now subbook titles without colon, slash, hyphen, bracket or trailing digit are matched for the longest text
that occurs several times within one book.

Generated on Tuesday February 16, 2010 from recent database dump files.
Data processed up to Sunday January 31, 2010
Please note that the lengthy dump process (many weeks) means a delay in publishing these statistics is always to be expected.

Script version:2.5
Author:Erik Zachte (Web site)
Mail:ezachte@### (no spam: ### = wikimedia.org)
For documentation see meta
Scripts: scripts.zip

All data and images on this page are in the public domain.