Statistics per Wikibook

      

AF - Afrikaans

Wikibooks rankings and chapter lists

The algorithm used for deriving book and section titles from chapter (article) titles is described below.

17 books ordered by size in bytes
4.3 k  Duits
3.9 k  Tuisblad
2.5 k  Sir Arthur Conan Doyle
2.4 k  Geloofsbelydenis
1.7 k  HTML
1.3 k  Nederlands
1.1 k  Hoe om
1.1 k  Ongeklassifiseerde
  611  Dieetgrappe
  320  Sterrekunde
  201  Wikibooks
   86  Kookboek
   54  Esperanto
   44  Lekker
       Boekrak
       Wiki

17 books ordered by number of edits
42  Tuisblad
12  Duits
11  Esperanto
 7  Nederlands
 6  HTML
 5  Ongeklassifiseerde
 4  Hoe om
 4  Dieetgrappe
 4  Sterrekunde
 4  Wikibooks
 4  Lekker
 2  Geloofsbelydenis
 2  Kookboek
 2  Wiki
 1  Sir Arthur Conan Doyle
 1  Boekrak

17 books ordered by number of registered authors
7  Tuisblad
4  Duits
4  HTML
3  Sterrekunde
2  Esperanto
2  Nederlands
2  Hoe om
2  Dieetgrappe
2  Wikibooks
2  Lekker
2  Geloofsbelydenis
1  Ongeklassifiseerde
1  Kookboek
1  Wiki
1  Sir Arthur Conan Doyle
   Boekrak

17 books ordered by number of chapters
14  Tuisblad
 7  Duits
 2  Nederlands
 2  Hoe om
 1  HTML
 1  Sterrekunde
 1  Esperanto
 1  Dieetgrappe
 1  Wikibooks
 1  Lekker
 1  Geloofsbelydenis
 1  Ongeklassifiseerde
 1  Kookboek
 1  Wiki
 1  Sir Arthur Conan Doyle
 1  Boekrak





Legend

Section title
[Aaaa] = book section when article with same title does exist
[Aaaa] = book section when article with same title does not exist

Chapter size
Chapter titles are styled by chapter size in bytes, in three classes: Xaaa (more than 2000), Yaaa (500 to 2000), Zaaa (less than 500).



Tuisblad

14 chapters, 42 edits, size 333 bytes, 543 words, 7 registered authors

[Boekrakke]   -Geesteswetenskappe, Kuns en Kultuur / -Kontinente, Lande, Organisasies / -Natuur en Tegnologie / -Vermaak en Ontspanning / -Wikijunior
[Styl]   -Boks / -Opskrif

Aanbevole boeke / Boek van die maand / Inleiding / Innerlik / Susterprojekte


Duits

7 chapters, 12 edits, size 476 bytes, 608 words, 4 registered authors

[Grammatika]   -Geslag

Bylae/Lys van onreëlmatige werkwoorde / Die Duitse taal / Duitsland / Inhoud


Hoe om

2 chapters, 4 edits, size 1.0 kB, 187 words

'n atlas te gebruik / te gebruik


Nederlands

2 chapters, 7 edits, size 825 bytes, 237 words, 2 registered authors

Les 1



Remainder

Books that seemingly are not divided into chapters
 
B  Boekrak  
D  Dieetgrappe  
E  Esperanto  
G  Geloofsbelydenis  
H  HTML  
K  Kookboek  
L  Lekker  
O  Ongeklassifiseerde  
S  Sir Arthur Conan Doyle / Sterrekunde  
W  Wikibooks / Wiki






Algorithm

The algorithm used to detect book titles is roughly this:

On a first pass through the input, article titles are scanned for candidate book and chapter names as follows:
Find the first colon, forward slash and hyphen. Whichever of these comes first marks the division between book title and chapter title.
If none of these is found, treat the text between brackets as the chapter title and the rest as the book title.
If no brackets are found and the article title ends with one or more digits, assume this is a numbered chapter.
On a second pass, book titles that occurred fewer than three times are marked as possible false positives.
Article titles that exactly match a candidate book name occurring more than twice are added to the selection
(these are introductory pages, first chapters, or whatever you want to call them).
Counts are then collected.
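
The two passes above can be sketched in Python. This is only an illustration of the rules as described here, not the actual script: the function names split_title and detect_books, the reading of "brackets" as round parentheses, and the sample titles in the usage note are all assumptions.

```python
import re
from collections import Counter

SEPARATORS = ":/-"  # colon, forward slash, hyphen


def split_title(title):
    """Return (book, chapter) candidates, or (None, None) if no rule applies."""
    # Rule 1: the earliest colon, slash or hyphen divides book from chapter.
    positions = [p for p in (title.find(c) for c in SEPARATORS) if p >= 0]
    if positions:
        cut = min(positions)
        return title[:cut].strip(), title[cut + 1:].strip()
    # Rule 2 (assumption: "brackets" means round parentheses):
    # bracketed text is the chapter, the rest is the book.
    m = re.search(r"\(([^)]*)\)", title)
    if m:
        book = (title[:m.start()] + title[m.end():]).strip()
        return book, m.group(1).strip()
    # Rule 3: trailing digits mark a numbered chapter.
    m = re.search(r"^(.*?)\s*(\d+)$", title)
    if m and m.group(1):
        return m.group(1), m.group(2)
    return None, None


def detect_books(titles):
    """First pass: split titles. Second pass: separate likely false positives."""
    split = {t: split_title(t) for t in titles}
    counts = Counter(book for book, _ in split.values() if book)
    confirmed = {b for b, n in counts.items() if n >= 3}
    suspects = {b for b, n in counts.items() if n < 3}  # possible false positives
    books = {b: [] for b in confirmed}
    for title, (book, chapter) in split.items():
        if book in confirmed:
            books[book].append(chapter)
        elif title in confirmed:
            # Title equals a confirmed book name: an introductory page.
            books[title].append("")
    return books, suspects
```

For example, with the hypothetical titles "Duits", "Duits/Inhoud", "Duits/Duitsland", "Duits/Grammatika/Geslag", "Hoe om: 'n atlas te gebruik" and "Les 1", the sketch keeps "Duits" as a book with four entries (the bare title counted as its introductory page) and flags "Hoe om" and "Les" as possible false positives. What the real script does with flagged titles is not stated here; the sketch simply returns them separately.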

Before the report is written, books are further divided into 'subbooks' and single chapters, using almost the same algorithm as above,
except that subbook titles without a colon, slash, hyphen, bracket or trailing digit are now matched on the longest text
that occurs several times within one book.
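
The "longest text that occurs several times" step is not spelled out further. One possible reading, sketched below, is the longest leading run of whole words shared by at least two chapter titles of a book. The function name longest_repeated_prefix, the whole-word matching, and the threshold of two occurrences are assumptions, not details taken from the script.

```python
from collections import Counter


def longest_repeated_prefix(chapters, min_count=2):
    """Longest leading run of whole words shared by at least min_count chapters.

    Assumption: "several times" is read as min_count=2 and matching is done
    on whole words; the real script may differ on both points.
    """
    counts = Counter()
    for chapter in chapters:
        words = chapter.split()
        # Count every proper word-prefix of this chapter title.
        for n in range(1, len(words)):
            counts[" ".join(words[:n])] += 1
    repeated = [p for p, c in counts.items() if c >= min_count]
    return max(repeated, key=len) if repeated else None
```

With the hypothetical chapter titles "Verbs regular forms", "Verbs regular endings" and "Nouns plural", this returns "Verbs regular" as the subbook candidate. It returns None when no leading text repeats.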

Generated on Tuesday, February 16, 2010 from recent database dump files.
Data processed up to Sunday, January 31, 2010.
Please note that the lengthy dump process (many weeks) means a delay in publishing these statistics is always to be expected.

Script version: 2.5
Author: Erik Zachte (Web site)
Mail: ezachte@### (no spam: ### = wikimedia.org)
For documentation see meta
Scripts: scripts.zip

All data and images on this page are in the public domain.