Last updated: 23.05.2020

RDS-Q No. #3

Quran Statistics (Chapter Level)

Reproducible Data Science - Quran | quran.telematika.org

Home GitHub PDF

Overview

The Holy Book Quran consists of 114 Chapters (Surah) and totally 6236 Verses (Ayah). The Quran was orally revealed over a period of 23 years and written for the first time at the time of Khulafaur Rasyidin. In the most printed editions, the Quran consists of 604 pages, which furtherly organized into parts, notably a so-called Juz. There is 30 Juz in total.

For the results presented in this document, the Quran text is based on the Uthmani version published by Tanzil project (http://tanzil.net/). Based on this text, the Quran is composed of 77430 words and 325666 letters. For comparison, based on data published by corpus.quran.com (Kais Dukes, University of Leeds) the figures are 77429 words and 623638 (join letters and diacritics/harakah).

This document mainly presents some numbers and figures with minimal narration since it is meant to be a quick reference for supporting further research in some aspects of The Noble Quran. Data used in RDS-Q #3 is at chapter level (length of 114) i.e. data related to verses, words, and letters are pre-processed for the whole chapter.

All Chapters - C,V,C+V

The following two Figures show number of verses (V), chapter number (C) and the addition of C+V. The first Figure depicts linear plot of V and C+V over C. The corresponding density functions are plotted in the second Figure.

... ...

The following Table gives some statistical values for these parameters.

max 75% 50% 25% min mean std count sum
Chapter No. (C) 114 85.75 57.5 29.25 1 57.5 33.05 114 6555
#Verses (V) 286 77.75 39.0 15.5 3 54.7 53.21 114 6236
C+V 288 120.0 110.0 95.5 8 112.2 35.74 114 12791

All Chapters - W,L

The following two Figures show number of words (W) and number of letters (L). The first Figure depicts linear plot of W and L over C. The corresponding density functions are plotted in the second Figure.

... ...

The following Table gives some statistical values for W and L.

max 75% 50% 25% min mean std count sum
Num. of Words 6116 877.5 344.0 92.5 10 679.21 931.18 114 77430
Num. of Letters 25865 3750.0 1436.0 384.25 42 2856.72 3933.94 114 325666

Split [1-57] & [58-114]

For some investigations we might want to look at a segment of the data. The following Figure shows density functions of C,V and C+V for the segment [1-57] and [58-114].

...

The following Table gives some statistical values for C,V and C+V for segmen [1-57].

max 75% 50% 25% min mean std count sum
Chapter No. (C) 57 43.0 29.0 15.0 1 29.0 16.6 57 1653
#Verses (V) 286 111.0 78.0 53.0 7 89.54 55.03 57 5104
C+V 288 133.0 114.0 88.0 8 118.54 47.8 57 6757

The following Table gives some statistical values for C,V and C+V for segmen [58-114].

max 75% 50% 25% min mean std count sum
Chapter No. (C) 114 100.0 86.0 72.0 58 86.0 16.6 57 4902
#Verses (V) 56 28.0 17.0 8.0 3 19.86 14.63 57 1132
C+V 130 115.0 110.0 102.0 73 105.86 14.53 57 6034

Odd/Even - Chapters & Verses

For symmetry investigations we might want to look at a segment of the data which is based on certain criteria, in this case: odd/even C+V criteria as reported in RDS-Q #1 and RDS-Q #2. The following Figure shows density functions of C,V and C+V for the odd and even segment.


...

The following Table gives some statistical values for C,V and C+V for odd segmen.

max 75% 50% 25% min mean std count sum
Chapter No. (C) 110 82.0 55.0 37.0 3 57.05 30.39 57 3252
#Verses (V) 227 78.0 38.0 19.0 3 57.95 54.94 57 3303
C+V 253 121.0 111.0 95.0 65 115.0 36.4 57 6555

The following Table gives some statistical values for C,V and C+V for even segmen.

max 75% 50% 25% min mean std count sum
Chapter No. (C) 114 91.0 58.0 27.0 1 57.95 35.78 57 3303
#Verses (V) 286 77.0 40.0 11.0 3 51.46 51.71 57 2933
C+V 288 120.0 110.0 98.0 8 109.4 35.16 57 6236

Odd/Even - Words & Letters

The following Figure shows density functions of W and L for the odd and even segment.

...

The following Table gives some statistical values for W and L for odd segmen.

max 75% 50% 25% min mean std count sum
Num. of Words 3481 860.0 346.0 164.0 10 651.44 828.91 57 37132
Num. of Letters 14738 3469.0 1453.0 667.0 42 2735.21 3486.34 57 155907

The following Table gives some statistical values for W and L for even segmen.

max 75% 50% 25% min mean std count sum
Num. of Words 6116 893.0 300.0 54.0 14 706.98 1030.06 57 40298
Num. of Letters 25865 3818.0 1263.0 250.0 47 2978.23 4363.89 57 169759

Resources

URL
Data https://github.com/eueung/rds-q/tree/master/data
PDF https://github.com/eueung/rds-q/tree/master/PDF
Project (All) https://github.com/eueung/rds-q/
Web https://quran.telematika.org/00003/quran-statistics-114.html
Web (All) https://quran.telematika.org/

Sample Data

ch_no t_ve t_w_nb t_c_nb cavoe cav
1 7 29 139 even 8
2 286 6116 25865 even 288
3 200 3481 14738 odd 203
4 176 3747 16063 even 180
5 120 2804 12015 odd 125
6 165 3050 12554 odd 171
7 206 3320 14213 odd 213
8 75 1233 5342 odd 83
9 129 2498 10940 even 138
10 109 1833 7506 odd 119
11 123 1917 7714 even 134
12 111 1777 7193 odd 123
13 43 853 3495 even 56
14 52 830 3494 even 66
15 99 655 2826 even 114
16 128 1844 7709 even 144
17 111 1556 6555 even 128
18 110 1579 6480 even 128
19 98 961 3861 odd 117
20 135 1335 5329 odd 155
21 112 1169 4969 odd 133
22 78 1274 5232 even 100
23 118 1050 4390 odd 141
24 64 1316 5655 even 88