Data for the chart: collection indexing time (min)
Product    TREC-9 collection    HTML 1GB collection
Lucene     3.1                  9.17
Compass    6.31                 5.2
Terrier    3.8                  7.9
Solr       10.7                 3.7
[Chart: indexing time (min) by product (Lucene, Compass, Terrier, Solr); series: TREC collection, HTML 1GB collection, HTML 2GB collection]
Data for the chart: index size as a percentage of collection size (%)
Product    TREC-9 collection    HTML 1GB collection
Lucene     34                   7.8
Compass    165.1                39
Terrier    57.65                14.2
Solr       87.44                8.21
[Chart: collection size / index size (%) by product (Lucene, Compass, Terrier, Solr); series: TREC collection, HTML 1GB collection, HTML 2GB collection]
Index size (MB)
Product    TREC-9 collection    HTML 1GB collection
Lucene     130                  79.9
Compass    645                  399
Terrier    222                  145
Solr       341                  84.1
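The percentages reported above appear to be the index size divided by the collection size. The following is a minimal sketch of that arithmetic for the HTML 1GB column. It assumes the collection is exactly 1024 MB, which the source does not state; `COLLECTION_MB` and `index_percentage` are illustrative names.

```python
# Assumption: the "HTML 1GB" collection is exactly 1024 MB (not stated in the source).
COLLECTION_MB = 1024.0

# Index sizes in MB for the HTML 1GB collection, copied from the table above.
index_size_mb = {"Lucene": 79.9, "Compass": 399.0, "Terrier": 145.0, "Solr": 84.1}


def index_percentage(index_mb: float, collection_mb: float) -> float:
    """Return the index size as a percentage of the collection size."""
    return index_mb / collection_mb * 100.0


percentages = {
    name: index_percentage(mb, COLLECTION_MB) for name, mb in index_size_mb.items()
}

for name, pct in percentages.items():
    print(f"{name}: {pct:.2f} %")
```

Under that assumption the computed values agree, after rounding, with the reported 7.8, 39, 14.2 and 8.21 %.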
[Chart: index size (MB) by product (Lucene, Compass, Terrier, Solr); series: TREC collection, HTML 1GB collection, HTML 2GB collection]
Data for the chart: memory consumption over time
Time (s) Memory consumption (KB) Memory consumption (KB) Memory consumption (KB)
0 0 0 0
30 52.396 5 32.248
60 52.508 10 51.012
90 52.508 15
120 52.532 20
150 52.56 25
180 51.95 30
210
240
270
300
330
360
390
420
450
480
510
540
570
600
630
660
690
720
750
780
810
840
870
900
930
960
HTML 2GB collection: indexing time (min)
Lucene     16.19
Compass    10.52
Terrier    12.6
Solr       7.17
HTML 2GB collection: index size as a percentage of collection size (%)
Lucene     6.6
Compass    33.25
Terrier    8.7
Solr       7
HTML 2GB collection: index size (MB)
Lucene     135
Compass    681
Terrier    178
Solr       143
Memory consumption (KB), HTML1G
0 Lucene Terrier
5
10 0 0
15 59.116 51.38
20 74.496 56.564
25 88.348 59.596
30 92.532 64.552
92.564 64.552
103.056 64.548
103.056 64.584
103.056 64.548
105.072 64.664
105.172 64.676
112.12 64.712
129.368 64.676
128.864 64.976
128.864 64.94
130.516 64.976
64.94
64.976
65.012
930
960
HTML2G
Lucene Terrier
Time (s) Memory consumption (KB) Memory consumption (KB)
0 0 0
30 53.338 40.98
60 65.264 58.134
90 87.92 58.24
120 88.76 58.36
150 91.732 58.34
180 93.188 58.452
210 96.344 58.72
240 98.104 60.84
270 110.728 60.86
300 110.728 60.86
330 110.728 60.824
360 125.796 60.86
390 125.796 62.532
420 140.308 62.532
450 140.308 62.568
480 140.308 62.532
510 140.308 62.568
540 146.464 62.532
570 146.464 62.568
600 146.464 65.52
630 146.464 65.52
660 146.464 65.52
690 147.116 65.5
720 147.116 65.52
750 149.304 65.516
780 65.532
810 65.516
840 65.532
870 65.516
900 65.516
930 66.268
960 66.288
Memory consumption over time, by product (chart axes: Time (s), Memory (MB)). A "-" marks a time with no reading in the source.

TREC-9
Time (s)   Lucene    Compass   Terrier   Solr
0          0         0         0         0
30         52.4      126.7     40.34     69.74
60         52.51     174       51.18     70.03
90         52.51     174.6     57.19     70.16
120        52.532    179.812   64.336    70.4
150        52.56     179.81    67.56     71.4
180        51.95     179.9     72.24     71.5
210        -         179.88    73.548    71.5
240        -         183.1     85.6      76.41
270        -         183.1     91.98     76.41
300        -         183.43    91.98     76.41
330        -         183.43    91.98     76.44
360        -         183.4     91.98     80.27
390        -         -         91.98     80.3
420        -         -         91.98     80.332
450        -         -         92.012    84.772
480        -         -         92.012    84.272
510        -         -         92.01     84.3
540        -         -         92.012    84.304
570        -         -         -         84.336
600        -         -         -         84.46
630        -         -         -         84.46

HTML1G
Time (s)   Lucene    Compass   Terrier   Solr
0          0         0         0         0
30         51.38     94.63     59.12     67.57
60         56.56     94.74     74.5      74.22
90         59.6      94.64     88.35     74.24
120        64.552    94.264    92.532    74.24
150        64.552    93.468    92.564    74.272
180        64.55     95.21     103.1     74.4
210        64.584    95.228    103.06    74.4
240        64.55     95.23     103.1     -
270        64.66     94.26     105.1     -
300        64.676    95.228    105.17    -
330        64.712    95.296    112.12    -
360        64.68     -         129.4     -
390        64.976    -         128.86    -
420        64.94     -         128.86    -
450        64.976    -         130.52    -
480        64.94     -         -         -
510        64.98     -         -         -
540        65.012    -         -         -

HTML 2G
Time (s)   Lucene    Compass   Terrier   Solr
0          0         0         0         0
30         40.98     93.99     53.34     41.37
60         58.13     93.28     65.26     63.17
90         58.24     94.68     87.92     69.69
120        58.36     94.4      88.76     70.848
150        58.34     94.916    91.732    72.276
180        58.45     94.78     93.19     73.71
210        58.72     94.744    96.344    73.712
240        60.84     94.76     98.1      79.11
270        60.86     94.78     110.7     79.11
300        60.86     94.808    110.73    79.108
330        60.824    94.808    110.73    79.108
360        60.86     94.81     125.8     79.14
390        62.532    94.808    125.8     -
420        62.532    94.808    140.31    -
450        62.568    95.312    140.31    -
480        62.532    95.38     140.31    -
510        62.57     95.45     140.3     -
540        62.532    95.444    146.46    -
570        62.568    95.46     146.46    -
600        65.52     95.44     146.5     -
630        65.52     95.48     146.5     -
660        65.52     -         146.5     -
690        65.5      -         147.12    -
720        65.52     -         147.1     -
750        65.52     -         149.3     -
780        65.532    -         -         -
810        65.516    -         -         -
840        65.532    -         -         -
870        65.516    -         -         -
900        65.516    -         -         -
930        66.268    -         -         -
960        66.288    -         -         -
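One way to read the memory series is to look for each product's peak. The following is a minimal sketch using the TREC-9 readings for Lucene and Compass listed above. `peak` is an illustrative helper, not part of the source, and the unit is assumed to be MB as on the chart axis.

```python
# Sampling times in seconds (one reading every 30 s), as in the series above.
times = list(range(0, 390, 30))  # 0, 30, ..., 360

# TREC-9 memory readings copied from the series above.
# A shorter list means the source has no further readings for that product.
memory = {
    "Lucene": [0, 52.4, 52.51, 52.51, 52.532, 52.56, 51.95],
    "Compass": [0, 126.7, 174, 174.6, 179.812, 179.81, 179.9, 179.88,
                183.1, 183.1, 183.43, 183.43, 183.4],
}


def peak(readings, sample_times):
    """Return (time, value) for the first maximum reading."""
    # zip stops at the shorter sequence, so short series are handled.
    return max(zip(sample_times, readings), key=lambda pair: pair[1])


peaks = {product: peak(values, times) for product, values in memory.items()}

for product, (t, value) in peaks.items():
    print(f"{product}: peak {value} at {t} s")
```

Because `max` returns the first maximum it meets, a plateau is reported at the time it is first reached.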
Query times on the TREC-9 collection (ms). Field codes: M, S, T, U, W, P, A. "n/d" = not available.

Simple queries, 1 word
Query | Field | Lucene | Compass | Terrier | Solr
human | M | 26 | 59 | 93 | 490
male | M | 10.3 | 33 | 32 | 390
peritoneal | M | 36.4 | 23 | 16 | 172
support | M | 11 | 36 | 31 | 140
neoplasms | M | 5.6 | 26 | 47 | 234
steril | S | 5 | 22 | 15 | 156
diabetes | S | 35.1 | 38 | 15 | 63
neurosurgery | S | 4 | 21 | 15 | 93
urology | S | 2.3 | 14 | 16 | 124
hepatology | S | 2.4 | 25 | 31 | 16
analysis | T | 6.5 | 22 | 16 | 172
news | T | 3.6 | 31 | 31 | 343
transplantation | T | 4.2 | 29 | 31 | 156
receptor | T | 4.5 | 25 | 31 | 234
polyionic | T | 1.8 | 14 | 15 | 31
87049088 | U | 5 | 29 | 25 | 63
90366767 | U | 3.7 | 23 | 22 | 94
88265350 | U | 1.7 | 19 | 16 | 31
87150402 | U | 3.4 | 18 | 18 | 0
91149897 | U | 3.3 | 23 | 17 | 0
results | W | 9.5 | 52 | 31 | 296
weight | W | 5 | 31 | 16 | 266
tipped | W | 3.9 | 31 | 63 | 125
autistic | W | 3.3 | 30 | 15 | 156
failure | W | 4.8 | 33 | 31 | 250
article | P | 56 | 52 | 78 | 172
published | P | 1.9 | 16 | 32 | 63
periodical | P | 1.7 | 17 | 31 | 140
tutorial | P | 5.4 | 29 | 31 | 47
legal | P | 3.3 | 24 | 16 | 62
lucas | A | 37.1 | 14 | 16 | 281
mattson | A | 1.8 | 20 | 16 | 125
dowson | A | 3.5 | 38 | 16 | 140
massi | A | 3.7 | 14 | 31 | 156
akisada | A | 1.7 | 28 | 15 | 94
Mean | | 9.097142857 | 27.4 | 27.74285714 | 153.571429

Simple queries, 2 words
Query | Fields | Lucene | Compass | Terrier | Solr
published OR legal | P | 4.1 | 41 | 16 | 796
S: OR diabetes | S | 1.5 | 45 | 47 | 749
transplantation OR receptor | T | 1.4 | 64 | 47 | 718
weight OR mattson | W OR A | 2.3 | 20 | 31 | 827
urology OR massi | S OR A | 4 | 18 | 31 | 249
hepatology AND article | S AND P | 4 | 19 | 94 | 297
polyionic AND human | T AND M | 2.8 | 18 | 93 | 531
male AND autistic | M AND W | 1.4 | 26 | 47 | 499
analysis AND diabetes | T AND S | 3.6 | 18 | 31 | 250
weight AND neoplasms | W AND M | 4.3 | 28 | 31 | 405
male NOT steril | M NOT S | 39.9 | 50 | 31 | 234
human NOT transplantation | M NOT T | 65.5 | 81 | 94 | 375
neurosurgery NOT failure | S NOT W | 4.7 | 23 | 46 | 343
88265350 NOT tipped | U NOT W | 1.6 | 21 | 31 | 265
periodical NOT dowson | P NOT A | 9.3 | 15 | 46 | 281
Mean | | 10.02666667 | 32.4666667 | 47.73333333 | 99.8666667

Simple queries, 3 words
Query | Fields | Lucene | Compass | Terrier | Solr
article AND news AND results | P AND T AND W | 6.2 | 73 | 93 | 640
peritoneal AND receptor NOT diabetes | M AND T NOT S | 4.7 | 54 | 31 | 390
lucas AND tutorial OR legal | A AND P OR P | 1.6 | 21 | 47 | 281
tipped OR support OR 88265350 | W OR M OR U | 21.8 | 61 | 78 | 453
akisada OR results AND urology | A OR W AND S | 15 | 39 | 47 | 141
neoplasms OR autistic NOT 91149897 | M OR W NOT U | 9.3 | 22 | 32 | 218
neurosurgery NOT transplantation NOT periodical | S NOT T NOT P | 4.6 | 18 | 47 | 390
87049088 NOT weight AND male | U NOT W AND M | 43.7 | 82 | 109 | 230
analysis NOT steril OR human | T NOT S OR M | 74.6 | 96 | 108 | 234
Mean | | 20.16666667 | 51.7777778 | 65.77777778 | 330.777778

Advanced queries

Wildcards
Query | Lucene | Compass | Terrier | Solr
transpl* | 1.6 | 19 | n/d | 38
h?man | 15.6 | 38 | n/d | 358
87049* AND anal??is | 6.3 | 21 | n/d | 71
sup* AND supp?rt NOT s*s | 53.1 | 80 | n/d | 218
ar* NOT do*on OR a?tist?c | 68.6 | 247 | n/d | 792
Mean | 29.04 | 81 | n/d | 295.4

Phrases
Query | Lucene | Compass | Terrier | Solr
"human support" | 103 | 112 | 172 | 568
"receptor analysis" | 37.4 | 29 | 78 | 339
"male support" | 84.3 | 77 | 172 | 301
"results failure" | 45.3 | 41 | 62 | 419
"article published" | 35.9 | 25 | 171 | 97
Mean | 61.18 | 56.8 | 131 | 344.8

Fuzzy
Query | Lucene | Compass | Terrier | Solr
neurosur~ | 46.8 | 149 | n/d | 995
transplan~ | 102.9 | 309 | n/d | 1053
perito~ NOT neopla~ | 170 | 182 | n/d | 5873
res~ OR fail~ | 288.6 | 1269 | n/d | 7118
dow~ NOT mass~ OR lucas~ | 297.9 | 1123 | n/d | 9187
Mean | 181.24 | 606.4 | n/d | 4845.2

Proximity
Query | Lucene | Compass | Terrier | Solr
"news receptor"~3 | 4.7 | 27 | 63 | 199
"neoplasms peritoneal"~5 | 6.2 | 33 | 47 | 443
"failure tipped"~20 OR "autistic results"~50 | 35.9 | 33 | 78 | 658
"peritoneal support"~5 NOT "human male"~3 | 4.7 | 33 | 203 | 601
T:"news receptor"~3 OR M:"neoplasms peritoneal"~5 | 3.2 | 30 | 47 | 15
Mean | 10.94 | 31.2 | 87.6 | 383.2

Advanced (mixed)
Query | Lucene | Compass | Terrier | Solr
M:hum* AND (S:ste??? OR S:urolo~) OR W:result* | 60.8 | 258 | n/d | 548
T:"news receptor"~3 OR T:ne*s NOT S:n*uro~ AND A:ak* | 57.7 | 95 | n/d | 471
(U:911??8*7 AND P:artic~) NOT W:res* OR P:"article published" | 35.9 | 100 | n/d | 200
U:[89049* TO 91149??7] NOT U: [8904999 TO 9000000] | 393.2 | n/d | n/d | 2766
Mean | 153.31 | 113.25 | n/d | 996.25
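Each group of query times above ends with a row of means. The following is a minimal sketch of that calculation for the five wildcard queries. It assumes that `n/d` marks a missing measurement that is left out of the mean; `mean_or_nd` is an illustrative helper, not part of the source.

```python
from statistics import mean

ENGINES = ["Lucene", "Compass", "Terrier", "Solr"]

# Times for the five wildcard queries on TREC-9, copied from the rows above.
# None stands for "n/d" (no measurement).
wildcard_rows = [
    [1.6, 19, None, 38],
    [15.6, 38, None, 358],
    [6.3, 21, None, 71],
    [53.1, 80, None, 218],
    [68.6, 247, None, 792],
]


def mean_or_nd(values):
    """Mean of the available values, or None when every value is missing."""
    present = [v for v in values if v is not None]
    return round(mean(present), 2) if present else None


# zip(*rows) turns the list of rows into one tuple per engine column.
column_means = {
    engine: mean_or_nd(column)
    for engine, column in zip(ENGINES, zip(*wildcard_rows))
}
print(column_means)
```

For this group the sketch reproduces the reported means of 29.04, 81 and 295.4, with no value for Terrier.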
LUCENE HTML
wikipedia 15 4.7
content 14 6.2
search 12 7.8
free 14 9.3
jump 13 6.3
time 12 7.8
extern 7.8 1.6
name 11 4.6
year 11 4.7
febrauri 9.4 7.8
process 11 4.6
human 11 6.2
word 11 6.3
research 12 4.7
build 9 6.2
height 9 4.7
compon 8 3.1
queen 9 4.7
context 11 6.2
survey 9.5 7.8
holder 9.5 4.7
clinton 9.5 3.1
montereal 11 1.5
melbourn 7.8 3.2
filipino 9.5 9.4
chiefest 7.8 3.1
stoller 7.8 3.2
unfrag 7.8 3.1
denslow 7.8 7.8
king 7.8 3.1
10.2 5.25
wikipedia OR extern 21 17.4
process OR queen 15.6 7.8
montereal OR word 15.6 6.2
denslow OR height 14.1 10.9
melbourn OR crossingpoint 11 7.8
name AND context 17.2 7.8
free AND stoller 12.5 4.7
build AND survey 16 6.3
filipino AND febrauri 11 3.1
human AND jump 22 6.3
holder NOT crossingpoint 12.5 9.3
research NOT chiefest 15.6 6.2
compon NOT time 11 6.3
free NOT clinton 23.4 10.9
year NOT content 17.2 14.1
15.71333 8.34
chiefest AND extern AND content 12.5 7.8
process AND jump NOT queen 20.3 14
height AND time OR unfrag 17.2 12.5
name OR free OR human 25 12.5
word OR crossingpoint NOT survey 17.2 10.9
wikipedia OR febrauri AND content 11 9.4
melbourn NOT denslow NOT research 12.5 10.9
holder NOT montereal AND year 18.8 11
stoller NOT search OR build 15.6 12.4
16.67778 11.2666667
wiki* 125 49.8
h?man 53.1 30.6
w*dia NOT y*r AND c?????n 368.8 171.5
s* NOT st*r 523 226
chie??st AND extern AND co??ent 39.1 18.3
221.8 99.24
word research 26.5 15.8
time holder 21.9 15.6
human height 21.9 15.7
search queen 26.6 11.9
free year 35.9 19.6
26.56 15.72
content~ 2012.5 1029.8
unfrag~ 1607.8 913.7
filipino~ NOT name~ 3159.4 1745.2
NOT crossingpoint~ 2106.3 1192.3
word~ OR research~ 3223.4 1783.2
2421.88 1332.84
"free wikipedia" ~3 87.5 49.9
"melbourn word"~12 14.1 9.4
"compon human"~6 NOT "melbourn word"~12 15.7 7.9
"survey holder"~10 OR "extern queen"~10 21.9 12.8
"extern jump"~10 OR "extern unfrag"~5 18.8 4.7
31.6 16.94
NOT crossingpoint~ AND ("compon human"~6 NOT "melbourn word"~12) 2176.5 1211.4
"word research" OR chie??st AND extern AND co??ent AND (filipino~ NOT name~) 3268.7 1776.5
"extern jump"~10 OR "extern unfrag"~5 NOT conte? OR chie??st AND extern AND co??ent 37.5 19.2
(wiki* NOT wikimedia AND "wiki* free"~10 AND w*dia NOT y*r AND c?????n) NOT queen 389 182.4
1174.34 637.9
SOLR HTML TERRIER HTML COMPASS HTML
204 94 96 39
234 187 76 56
78 141 73 49
125 78 87 63
141 187 52 42
218 234 51 46
172 140 89 16
188 249 91 48
156 125 78 40
147 32 62 21
219 188 78 60
172 250 84 48
140 47 62 26
140 187 78 48
157 203 83 43
125 234 64 46
187 94 94 24
94 31 46 26
31 47 92 92
156 188 78 56
171 15 94 37
125 47 73 73
94 234 63 29
140 141 46 31
62 78 62 48
110 47 56 56
94 62 63 34
63 47 55 26
63 109 72 43
47 56 67 45
135.1 125.733333 72.1666667 43.7
297 281 93 81
265 249 110 72
218 198 95 76
250 210 125 78
281 196 92 63
281 140 134 60
187 138 109 62
250 132 156 59
63 53 172 73
265 124 184 81
188 102 112 62
297 168 104 59
203 192 125 93
125 106 98 56
188 89 94 94
223.866667 158.533333 120.2 71.26666667
47 32 96 96
390 204 192 103
391 223 114 114
343 197 92 110
390 188 104 89
110 58 145 94
438 306 172 87
391 201 208 104
265 179 94 94
307.222222 176.444444 135.222222 99
250 156 n/d 40
1016 608 n/d 65
1328 718 n/d 126
2797 1154 n/d 624
141 109 n/d 60
1106.4 549 n/d 183
204 249 78 18
141 281 31 20
94 172 15 14
125 187 63 17
219 281 31 34
156.6 234 43.6 20.6
11203 5533 n/d 2023
7625 2496 n/d 1725
15234 5086 n/d 3422
7922 2745 n/d 2049
16203 5648 n/d 3474
11637.4 4301.6 n/d 2538.6
218 203 281 54
265 171 62 72
1031 156 140 126
953 266 78 67
390 219 171 92
571.4 203 146.4 82.2
8688 2590 n/d 2223
15032 4571 n/d 3391
262 78 n/d 46
2062 390 n/d 471
6511 1907.25 n/d 1532.75
HTML collection: mean query time (ms)
Query type        Lucene     Compass    Terrier    Solr
1-word queries    5.25       43.7       72.16      125.73
2-word queries    8.34       71.26      120.2      158.53
3-word queries    11.26      99         135.22     176.44
Wildcards         99.24      183        n/d        549
Phrases           15.72      20.6       43.6       234
Fuzzy             1332.84    2538.6     n/d        4301.6
Proximity         16.94      82.2       146.4      203
Mixed             637.9      1532.75    n/d        1907.25
TREC-9 collection: mean query time (ms)
Query type        Lucene     Compass    Terrier    Solr
1-word queries    9.097      27.4       27.7428    153.5714
2-word queries    10         32.46      47.733     99.86
3-word queries    20         51.77      65.777     330.77
Wildcards         29.04      81         n/d        295.4
Phrases           61.18      56.8       131        344.8
Fuzzy             181.24     606.4      n/d        4845.2
Proximity         10.94      31.2       87.6       383.2
Mixed             153.31     113.25     n/d        996.25
[Chart: Basic queries (HTML); x axis: Lucene, Compass, Terrier, Solr; scale 0 to 200]
[Chart: Advanced queries (HTML); x axis: Lucene, Compass, Terrier, Solr; scale 0 to 5000]
[Chart: Basic queries (TREC-9); x axis: Lucene, Compass, Terrier, Solr; series: 1-word, 2-word and 3-word queries; scale 0 to 300]
[Chart: x axis: Wildcards, Phrases, Fuzzy, Proximity, Mixed; series: Lucene, Compass, Terrier, Solr; scale 0 to 6000]
Compass
Index = 401 MB; time = 6 min; % size = 102.8 %
Time    Memory consumption    CPU consumption
30      126.712               50
60      173.96                55.6
90      174.6                 43.33
120     179.812               43.49
150     179.812               46.73
180     179.876               56.61
210     179.876               37.34
240     183.096               32.78
270     183.128               53.33
300     183.432               55.21
330     183.432               44.33
360     183.432               42.11
Mean CPU consumption: 46.73833333
420
450
480
510
540
570
600
630
660
690
720
750
780
810
840
870
900
930
960
990
1020
1050
1080
1120