Suffix Array Construction Benchmark

Last updated: 2008-03-02. Recent updates

This page presents the results of the benchmark test that compared the performance of various, latest suffix array construction programs.


Benchmark results:

Timing results on Core 2 Duo
CorpusSizeArchon4r0BPRDC32Deep-Shallowdivsufsort-1.2.3KAKSMSufSort-3.1.1betaqsufsort
totals1289730704263.643717.1971044.819955.447181.437650.9161927.695266.178659.009
ACT-Executable49386800.3061.0722.0590.4560.3971.4223.1870.5941.003
ACT-Graphics124663041.1442.7475.1841.3880.8222.9654.4591.3562.422
ACT-Sound87023920.9162.3194.4280.9340.9374.95612.2751.2971.872
ACT-Text49202860.3690.7562.0500.5940.4501.2663.1750.6191.241
Artificial Corpus3000010.0840.1060.2000.4250.0840.2340.1190.1090.112
Calgary Corpus31416220.4090.9971.4380.5030.4530.9001.0220.5660.722
Canterbury Corpus28107840.3370.7591.2220.5160.3500.6720.7810.5030.587
Large Canterbury Corpus111594820.9811.6384.9471.4191.1753.31910.3721.5652.953
Manzini's Corpus896819039186.047278.297769.319323.347132.866501.5031501.054158.944465.809
Maximum Compression Testfiles531347267.13112.64734.51210.6255.20017.76949.3916.74716.366
Miscellaneous Corpus10000000.0780.1530.3470.1220.1060.2000.3280.1220.128
Protein Corpus71544010.5721.5002.8340.8930.8222.3916.4470.8811.362
Silesia Corpus21193858028.694133.393144.30640.71325.54492.891283.37831.55679.781
Testfiles41943042.50934.4502.3913.4470.3380.6811.86936.5905.753
The Gauntlet6705010334.066246.36369.581570.06611.89419.74749.83824.72878.897
Timing results on Pentium M
CorpusSizeArchon4r0BPRDC32Deep-Shallowdivsufsort-1.2.3KAKSMSufSort-3.1.1betaqsufsort
totals1289730704675.8137291.6172363.2581992.735479.5201939.9279690.051576.5602406.991
ACT-Executable49386800.8173.0144.1941.0271.1255.00712.6101.3603.231
ACT-Graphics124663042.3887.42910.2152.9541.9699.29317.4832.9027.246
ACT-Sound87023921.9256.1719.7982.5422.90417.58731.1472.8085.222
ACT-Text49202861.0171.9574.8551.3461.3245.05111.8671.3984.096
Artificial Corpus3000010.1360.1660.3160.7610.1220.5610.2100.1800.196
Calgary Corpus31416220.8312.3562.4740.9430.8712.1773.5031.0741.827
Canterbury Corpus28107840.7191.7752.2031.0500.6711.7142.7960.9231.749
Large Canterbury Corpus111594823.0574.24612.8043.7113.63713.22331.2013.7498.748
Manzini's Corpus896819039437.1235891.9141699.748697.401342.1941467.4548620.099342.2521825.437
Maximum Compression Testfiles5313472616.50432.51779.17626.30214.68360.753135.32715.62048.067
Miscellaneous Corpus10000000.1620.5770.6850.2460.2680.7991.5320.2600.541
Protein Corpus71544011.4864.4646.6421.9652.4229.08719.7122.0514.312
Silesia Corpus21193858068.140381.357323.21289.47268.657281.136651.02069.628219.511
Testfiles419430426.712119.18112.95323.0511.4503.0107.86580.04718.232
The Gauntlet67050103114.797834.494193.9831139.96337.22263.072143.67952.306258.575
Timing results on PowerPC G5
CorpusSizeArchon4r0BPRDC32Deep-Shallowdivsufsort-1.2.3KAKSMSufSort-3.1.1betaqsufsort
totals12897307041682.6544737.8265532.6872935.5681061.1413077.2919352.8611007.5174204.685
ACT-Executable49386801.1584.6164.7311.3461.4285.67016.2071.6703.978
ACT-Graphics124663043.50612.61411.2413.6972.50711.00024.5113.43410.184
ACT-Sound87023924.29315.42716.5615.1485.11826.13358.1164.2498.806
ACT-Text49202861.6393.2735.9901.8671.8605.97817.8421.7666.007
Artificial Corpus3000010.0680.1380.2370.5180.0630.4970.1540.1220.140
Calgary Corpus31416220.6862.8752.2820.7610.7622.3654.7441.0131.968
Canterbury Corpus28107840.5802.1751.9040.7730.5671.6163.4510.8511.634
Large Canterbury Corpus111594825.1989.11316.3925.4535.15816.15851.3715.34014.213
Manzini's Corpus8968190391244.4843026.6934273.9811540.585782.6642384.1677294.444729.7192917.944
Maximum Compression Testfiles5313472633.09878.928143.64646.52325.20972.457215.71625.86182.271
Miscellaneous Corpus10000000.3160.9080.8810.3350.4211.0242.0780.3390.713
Protein Corpus71544012.8069.9688.4452.9144.09012.57033.4573.0857.211
Silesia Corpus211938580166.572624.345666.309181.118141.888444.3141344.418130.561464.288
Testfiles419430411.059100.09717.18421.5581.9823.2516.7427.39213.726
The Gauntlet67050103207.191846.655362.9021122.97487.42390.091279.60892.113671.603

The current fastest three programs:


Benchmark environments:

Core 2 DuoPentium MPowerPC G5
Processor2.66 GHz Intel Core 2 Duo E67501.7 GHz Intel Pentium M1.8 GHz PowerPC G5
L1 Cache(32 Kb + 32 Kb) x 232 Kb + 32 Kb(I)64 Kb + (D)32 Kb
L2 Cache4 Mb1 Mb512 Kb
RAM2 Gb main memory1 Gb main memory1 Gb main memory
Operating systemWindows XP Home SP 2Windows XP Professional SP 2Darwin 8.11.0
Compilergcc version 4.2.3gcc version 4.2.3gcc version 4.2.3

All programs were compiled with gcc/g++ using '-O3 -fomit-frame-pointer' optimization options. Times are the average of five runs, in seconds, and were measured using the standard Unix/Cygwin 'time' command. (user + system)

The suffix array construction programs:

The test files:


Recent updates:


This page is maintained by Yuta Mori <yiv01157 at nifty dot com>, Copyright © 2005-2008.