95 Commits

Author SHA1 Message Date
Jakub Szuppe
790dcab387 Fix benchmark for std::partial_sum()
In the partial_sum benchmark, the STL std::partial_sum() should not
be run in-place, since the benchmarks for other libs are not run
in-place.
2016-07-25 22:37:17 +02:00
Jakub Szuppe
1311de15a3 Add benchmark for comparison sort 2016-06-22 15:59:55 +02:00
Jakub Szuppe
3493b750e8 Fix type, long_ and long might be different types 2016-05-21 16:57:45 +02:00
Jakub Szuppe
5e28138a7a Update minimum required Boost version to 1.54 2016-05-03 14:32:22 +02:00
Jakub Szuppe
9173c07f40 Fix saxpy benchmark
boost::compute::fill() may not have finished before we start
measuring execution time.
2016-04-10 12:42:35 +02:00
Jakub Szuppe
dcddaae8d2 Make number of trials configurable from cmd line
Make number of trials in benchmarks (how many times algorithm
is run) configurable from command line.
2016-04-10 12:42:35 +02:00
Jakub Szuppe
8bc763cd53 MSVC/WIN-related update of CMakeLists.txt files
This commit fixes misc. bugs in CMakeLists.txt files that
were causing various build errors and warnings.
2016-03-09 18:56:16 +01:00
Jakub Szuppe
a1895728a1 Use custom FindOpenCL.cmake only for cmake older than 3.1 2016-03-09 15:30:02 +01:00
Jakub Szuppe
8e3a57626b Add missing includes 2016-02-11 19:07:59 +01:00
Jakub Szuppe
e85f4ba826 Add missing include in perf_fill.cpp 2016-02-08 21:01:14 +01:00
Kyle Lutz
f249741bf8 Merge pull request #522 from haahh/pr_find_extrema_first_extremum
Fix find_extrema_with_atomics and _with_reduce
2015-10-23 00:19:54 -07:00
Jakub Szuppe
e357a6a338 Test if min/max_element algorithms return the first min/max in given range 2015-10-03 18:20:48 +02:00
Jakub Szuppe
586f815e48 Remove extra semicolon in perf_bolt_saxpy.cpp 2015-09-24 15:03:48 +02:00
Jakub Szuppe
36418e4cf8 Remove redundant kernel compilation from binary_find
After this change, the binary_find kernel is compiled only once during binary_find
algorithm execution instead of in every iteration of the while loop.
2015-09-19 19:17:27 +02:00
Kyle Lutz
cf5e40ee26 Merge pull request #497 from haahh/pr_find_extrema_supports_custom_comparision
Support for custom comparison function in find_extrema()
2015-08-19 20:24:04 -07:00
Jakub Szuppe
11709434eb Fix perf_max_element benchmark 2015-08-19 13:42:16 +02:00
Kyle Lutz
cadff8e049 Merge pull request #489 from haahh/pr_serial_merge
Using serial merge in merge() for small inputs
2015-08-01 13:44:52 -07:00
Jakub Szuppe
e3324f887d Using serial merge in merge algorithm for small inputs 2015-08-01 19:41:16 +02:00
Kyle Lutz
15a778d84a Merge pull request #480 from haahh/pr_boost_inspection
Fixing problems reported in Boost Inspection Report
2015-07-18 11:01:57 -07:00
Jakub Szuppe
7e12f63a81 Add missing Boost license info and copyrights 2015-07-18 14:01:29 +02:00
Jakub Szuppe
350f2c93a8 Fix violations of Boost min/max guidelines 2015-07-16 21:18:08 +02:00
Jakub Szuppe
33d8ed2fa9 Reduce_by_key benchmarks for Boost.Compute, Thrust and BOLT 2015-07-15 11:20:49 +02:00
Jakub Szuppe
c5409541ac Parallel reduce by key algorithm implementation 2015-07-15 11:20:49 +02:00
Junaid Muzammil
a9c373f92c Merge remote-tracking branch 'newrepo/develop' into develop 2015-06-02 19:18:00 +05:00
Jakub Szuppe
e6af5dc632 Suppress warnings caused by BOLT library 2015-05-21 13:27:14 +02:00
Jakub Szuppe
abd0d8d355 Fix std::max_element() benchmark
This prevents the compiler from optimizing out the std::max_element()
call entirely because its result is never used.
2015-05-19 13:02:17 +02:00
Kyle Lutz
fda67a22d0 Update GitHub links 2015-05-17 20:32:09 -07:00
Kyle Lutz
7a943b4f91 Merge pull request #453 from haahh/perf_tests_amd_bolt_pr
Performance tests for AMD's Bolt library
2015-05-06 21:54:41 -07:00
Jakub Szuppe
4248957720 Add missing copyright and license info 2015-05-02 19:46:28 +02:00
Jakub Szuppe
bb1ac41ce5 Add AMD's Bolt perf tests
This adds AMD's Bolt C++ Template Library performance tests, so
Boost.Compute can be compared with Bolt.
2015-05-02 19:46:20 +02:00
Anees Haider
b979ff527d Added Threefry RNG to performance benchmark 2015-04-06 09:51:53 +05:00
Kyle Lutz
415e7a0a21 Merge pull request #444 from haahh/find_benchmark
Benchmarks for find() algorithm #325
2015-03-27 08:17:36 -07:00
Jakub Szuppe
a65de68e0d Making sure compiler won't cut out find() calls
Making sure that the compiler won't cut out std::find() and thrust::find()
calls while optimizing the code.
2015-03-27 09:25:47 +01:00
Jakub Szuppe
b1acf0ae74 Benchmarks for find() algorithm
Boost.Compute, STL and Thrust benchmarks for find() algorithm.
2015-03-26 13:50:32 +01:00
Kyle Lutz
a3cbcc8b25 Add perf_random_number_engine benchmark
This unifies the random number engine benchmarks into a single
benchmark application which can switch between each engine based
on a command-line argument.
2015-03-16 20:57:12 -07:00
Kyle Lutz
4175a85171 Add parameter_cache class
This adds a parameter_cache class which can be used to store
execution parameters for an algorithm. Also updates some of
the benchmark programs to find and store optimal parameters.
2015-03-15 18:15:57 -07:00
Jakub Szuppe
5aace2dde9 Fixing author. 2015-03-14 00:09:36 +01:00
Jakub Szuppe
0ab1d5fea2 Add reverse_copy benchmarks 2015-03-10 13:25:12 +01:00
Kyle Lutz
de76e66dd1 Merge pull request #426 from jmr1/msvc2013
MSVC 2013 build fixes
2015-02-22 19:02:45 -07:00
jmr1
300a3df380 Enable math defines for MSVC 2015-02-22 17:21:05 +01:00
Roshan
bf89f042e7 Rewrite benchmarks for set_algorithms 2015-01-25 03:18:44 +05:30
Kyle Lutz
417cb03670 Fixes for Mac OS X 2014-12-24 09:25:55 -08:00
Kyle Lutz
88811e345b Merge pull request #353 from kylelutz/more-thrust-benchmarks
Add more Thrust performance benchmarks
2014-12-21 09:34:35 -08:00
Kyle Lutz
e92eb1da54 Add more Thrust performance benchmarks 2014-12-21 09:24:15 -08:00
Kyle Lutz
c97c83a562 Rename types/builtin.hpp to types/fundamental.hpp 2014-12-20 13:39:50 -08:00
Denis Demidov
51db1b1f75 Show Thrust results on performance plots 2014-12-15 23:08:09 +03:00
Denis Demidov
292e5136ed Use loglog scale for plotting performance data
This makes comparison of Boost.Compute and STL performance easier,
especially for smaller sizes.
2014-12-15 16:16:17 +03:00
Denis Demidov
077a4896c7 Make performance scripts compatible with python3 2014-12-15 16:15:37 +03:00
Rastko Anicic
ae014dc757 max_element perf added 2014-12-10 23:23:50 -05:00
Kyle Lutz
05afa5f707 Use thread-local storage for global program cache 2014-09-28 12:37:48 -07:00