Jakub Szuppe
790dcab387
Fix benchmark for std::partial_sum()
...
In partial_sum benchmark for STL std::parial_sum() should not
be run in-place since benchmarks for other libs are not run
in-place.
2016-07-25 22:37:17 +02:00
Jakub Szuppe
1311de15a3
Add benchmark for comparison sort
2016-06-22 15:59:55 +02:00
Jakub Szuppe
3493b750e8
Fix type, long_ and long might be different types
2016-05-21 16:57:45 +02:00
Jakub Szuppe
5e28138a7a
Update minimum required Boost version to 1.54
2016-05-03 14:32:22 +02:00
Jakub Szuppe
9173c07f40
Fix saxpy benchmark
...
boost::compute::fill() may not be done before we start
execution measuring time.
2016-04-10 12:42:35 +02:00
Jakub Szuppe
dcddaae8d2
Make number of trials configurable from cmd line
...
Make number of trials in benchmarks (how many times algorithm
is run) configurable from command line.
2016-04-10 12:42:35 +02:00
Jakub Szuppe
8bc763cd53
MSVC/WIN-related update of CMakeLists.txt files
...
This commit fixes misc. bugs in CMakeLists.txt files that
were causing various build errors and warnigns.
2016-03-09 18:56:16 +01:00
Jakub Szuppe
a1895728a1
Use custom FindOpenCL.cmake only for cmake older than 3.1
2016-03-09 15:30:02 +01:00
Jakub Szuppe
8e3a57626b
Add missing includes
2016-02-11 19:07:59 +01:00
Jakub Szuppe
e85f4ba826
Add missing include in perf_fill.cpp
2016-02-08 21:01:14 +01:00
Kyle Lutz
f249741bf8
Merge pull request #522 from haahh/pr_find_extrema_first_extremum
...
Fix find_extrema_with_atomics and _with_reduce
2015-10-23 00:19:54 -07:00
Jakub Szuppe
e357a6a338
Test if min/max_element algorithms return the first min/max in given range
2015-10-03 18:20:48 +02:00
Jakub Szuppe
586f815e48
Remove extra semicolon in perf_bolt_saxpy.cpp
2015-09-24 15:03:48 +02:00
Jakub Szuppe
36418e4cf8
Remove redundant kernel compilation from binary_find
...
After this binary_find kernel is compiled only once during binary_find algorithm
execution instead of in every iteration of the while loop.
2015-09-19 19:17:27 +02:00
Kyle Lutz
cf5e40ee26
Merge pull request #497 from haahh/pr_find_extrema_supports_custom_comparision
...
Support for custom comparison function in find_extrema()
2015-08-19 20:24:04 -07:00
Jakub Szuppe
11709434eb
Fix perf_max_element benchmark
2015-08-19 13:42:16 +02:00
Kyle Lutz
cadff8e049
Merge pull request #489 from haahh/pr_serial_merge
...
Using serial merge in merge() for small inputs
2015-08-01 13:44:52 -07:00
Jakub Szuppe
e3324f887d
Using serial merge in merge algorithm for small inputs
2015-08-01 19:41:16 +02:00
Kyle Lutz
15a778d84a
Merge pull request #480 from haahh/pr_boost_inspection
...
Fixing problems reported in Boost Inspection Report
2015-07-18 11:01:57 -07:00
Jakub Szuppe
7e12f63a81
Add missing Boost license info and copyrights
2015-07-18 14:01:29 +02:00
Jakub Szuppe
350f2c93a8
Fix violations of Boost min/max guidelines
2015-07-16 21:18:08 +02:00
Jakub Szuppe
33d8ed2fa9
Reduce_by_key benchmarks for Boost.Compute, Thrust and BOLT
2015-07-15 11:20:49 +02:00
Jakub Szuppe
c5409541ac
Parallel reduce by key algorithm implementation
2015-07-15 11:20:49 +02:00
Junaid Muzammil
a9c373f92c
Merge remote-tracking branch 'newrepo/develop' into develop
2015-06-02 19:18:00 +05:00
Jakub Szuppe
e6af5dc632
Suppress warnings caused by BOLT library
2015-05-21 13:27:14 +02:00
Jakub Szuppe
abd0d8d355
Fix std::max_element() benchmark
...
It removes the situation when compiler is optimizing out std::max_element()
call entirely because its result is never used.
2015-05-19 13:02:17 +02:00
Kyle Lutz
fda67a22d0
Update GitHub links
2015-05-17 20:32:09 -07:00
Kyle Lutz
7a943b4f91
Merge pull request #453 from haahh/perf_tests_amd_bolt_pr
...
Performance tests for AMD's Bolt library
2015-05-06 21:54:41 -07:00
Jakub Szuppe
4248957720
Add missing copyright and license info
2015-05-02 19:46:28 +02:00
Jakub Szuppe
bb1ac41ce5
Add AMD's Bolt perf tests
...
This adds AMD's Bolt C++ Template Library performance tests, so
Boost.Compute can be compared with Bolt.
2015-05-02 19:46:20 +02:00
Anees Haider
b979ff527d
Added Threefry RNG to performance benchmark
2015-04-06 09:51:53 +05:00
Kyle Lutz
415e7a0a21
Merge pull request #444 from haahh/find_benchmark
...
Benchmarks for find() algorithm #325
2015-03-27 08:17:36 -07:00
Jakub Szuppe
a65de68e0d
Making sure compiler won't cut out find() calls
...
Making sure that compiler won't cut out std::find() and thrust:find()
calls while optimizing the code.
2015-03-27 09:25:47 +01:00
Jakub Szuppe
b1acf0ae74
Benchmarks for find() algorithm
...
Boost.Compute, STL and Thrust benchmarks for find() algorithm.
2015-03-26 13:50:32 +01:00
Kyle Lutz
a3cbcc8b25
Add perf_random_number_engine benchmark
...
This unifies the random number engine benchmarks into a single
benchmark application which can switch between each engine based
on a command-line argument.
2015-03-16 20:57:12 -07:00
Kyle Lutz
4175a85171
Add parameter_cache class
...
This adds a parameter_cache class which can be used to store
execution parameters for an algorithm. Also updates some of
the benchmark programs to find and store optimal parameters.
2015-03-15 18:15:57 -07:00
Jakub Szuppe
5aace2dde9
Fixing author.
2015-03-14 00:09:36 +01:00
Jakub Szuppe
0ab1d5fea2
Add reverse_copy benchmarks
2015-03-10 13:25:12 +01:00
Kyle Lutz
de76e66dd1
Merge pull request #426 from jmr1/msvc2013
...
MSVC 2013 build fixes
2015-02-22 19:02:45 -07:00
jmr1
300a3df380
Enable math defines for MSVC
2015-02-22 17:21:05 +01:00
Roshan
bf89f042e7
Rewrite benchmarks for set_algorithms
2015-01-25 03:18:44 +05:30
Kyle Lutz
417cb03670
Fixes for Mac OS X
2014-12-24 09:25:55 -08:00
Kyle Lutz
88811e345b
Merge pull request #353 from kylelutz/more-thrust-benchmarks
...
Add more Thrust performance benchmarks
2014-12-21 09:34:35 -08:00
Kyle Lutz
e92eb1da54
Add more Thrust performance benchmarks
2014-12-21 09:24:15 -08:00
Kyle Lutz
c97c83a562
Rename types/builtin.hpp to types/fundamental.hpp
2014-12-20 13:39:50 -08:00
Denis Demidov
51db1b1f75
Show Thrust results on performance plots
2014-12-15 23:08:09 +03:00
Denis Demidov
292e5136ed
Use loglog scale for plotting performance data
...
This makes comparison of Boost.Compute and STL performance easier,
especially for smaller sizes.
2014-12-15 16:16:17 +03:00
Denis Demidov
077a4896c7
Make performance scripts compatible with python3
2014-12-15 16:15:37 +03:00
Rastko Anicic
ae014dc757
max_element perf added
2014-12-10 23:23:50 -05:00
Kyle Lutz
05afa5f707
Use thread-local storage for global program cache
2014-09-28 12:37:48 -07:00