Richard Thier
|
57e328897a
|
added int_to_float.h as might be needed in future for new sort alg idea
|
2022-10-10 19:56:55 +02:00 |
|
Richard Thier
|
d0843cbc40
|
Revert "trying with the restrict keyword - unsuccessful"
This reverts commit 1908687002c085c628853711d26b95e9bc92a1ad.
|
2022-09-01 02:15:43 +02:00 |
|
Richard Thier
|
1908687002
|
trying with the restrict keyword - unsuccessful
|
2022-09-01 02:14:30 +02:00 |
|
Richard Thier
|
5831527f9d
|
removed vectorize makefile part example
|
2022-09-01 01:56:40 +02:00 |
|
Richard Thier
|
7e8aa96a39
|
duplication counting, word-based radix made possible (commented out), modulus impuit, vectorize makefile example
|
2022-09-01 01:56:15 +02:00 |
|
Richard Thier
|
c02aa49f73
|
tweaked to have better MEASURE_ONLY
|
2022-08-31 12:56:51 +02:00 |
|
Richard Thier
|
dac044fe91
|
Revert "trying more ILP in spsort - not much success and will be reverted"
This reverts commit c7e794b7ea1e10f25e64ba29f740890f3aa93b0a.
|
2022-08-16 19:24:35 +02:00 |
|
Richard Thier
|
c7e794b7ea
|
trying more ILP in spsort - not much success and will be reverted
|
2022-08-16 19:24:27 +02:00 |
|
Richard Thier
|
f8793f71ab
|
more ILP in some nearly sorted bucket parts in spsort - rand still not good so much
|
2022-08-16 18:59:18 +02:00 |
|
Richard Thier
|
4bed99751f
|
added feature to use profilers with algorithms more easily
|
2022-08-16 18:37:13 +02:00 |
|
Richard Thier
|
36ea38940b
|
in-place algorithm and fix of perf bug where we not early exited in spsort after insertion sorting small buckets so they were sorted again by us too :-)
|
2022-08-16 18:06:46 +02:00 |
|
Richard Thier
|
b1ab4f2f87
|
tested sp sort and works the first version. Slightly slower on random than std and sometimes slightly faster - but this version do copies that I should optimize out!
|
2022-08-16 16:00:02 +02:00 |
|
Richard Thier
|
edbb59d7a1
|
spsort got twovalue sort special case (no infinite recursion)
|
2022-08-16 15:50:36 +02:00 |
|
Richard Thier
|
6a0a2540bb
|
re-add other test cases - now fails constant because infinite recursion to be fixed for spsort
|
2022-08-16 12:41:19 +02:00 |
|
Richard Thier
|
d0fa5c5b48
|
simplification + fixing right side of array not sorted because -1 counts
|
2022-08-16 12:25:15 +02:00 |
|
Richard Thier
|
6073c03f81
|
temporarily do naive array separate code for debugging
|
2022-08-16 12:16:31 +02:00 |
|
Richard Thier
|
fbea1e607c
|
factored out internal_array_separate(..) to check if it has the errors or not
|
2022-08-16 11:44:34 +02:00 |
|
Richard Thier
|
680936f50a
|
still buggy sp code but differently...
|
2022-08-16 04:13:12 +02:00 |
|
Richard Thier
|
e83392ebaa
|
added "sp" to tests - buggy for now, but at least in const works and inc nearly ok
|
2022-08-16 03:29:08 +02:00 |
|
Richard Thier
|
fad7345a80
|
space partitioning sort first - buggy, but neargood - versions
|
2022-08-16 03:28:06 +02:00 |
|
Richard Thier
|
ee08930cae
|
add -DNDEBUG
|
2022-08-15 23:02:30 +02:00 |
|
Richard Thier
|
ec0f73af01
|
remove session.vim
|
2022-08-15 23:01:55 +02:00 |
|
Richard Thier
|
a2ee3cdb8c
|
make: better makefile
|
2022-01-25 20:04:25 +01:00 |
|
Richard Thier
|
ff05bc2688
|
added if constexpr(..) where it could be
|
2021-12-20 13:29:47 +01:00 |
|
Richard Thier
|
d858f39708
|
Merge branch 'tmp' into ilp-radix-1
|
2021-12-19 22:53:09 +01:00 |
|
Richard Thier
|
c77e592a84
|
mlocks and frewr algorithm both added
|
2021-12-19 21:55:48 +01:00 |
|
|
|
efa2c7bc26
|
old 2007 laptop linux results
|
2021-12-18 20:44:03 +01:00 |
|
|
|
1658e5abbe
|
fine-tuning on linux laptop (just parameters)
|
2021-12-18 20:41:30 +01:00 |
|
Richard Thier
|
a4d50c3309
|
input reduction for testing on less capable machines
|
2021-12-18 19:54:14 +01:00 |
|
Richard Thier
|
da4d122ee1
|
more of latest changes - random weird shiiit
|
2021-12-18 03:49:52 +01:00 |
|
Richard Thier
|
f24b3987c0
|
improved indirections
|
2021-12-18 02:34:22 +01:00 |
|
Richard Thier
|
3b413fcba0
|
removed reference to pointer parameter - a bit better indirections
|
2021-12-18 02:20:42 +01:00 |
|
Richard Thier
|
298edba5d2
|
minor unroll
|
2021-12-18 01:48:42 +01:00 |
|
Richard Thier
|
e7b677e4db
|
basic prefetch optimizations
|
2021-12-18 01:23:06 +01:00 |
|
Richard Thier
|
e5d4ff74ad
|
more manual unrolls
|
2021-12-17 23:37:48 +01:00 |
|
Richard Thier
|
645bc19f19
|
Manual occurence unrolling
|
2021-12-17 22:48:38 +01:00 |
|
Richard Thier
|
be450086b5
|
took out prefetch and added commented out pragmas - they not help
|
2021-12-17 22:09:35 +01:00 |
|
Richard Thier
|
3fdcaad537
|
trying some prefetch - not that good yet
|
2021-12-17 21:42:35 +01:00 |
|
Richard Thier
|
0b4eb5e5a6
|
minor speed tweaks by being able to define the counter type
|
2021-12-17 21:17:53 +01:00 |
|
Richard Thier
|
1686967f10
|
minor tweaks to 4pasu and added 4rot
|
2021-12-17 19:20:58 +01:00 |
|
Richard Thier
|
a878f20100
|
ypsus 4passu method optimized a bit
|
2021-12-15 16:09:40 +01:00 |
|
Richard Thier
|
a947cda58d
|
Revert "vsort version that got slower, but is really funny template code"
This reverts commit fd35dbc51b63fa97ff5a9d7a823cdfa271b99a43.
|
2021-12-15 14:48:27 +01:00 |
|
Richard Thier
|
fd35dbc51b
|
vsort version that got slower, but is really funny template code
|
2021-12-15 14:48:14 +01:00 |
|
Richard Thier
|
bff96c8f7f
|
upgraded vsort a bit (50-100ms)
|
2021-12-15 12:53:00 +01:00 |
|
Richard Thier
|
520db7049d
|
added ypsu-variants of radix-like things
|
2021-12-15 12:52:33 +01:00 |
|
Richard Thier
|
a044787846
|
finally again a real optimization and API for reusal - even faster for non-reused
|
2021-12-15 03:14:35 +01:00 |
|
Richard Thier
|
3490201420
|
further optimization - API change however is not a no-cost abstraction as it makes clang slower than original heap variant and g++ albeit faster than original it does not as fast as hardcoded - will investigave API change
|
2021-12-15 00:43:25 +01:00 |
|
Richard Thier
|
a1d6e96f5a
|
back to regular perf / measure run
|
2021-12-14 17:32:43 +01:00 |
|
Richard Thier
|
05235e269f
|
added simd-sort - basically the whole repo, but I haxed-in magyarsort as measure
|
2021-12-14 17:30:07 +01:00 |
|
Richard Thier
|
c4ed2994ea
|
Setting up causal profiling with "coz"
|
2021-12-14 17:29:33 +01:00 |
|