Richard Thier
|
66376651a3
|
Revert "thier3: tricky rotation based state storing..."
This reverts commit 1d1f151c0730314ee4370eb288bf1f8c09824b02.
|
2025-10-02 08:09:57 +02:00 |
|
Richard Thier
|
74e24486f4
|
Revert "tiny (hopefully) optimization for tpxb"
This reverts commit 2aa7de0d40bca5c8713c5edb41a2ff3995a2ea01.
|
2025-10-02 08:09:44 +02:00 |
|
Richard Thier
|
2aa7de0d40
|
tiny (hopefully) optimization for tpxb
|
2025-10-02 08:09:27 +02:00 |
|
Richard Thier
|
1d1f151c07
|
thier3: tricky rotation based state storing...
|
2025-10-02 05:48:24 +02:00 |
|
Richard Thier
|
ccdf991824
|
Revert "tpxb: 16-wide manual unroll - but it does not seem to be faster"
This reverts commit 6d794612624b445c8e4dae4ea3ee3b42b6a4c92f.
|
2025-10-01 04:26:44 +02:00 |
|
Richard Thier
|
100de9bc67
|
Revert "32-wide manual unroll with 2x compiled... still not as good perf as automatic 48x"
This reverts commit 18b734a6e70e989168c94d051bf2da5c08560790.
|
2025-10-01 04:26:32 +02:00 |
|
Richard Thier
|
18b734a6e7
|
32-wide manual unroll with 2x compiled... still not as good perf as automatic 48x
|
2025-10-01 04:18:04 +02:00 |
|
Richard Thier
|
6d79461262
|
tpxb: 16-wide manual unroll - but it does not seem to be faster
|
2025-10-01 04:02:08 +02:00 |
|
Richard Thier
|
036725611b
|
removed non-temporal writes, as the write patterns are too random for them
|
2025-10-01 03:24:08 +02:00 |
|
Richard Thier
|
31dd239ad3
|
Revert "thier3 / tpxb: int->uint32, but this loses a little perf, likely because the compiler uses the UB of signed overflow to optimize things out, so it will be reverted as it is not a practical change anyway"
This reverts commit 808b87f266b2ce8a058b94d9183d100362abe1b4.
|
2025-10-01 02:06:23 +02:00 |
|
Richard Thier
|
808b87f266
|
thier3 / tpxb: int->uint32, but this loses a little perf, likely because the compiler uses the UB of signed overflow to optimize things out, so it will be reverted as it is not a practical change anyway
|
2025-10-01 02:06:14 +02:00 |
|
Richard Thier
|
c032109110
|
Revert "tpxb: tried removal of relative addressing but it does not help, just makes n be int instead of uint32_t so probably will be reverted. Sad because this actually looked beneficial"
This reverts commit 5ecb48815b57c51527f2c55c3555fb40ffe48f6b.
|
2025-10-01 01:53:38 +02:00 |
|
Richard Thier
|
5ecb48815b
|
tpxb: tried removal of relative addressing but it does not help, just makes n be int instead of uint32_t so probably will be reverted. Sad because this actually looked beneficial
|
2025-10-01 01:53:28 +02:00 |
|
Richard Thier
|
98222d4494
|
tpxb: tried non-temporal writes (bad for random writes)
|
2025-10-01 01:28:49 +02:00 |
|
Richard Thier
|
08cb90bb1b
|
Revert "prepared for flame graph analysis"
This reverts commit ac873f7123c0dd23ff9d73668e005c71944a8afa.
|
2025-09-30 22:18:10 +02:00 |
|
Richard Thier
|
52fc14b0f6
|
Revert "thier3: write caching queues fixed - but it just makes it slower despite fewer cache misses"
This reverts commit 967c7c19b54fd0db820bbfa1cbe199a8ac9f5419.
|
2025-09-30 22:17:30 +02:00 |
|
Richard Thier
|
967c7c19b5
|
thier3: write caching queues fixed - but it just makes it slower despite fewer cache misses
|
2025-09-30 22:12:22 +02:00 |
|
Richard Thier
|
ac873f7123
|
prepared for flame graph analysis
|
2025-09-30 17:19:47 +02:00 |
|
Richard Thier
|
0a199b9d72
|
Revert "hand unrolled thiersort3 - I think it's slower than gcc unrolling and surely more complex so I will revert"
This reverts commit 523605e8d841733d7c398131ea50e356b35b88e3.
|
2025-09-29 18:52:02 +02:00 |
|
Richard Thier
|
523605e8d8
|
hand unrolled thiersort3 - I think it's slower than gcc unrolling and surely more complex so I will revert
|
2025-09-29 18:51:53 +02:00 |
|
Richard Thier
|
a5cb0995e3
|
added missing headers for thiersort3
|
2025-09-29 18:21:16 +02:00 |
|