20 Commits

Author SHA1 Message Date
Richard Thier
74e24486f4 Revert "tiny (hopefully) optimization for tpxb"
This reverts commit 2aa7de0d40bca5c8713c5edb41a2ff3995a2ea01.
2025-10-02 08:09:44 +02:00
Richard Thier
2aa7de0d40 tiny (hopefully) optimization for tpxb 2025-10-02 08:09:27 +02:00
Richard Thier
1d1f151c07 thier3: tricky rotation based state storing... 2025-10-02 05:48:24 +02:00
Richard Thier
ccdf991824 Revert "tpxb: 16-wide manual unroll - but it does not seem to be faster"
This reverts commit 6d794612624b445c8e4dae4ea3ee3b42b6a4c92f.
2025-10-01 04:26:44 +02:00
Richard Thier
100de9bc67 Revert "32-wide manual unroll with 2x compiled... still not as good perf as automatic 48x"
This reverts commit 18b734a6e70e989168c94d051bf2da5c08560790.
2025-10-01 04:26:32 +02:00
Richard Thier
18b734a6e7 32-wide manual unroll with 2x compiled... still not as good perf as automatic 48x 2025-10-01 04:18:04 +02:00
Richard Thier
6d79461262 tpxb: 16-wide manual unroll - but it does not seem to be faster 2025-10-01 04:02:08 +02:00
Richard Thier
036725611b removed non-temporal writes as too random patterns for it 2025-10-01 03:24:08 +02:00
Richard Thier
31dd239ad3 Revert "thier3 / tpxb: int->uint32, but this loses a little perf because likely compiler uses the UB of signed overflow to optimize out stuff so will be reverted as it is not a practical thing anyways"
This reverts commit 808b87f266b2ce8a058b94d9183d100362abe1b4.
2025-10-01 02:06:23 +02:00
Richard Thier
808b87f266 thier3 / tpxb: int->uint32, but this loses a little perf because likely compiler uses the UB of signed overflow to optimize out stuff so will be reverted as it is not a practical thing anyways 2025-10-01 02:06:14 +02:00
Richard Thier
c032109110 Revert "tpbx: tried removal of relative addressing but it does not help, just makes n be int instead of uint32_t so probably will be reverted. Sad because this actually looked beneficial"
This reverts commit 5ecb48815b57c51527f2c55c3555fb40ffe48f6b.
2025-10-01 01:53:38 +02:00
Richard Thier
5ecb48815b tpbx: tried removal of relative addressing but it does not help, just makes n be int instead of uint32_t so probably will be reverted. Sad because this actually looked beneficial 2025-10-01 01:53:28 +02:00
Richard Thier
98222d4494 tpxb: tried non-temporal writes (bad for random writes) 2025-10-01 01:28:49 +02:00
Richard Thier
08cb90bb1b Revert "prepared for flame graph analysis"
This reverts commit ac873f7123c0dd23ff9d73668e005c71944a8afa.
2025-09-30 22:18:10 +02:00
Richard Thier
52fc14b0f6 Revert "thier3: write caching queues fixed - bug just makes it slower despite less cache misses"
This reverts commit 967c7c19b54fd0db820bbfa1cbe199a8ac9f5419.
2025-09-30 22:17:30 +02:00
Richard Thier
967c7c19b5 thier3: write caching queues fixed - bug just makes it slower despite less cache misses 2025-09-30 22:12:22 +02:00
Richard Thier
ac873f7123 prepared for flame graph analysis 2025-09-30 17:19:47 +02:00
Richard Thier
0a199b9d72 Revert "hand unrolled thiersort3 - I think its slower than gcc unrolling and surely more complex so I will revert"
This reverts commit 523605e8d841733d7c398131ea50e356b35b88e3.
2025-09-29 18:52:02 +02:00
Richard Thier
523605e8d8 hand unrolled thiersort3 - I think its slower than gcc unrolling and surely more complex so I will revert 2025-09-29 18:51:53 +02:00
Richard Thier
a5cb0995e3 added missing headers for thiersort3 2025-09-29 18:21:16 +02:00