I have a (maybe small) question about SIMD.
I would like to simply compare two 256-bit vectors of 8-bit unsigned integers. I found what looks like the perfect function, _mm256_cmpgt_epu8_mask, which compares the two vectors and returns a mask.
Unfortunately, this intrinsic (and the instruction behind it) requires AVX512VL + AVX512BW, which as far as I can tell is mostly limited to server CPUs.
My question is: how can I do this comparison without AVX-512?
Is _mm256_cmpgt_epu8_mask simply not available on a normal Intel CPU?
Is there an alternative way to solve this?
error: inlining failed in call to always_inline ‘__mmask32 _mm256_cmpgt_epu8_mask(__m256i, __m256i)’: target specific option mismatch
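
To make the goal concrete, here is a minimal sketch of the result I am after: a 32-bit mask where bit i is set when byte i of the first vector is greater than byte i of the second, with both treated as unsigned. The function names cmpgt_epu8_mask_scalar and cmpgt_epu8_mask are hypothetical, made up for illustration. The AVX2 branch is only an assumption about one possible workaround: it flips the top bit of every byte so that the signed compare _mm256_cmpgt_epi8 orders the values like an unsigned compare, then collects the sign bits with _mm256_movemask_epi8. It is only compiled when the compiler defines __AVX2__ (for example with -mavx2); otherwise the scalar loop is used.

```c
#include <stdint.h>
#ifdef __AVX2__
#include <immintrin.h>
#endif

/* Scalar reference: bit i of the result is set when a[i] > b[i] (unsigned). */
static uint32_t cmpgt_epu8_mask_scalar(const uint8_t a[32], const uint8_t b[32])
{
    uint32_t mask = 0;
    for (int i = 0; i < 32; ++i) {
        if (a[i] > b[i])
            mask |= (uint32_t)1 << i;
    }
    return mask;
}

/* Hypothetical AVX2 emulation; falls back to the scalar loop without AVX2. */
static uint32_t cmpgt_epu8_mask(const uint8_t a[32], const uint8_t b[32])
{
#ifdef __AVX2__
    /* XOR with 0x80 maps unsigned 0..255 onto signed -128..127, preserving order. */
    const __m256i bias = _mm256_set1_epi8((char)0x80);
    __m256i va = _mm256_loadu_si256((const __m256i *)a);
    __m256i vb = _mm256_loadu_si256((const __m256i *)b);
    va = _mm256_xor_si256(va, bias);
    vb = _mm256_xor_si256(vb, bias);
    /* Signed compare: each byte becomes 0xFF where va > vb, else 0x00. */
    __m256i gt = _mm256_cmpgt_epi8(va, vb);
    /* Gather the top bit of each of the 32 bytes into a 32-bit mask. */
    return (uint32_t)_mm256_movemask_epi8(gt);
#else
    return cmpgt_epu8_mask_scalar(a, b);
#endif
}
```

Both functions take plain 32-byte arrays, so the vector path can be checked against the scalar loop on the same inputs. The case 128 versus 127 is the interesting one, since a signed compare without the bias would get it wrong.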