old hard-coded inline asm is still included if requested.
-rick
nb: is a vector of eight floats not the same layout as a simple linear array of same? (Aside from the alignment requirements)
netbsd-family build fixes, also - the AVX2 codepaths are _compiler-specific_, they use features _exclusive_ to gcc and clang