Since addition mod 2 is xor, multiple threads can write to the same FpVectorP<2> concurrently using AtomicU64::fetch_xor. This can be useful for parallelizing certain operations, such as acting on a FreeModule.
This would make the mod 2 case substantially different from the odd prime case, though. It may be better to wait for specialization.
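
A minimal sketch of the idea, not the actual FpVectorP layout: it assumes a mod 2 vector is stored as u64 limbs with 64 entries per limb. The names AtomicF2Vec, add_limbs, to_limbs and parallel_sum are hypothetical and exist only for illustration.

```rust
use std::sync::atomic::{AtomicU64, Ordering};
use std::thread;

/// Hypothetical stand-in for a mod 2 vector: each limb packs 64 entries of F_2.
struct AtomicF2Vec {
    limbs: Vec<AtomicU64>,
}

impl AtomicF2Vec {
    /// Create the zero vector with `len` entries.
    fn new(len: usize) -> Self {
        let num_limbs = (len + 63) / 64;
        Self {
            limbs: (0..num_limbs).map(|_| AtomicU64::new(0)).collect(),
        }
    }

    /// Add `other` into `self` through a shared reference.
    /// Addition mod 2 is xor, so each limb update is a single atomic fetch_xor.
    /// Relaxed ordering is enough here: xor is commutative and associative,
    /// and the final read happens only after all writer threads are joined.
    fn add_limbs(&self, other: &[u64]) {
        for (target, &source) in self.limbs.iter().zip(other) {
            target.fetch_xor(source, Ordering::Relaxed);
        }
    }

    /// Read the limbs back out as plain integers.
    fn to_limbs(&self) -> Vec<u64> {
        self.limbs
            .iter()
            .map(|limb| limb.load(Ordering::Relaxed))
            .collect()
    }
}

/// Sum all `summands` into one vector, using one thread per summand.
fn parallel_sum(len: usize, summands: &[Vec<u64>]) -> Vec<u64> {
    let acc = AtomicF2Vec::new(len);
    // Scoped threads may borrow `acc` and `summands`; all are joined on exit.
    thread::scope(|s| {
        for summand in summands {
            let acc = &acc;
            s.spawn(move || acc.add_limbs(summand));
        }
    });
    acc.to_limbs()
}
```

The point of the sketch is that add_limbs takes &self rather than &mut self, so no lock is needed around the target vector. The same trick does not carry over to odd primes, where adding limbs needs a reduction step that is not a single atomic operation.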