Parallelizing micromagnetic computations using FFT in compute unified device architecture (CUDA)