Use a kernel thread to execute the command at spl0(), so that interrupts are not blocked for too long. Use 128-bit I/O when possible.