create copy of host array into device
create uninitialized T.sizeof * n array in device
create fat pointer from raw pointer and its length
dtor calling cuMemFree
Copying this object is disabled.
A postblit is present on this object, but not explicitly documented in the source.
fat pointer in CUDA