Fix handling of intermediate dimensions in the Linear layer hook.
Add per-Sequential estimation of the number of parameters.
Fix the sample failing to run without a GPU.
Clarify output in the sample.
Allocate temporary blobs on the same device as the model's parameters.
Add 1D operators: batch norm, pooling, convolution.
Add the ability to write the extended report to any output stream.
Add new operations: Conv3d, BatchNorm3d, MaxPool3d, AvgPool3d, ConvTranspose2d.
Add some results for widely used models to the README.
Initial release with basic functionality.