TensorLib is a simple library for tensor operations, similar to PyTorch. The library supports both CPU and GPU tensors; operations are implemented with OpenMP/OpenBLAS on the CPU and CUDA/cuBLAS on the GPU. The library also supports automatic differentiation, similar to PyTorch's autograd.
The library is structured as follows (see under `include/` and `src/`):

- `Tensor`: Base class for all tensors.
- `AutoGrad`: Class for automatic differentiation.
- `Operations`: Header for all the supported tensor operations.
To use the library in C++, include the header `#include <tensorlib/tensorlib.hpp>` and link the necessary libraries. To use the library in Python, simply `import tensorlib`.
First run

```sh
sh build.sh
```

which creates a `build` directory containing the `.so` file. For Python, set the `PYTHONPATH` to the build directory:

```sh
export PYTHONPATH=$(pwd)/build:$PYTHONPATH
```

Afterwards, you can use the library with `import tensorlib`; see example.py.
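A minimal usage sketch under these assumptions (only `tensorlib.ones`, the arithmetic operators, and `.shape` from the examples in this README are used; the tuple shape argument is an assumption, see example.py for the full interface):

```python
import tensorlib

# Create two tensors of ones (shape argument assumed to accept a tuple).
a = tensorlib.ones((2, 3))
b = tensorlib.ones((2, 3))

c = a + b          # element-wise addition
print(c.shape)     # expected: (2, 3)
```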
For any C++ file, simply link the necessary libraries, for example:

```sh
nvcc example/example.cpp -Iinclude/ -Lbuild/ -ltensorlib_cpp -lopenblas -lcudart -lcublas -o example/example
```

To run the example:

```sh
./example/example
```

See example.cpp for a sample use case of TensorLib.
- `Tensor`: Base class for all tensors.
- `AutoGrad`: Class for automatic differentiation.
All operations are supported for both CPU and GPU tensors.
- `add`: Element-wise addition, `a + b`.
- `sub`: Element-wise subtraction, `a - b`.
- `mul`: Element-wise multiplication, `a * b`.
- `div`: Element-wise division, `a / b`.
All these operations support broadcasting: they can be applied to tensors of different shapes, as long as the shapes are compatible (where a dimension is 1, the tensor is broadcast along that dimension).
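For illustration, a hedged sketch of broadcasting (tensor construction via `tensorlib.ones` with a tuple shape is an assumption based on the autograd example below):

```python
import tensorlib

a = tensorlib.ones((2, 3))
b = tensorlib.ones((1, 3))   # dimension of size 1 is broadcast

c = a + b                    # result has shape (2, 3)
d = a * b                    # broadcasting applies to all element-wise operations
```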
- `matmul`: Matrix multiplication, `a @ b` or `matmul(a, b)`.
- `transpose`: Transpose of a tensor, `a.T` or `transpose(a)`.
These operations require the tensors to be of compatible shapes (i.e., the number of columns of the first tensor must equal the number of rows of the second tensor), and the tensors must be 2- or 3-dimensional.
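For example, a shape sketch using the documented `@` and `.T` syntax (construction via `tensorlib.ones` assumed as above):

```python
import tensorlib

a = tensorlib.ones((2, 3))
b = tensorlib.ones((3, 4))

c = a @ b      # (2, 3) @ (3, 4) -> shape (2, 4)
d = c.T        # transpose -> shape (4, 2)
```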
- `sum`: Sum of the elements of a tensor along a given axis, `sum(a, axis)`.
- `mean`: Mean of the elements of a tensor along a given axis, `mean(a, axis)`.
- `max`: Maximum of the elements of a tensor along a given axis, `max(a, axis)`.
- `min`: Minimum of the elements of a tensor along a given axis, `min(a, axis)`.
These operations reduce the tensor along the given axis.
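A hedged sketch of the reductions (whether the functional forms are exposed at module level, e.g. `tensorlib.sum`, and whether the reduced axis is removed or kept as size 1 are assumptions; consult example.py):

```python
import tensorlib

a = tensorlib.ones((2, 3))

s = tensorlib.sum(a, 0)    # sum along axis 0
m = tensorlib.mean(a, 1)   # mean along axis 1
```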
- `log`: Element-wise natural logarithm, `log(a)`.
- `exp`: Element-wise exponential function, `exp(a)`.
- `relu`: ReLU activation function, `relu(a)`.
- `select_idx`: Select a subset of elements from a tensor, `a[index]` or `select_idx(a, index)`.
- `reshape`: Reshape a tensor to a given shape, `reshape(a, shape)`.
- `flatten`: Flatten a tensor, `flatten(a)`.
- `broadcast_to`: Broadcast a tensor to a given shape, `broadcast_to(a, shape)`.
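A hedged sketch of a few of these operations (module-level spellings such as `tensorlib.reshape` are assumptions based on the functional forms listed above):

```python
import tensorlib

a = tensorlib.ones((2, 3))

b = tensorlib.reshape(a, (3, 2))     # same data, new shape (3, 2)
c = tensorlib.flatten(a)             # shape (6,)
d = tensorlib.log(tensorlib.exp(a))  # element-wise; equal to a up to floating-point error
```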
The autograd system is structured similarly to how PyTorch autograd works, as described in the PyTorch documentation.
While constructing the computational graph, the user can specify the gradient of the output tensor with respect to the input tensor. Tensors that are not the result of an operation are the leaf nodes of the computational graph. (The figures below are taken from the PyTorch documentation.)
As the user performs operations on tensors, the computational graph is built up: each new `grad_fn`, which represents the gradient function of that operation, is connected to its input tensors.
In the example C++ code, the computational graph is as follows:
When the user calls the `backward` function, the gradient of the output tensor with respect to each input tensor is computed using backpropagation, and the resulting gradients are stored on the input tensors. If the tensor on which `backward` is called is not a scalar, the user can manually specify the gradient of the output tensor with respect to the input tensor.
For example, the gradient can be specified as follows, similar to how it is done in PyTorch:

```python
grad_output = tensorlib.ones(output.shape)
output.backward(grad_output)
```
The computational graph is then used to compute the gradient of the output tensor with respect to the input tensor using backpropagation and the chain rule.
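A small end-to-end sketch under the same assumptions (gradient tracking for tensors created with `tensorlib.ones`, module-level `tensorlib.relu`, and a PyTorch-style `.grad` attribute are assumptions; see example.py for the actual API):

```python
import tensorlib

a = tensorlib.ones((2, 2))
b = tensorlib.ones((2, 2))

out = tensorlib.relu(a @ b)              # builds the graph: matmul -> relu

grad_output = tensorlib.ones(out.shape)
out.backward(grad_output)                # backpropagates through relu and matmul

# The gradients are now stored on the leaf tensors a and b
# (shown here via a hypothetical .grad attribute, mirroring PyTorch).
print(a.grad)
```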
The project also includes a simple neural network implementation. The network is implemented in mlp_iris.py and is trained on the Iris dataset. It has an input size of 4, a hidden size of 5, and a depth of 2. The training results are as follows:
```
Epoch 0, Loss: 2.8286805152893066
Epoch 50, Loss: 0.6931464672088623
Training completed.
```
More detailed experiments are located in example/Experiment.md. Overall, the library achieves performance comparable to PyTorch on these experiments.
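For reference, a minimal sketch of a single forward/backward pass with the layer sizes described above (all-ones weights, the 3-class output size, and module-level `tensorlib.relu` are illustrative assumptions; see mlp_iris.py for the actual implementation):

```python
import tensorlib

x  = tensorlib.ones((1, 4))    # one Iris sample: input size 4
w1 = tensorlib.ones((4, 5))    # hidden size 5 (placeholder weights)
w2 = tensorlib.ones((5, 3))    # 3 output classes (assumption)

h = tensorlib.relu(x @ w1)     # hidden layer
y = h @ w2                     # output logits, shape (1, 3)

y.backward(tensorlib.ones(y.shape))   # backpropagate an all-ones gradient
```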
- `include/`: Header files.
- `src/`: Source files.
- `example/`: Example files.
- `build/`: Build directory (created by `build.sh`).
- `build.sh`: Build script.
- CMake, for building the project.
- Python, for running the Python code.
- OpenMP, for parallelizing the CPU code.
- OpenBLAS, for CPU implementation of many matrix operations.
- CUDA, for GPU implementation of many operations.
- cuBLAS, for GPU implementation of many matrix operations.
- Implement computational graph.
- Implement backprop by extending computational graph.
- Fix memory leaks due to circular dependencies (`shared_ptr`).
- CPU implementation for most tensor functions.
- GPU implementation for most tensor functions.
- Simple neural network implementation.