Tenstorrent QuietBox RISC-V Blackhole node
We have a single Tenstorrent QuietBox node with 4x Blackhole cards.
The host system is a single AMD 8124P 16-core processor with 512GB RAM.
It is a highly capable system for AI inferencing.
Accessing the Tenstorrent system
To access this system you need to join the do023 project, and then ssh tenstorrent from a login node.
Usage of the Tenstorrent system
Most tenstorrent commands start with tt. To access many of these, you will need to source /opt/venv/tenstorrent/bin/activate which makes commands like tt-smi available.
To install an updated software stack as a user, you may be able to follow the instructions here (though this may require root access): https://docs.tenstorrent.com/getting-started/README.html
tt-metalalium
The tt-metalium library for c++ is installed on the machine, this is the low-level SDK for tenstorrent devices.
Include the relevant headers like so:
#include <tt-metalium/host_api.hpp>
#include <tt-metalium/device.hpp>
#include <tt-metalium/...>
Note: tt-metalium requires that your compiler use at least the C++20 standard. Example programs can be found here.
When planning the structure of your program, it can be helpful to know the architecture of the device.
The p150 has:
120 Tensix cores (each containing 3 compute cores connected to a compute engine, 2 data-mover cores, and dedicated SRAM)
32GB of whole-device DRAM
16 full RISC-V cores
In the previous generation of TT devices (wormhole), orchestration and memory movement was handled by the host CPU. The introduction of 16 RISC-V cores on the card itself allows orchestration to be done on-device, reducing the reliance on PCIe transfers. Most of the existing documentation for tt-metal assumes a wormhole device is being used, and therefore does not make use of the RISC-V cores.
Please try using the tools and give any feedback.
Tutorials
There are a number of tutorial, podcasts and lectures on github
Usergroup
If you would like to join a UK Tenstorrent research mailing list, please let us know. Primarily for discussion between users.