Machine Learning using PCIe networks
Compare, benchmark and optimise machine learning frameworks.
Dolphin PXH830 Adapter card
Machine learning is a popular topic in data science, and there exist many frameworks that can be used for training neural networks and running inference on them. However, many of these frameworks are still optimised only for a single machine and only for the CPU architecture. In this thesis we want to survey the available machine learning frameworks and find out which frameworks support which features. After the survey is complete, we want to try to modify one or more of the frameworks to run on Dolphin's PCIe networks, either with socket support or by using Dolphin's low-level SISCI API.
Analyse several popular machine learning frameworks (such as TensorFlow, Caffe, etc.) and find out which features, such as GPU acceleration and multi-machine support, the different frameworks support. Select one of the open-source frameworks and try to modify its communication system to enable multi-machine support on Dolphin's PCIe networks.
In-depth knowledge and understanding of optimising a machine learning framework.
Good understanding of low-level computer systems. The student should have completed INF3151 or equivalent. IN5050 is recommended as part of the degree.