An introduction to NPU hardware and its growing presence outside of mobile computing devices.
AMD Zen4 Threadripper PRO vs Intel Xeon-w9 For Science and Engineering
The performance improvement with the new Zen4 TrPRO over the Zen3 TrPRO is very impressive!
My first recommendation for a Scientific and Engineering workstation CPU would now be the AMD Zen4 architecture as either Zen4 Threadripper PRO or Zen4 EPYC for multi-socket systems.
Intel Ice Lake Xeon-W vs AMD TR Pro Compute Performance (HPL, HPCG, NAMD, Numpy)
The single socket version of Intel third generation Xeon SP is out, the Ice Lake Xeon W 33xx. This is a much better platform with faster large capacity 8 channel memory and PCIe v4 with plenty of lanes. The new Intel platform is very much like the AMD Threadripper Pro (single socket version of EPYC Rome) so this is the obvious comparison to make. Read on to see how the numerical computing testing went!
Intel Rocket Lake Compute Performance Results HPL HPCG NAMD and Numpy
The new Intel Rocket Lake CPUs have been officially released. There were numerous posts and reviews before the official release date of March 30 2021, but I haven’t seen anything about the numerical compute performance. I’ve had access to a Core-i9 11900KF 8-core CPU and have compared it with (my own) AMD 5800X system.
Intel oneAPI AI Analytics Toolkit — Introduction and Install with conda
I recently wrote a post introducing Intel oneAPI that included a simple installation guide of the Base Toolkit. In that post I promised a follow-up about the the oneAPI AI Analytics Toolkit. This is it! I’ll describe what it is and give recommendations for doing an install setup of the AI toolkits using conda with Anaconda Python.
Intel oneAPI Developer Tools — Introduction and Install
Intel oneAPI is a massive collection of very high quality developer tools, and, it’s free to use! In this post I’ll give you a little background on what oneAPI is and my recommendations for doing an install setup to get started exploring the collection of tool-kits.
How To Install TensorFlow 1.15 for NVIDIA RTX30 GPUs (without docker or CUDA install)
In this post I will show you how to install NVIDIA’s build of TensorFlow 1.15 into an Anaconda Python conda environment. This is the same TensorFlow 1.15 that you would have in the NGC docker container, but no docker install required and no local system CUDA install needed either.
NVIDIA (Computing Hardware) Company of the Decade!
It’s the end of the 2010’s and start of 2020’s. Time to reflect …
SC19 A look at the high end of HPC
The Super Computing conference annual US counterpart is always a great meeting. It’s a chance to see the trend and get sentiment for the highest performance end of computing. I have written up a few observations and provided a few interesting links for SC19.
Intel Xeon W-3175X and i9 9990XE Linpack and NAMD on Ubuntu 18.04
There are 2 recent Intel processors that are really strange, the Xeon W-3175X 28-core, and the Core i9 9990XE overclocked 14-core. I was able to get a little time in on the these processors. I ran a couple of numerical compute performance tests with the Intel MKL Linpack benchmark and NAMD. I used the same system image that I had used recently to look at 3 Intel 8-core processors so I will include those results here as well. **There will be results for W-3175, 9990XE, 9800X, W-2145, and 9900K**.