Figure 1 - available via license: Creative Commons Attribution 4.0 International
Content may be subject to copyright.
Single core comparison baselined against StarFive VisionFive V2 running in double precision (FP64), against V1 and SG2042
Source publication
The Sophon SG2042 is the world's first commodity 64-core RISC-V CPU for high performance workloads and an important question is whether the SG2042 has the potential to encourage the HPC community to embrace RISC-V. In this paper we undertaking a performance exploration of the SG2042 against existing RISC-V hardware and high performance x86 CPUs in...
Contexts in source publication
Context 1
... only RV64GC is provided by the SiFive U74, and consequently there is no support for the RISC-V vector extension. Figure 1 reports a performance comparison of the VisionFive V2 and V1 against the SG2042 for double (FP64) and single (FP32) precision. The numbers reported are relative to the performance of the V2 running the RAJAPerf benchmark suite at double precision as a baseline. ...
Context 2
... can be seen from Figure 1 that a single C920 core of the SG2042 outperforms the U74 core of the V2 and V1 at both double and single precision. At double precision the C920 core delivers on average between 4.3 and 6.5 times the performance achieved when running at double precision on the U74 in the V2. ...
Context 3
... this means that we leverage vectorisation on the SG2042's C920, whereas this is not supported by the U74 and hence unavailable on the V1 or V2. As described in Section 2.1, the documentation is conflicting around whether the C920 provides FP64 vectorisation, however the results in Figure 1 demonstrate a noteworthy performance difference between FP32 and FP64 on the SG2042, suggesting that in-fact FP64 is not supported by C920 vector operations. By comparison, the performance difference between running double and single precision on the V2 is far less. ...
Context 4
... aspect of the results in Figure 1 that surprised us is how much slower the VisionFive V1 is than the V2. Considering that we are running RAJAPerf over a single core only, so the dual vs quad core nature of these machines does not matter, and that they both contain the same U74 core then one would assume that performance should be fairly comparable. ...
Context 5
... can be seen from Figure 1 that the performance obtained by a single C920 core of the SG2042 is impressive compared to existing, publicly available, commodity RISC-V hardware. This core is described by T-Head as a high performance RISC-V processor, and the benchmarking results reported in this section demonstrate that it delivers a large improvement in performance across the entire benchmarking suite against the U74 which would previously have been considered the best choice of widely available RISC-V CPU to experiment with HPC workloads upon. ...
Similar publications
Whilst RISC-V has grown phenomenally quickly in embedded computing, it is yet to gain significant traction in High Performance Computing (HPC). However, as we move further into the exascale era, the flexibility offered by RISC-V has the potential to be very beneficial in future supercomputers especially as the community places an increased emphasis...
With an increasing need for enormous amount of computing power, different approaches for building inexpensive supercomputers have emerged. Decreasing costs and continuous technology scaling has led to the emergence of computers with ARM processors which are considered as a possible alternative to traditional x86 processors for creating High Perform...
The increasing use and cost of high performance computing (HPC) requires new easy-to-use tools to enable HPC users and HPC systems engineers to transparently understand the utilization of resources. The MIT Lincoln Laboratory Supercomputing Center (LLSC) has developed a simple command, LLload, to monitor and characterize HPC workloads. LLload plays...
Benchmarks are essential in the design of modern HPC installations, as they define key aspects of system components. Beyond synthetic workloads, it is crucial to include real applications that represent user requirements into benchmark suites, to guarantee high usability and widespread adoption of a new system. Given the significant investments in...
Transitioning into the next generation of supercomputing resources, we’re faced with expanding user bases and diverse workloads, increasing the demand for improved security measures and deeper software compartmentalization. This is especially pertinent for virtualization, a key cloud computing component that’s at risk from attacks due to hypervisor...