Design And Implementation Of Write-Efficient Linear Algebra Subprograms

Posted on:2021-01-16

Degree:Master

Type:Thesis

Country:China

Candidate:Y L Y Ou

Full Text:PDF

GTID:2518306107450144

Subject:Computer technology

Abstract/Summary:

New non-volatile memories are playing an increasingly important role, and Intel Optane series memory has officially been put into use. The new non-volatile memories offer low latency, low energy consumption, persistent storage, high storage density, and byte addressability. These characteristics allow non-volatile memories to be used as main memory. However, an important feature of the new non-volatile memories is read-write asymmetry: a write operation costs more time and energy than a read operation. Linear algebra, a cornerstone of mathematics, is also widely used in computer science, including high-performance computing, graph computing, and deep learning. Open-source Basic Linear Algebra Subprograms (BLAS) implementations and commercial linear algebra libraries such as Intel's Math Kernel Library are optimized for algorithms on symmetric memory. These libraries focus on parallel and distributed scaling and were not optimized to reduce communication, especially write communication. The write-optimal matrix multiplication (WO-MM) algorithm is an improved algorithm based on the communication-avoiding algorithm and the write-efficient algorithm. Its most significant feature is that it reduces read and write overheads simultaneously. The implementation of the WO-MM algorithm successively overcomes hardware and software issues such as parallelization and vectorization. The main subprograms include matrix multiplication, a triangular system solver, Cholesky factorization, LU (lower-upper) factorization, and matrix inversion. In the experiments, the matrix multiplication algorithm reduces execution time by more than 50% compared with the traditional algorithm. For the other subprograms, the time reduction is generally 25%.
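
To make the write-asymmetry argument concrete, the following is a minimal Python sketch of the general write-avoiding idea, not the thesis's WO-MM algorithm, whose details the abstract does not give. It assumes the output matrix lives in write-expensive memory and that a small accumulator block fits in fast memory. The function names, the `block` parameter, and the write counter are all hypothetical and exist only for illustration.

```python
def naive_matmul(a, b):
    """Textbook multiply that updates c in place on every inner step.

    Returns (c, writes); writes counts element writes to c, which stands
    in for the write-expensive non-volatile memory (an assumption).
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]  # initial allocation is not counted
    writes = 0
    for i in range(n):
        for p in range(k):
            for j in range(m):
                c[i][j] += a[i][p] * b[p][j]
                writes += 1
    return c, writes


def blocked_matmul_write_once(a, b, block=2):
    """Blocked multiply that accumulates each output block in a fast buffer.

    Each element of c is written exactly once, after its block has been
    fully accumulated. Returns (c, writes) with the same counting rule.
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]  # initial allocation is not counted
    writes = 0
    for i0 in range(0, n, block):
        for j0 in range(0, m, block):
            i1, j1 = min(i0 + block, n), min(j0 + block, m)
            # Accumulator assumed to live in fast, symmetric memory.
            acc = [[0.0] * (j1 - j0) for _ in range(i1 - i0)]
            for p0 in range(0, k, block):
                p1 = min(p0 + block, k)
                for i in range(i0, i1):
                    for p in range(p0, p1):
                        a_ip = a[i][p]
                        for j in range(j0, j1):
                            acc[i - i0][j - j0] += a_ip * b[p][j]
            # Single write-back of the finished block to slow memory.
            for i in range(i0, i1):
                for j in range(j0, j1):
                    c[i][j] = acc[i - i0][j - j0]
                    writes += 1
    return c, writes


if __name__ == "__main__":
    a = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    b = [[1, 0, 2], [0, 1, 0], [3, 0, 1]]
    c_naive, w_naive = naive_matmul(a, b)
    c_block, w_block = blocked_matmul_write_once(a, b)
    print(c_naive == c_block, w_naive, w_block)
```

Under this counting rule the naive loop performs n*m*k writes to the output, while the blocked version performs n*m. Both read the inputs a comparable number of times, so the saving comes entirely from the write side, which is where the abstract locates the cost on non-volatile memory.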

Keywords/Search Tags:

Non-volatile Memory, Asymmetric Memory Model, Write-efficient Algorithm, Basic Linear Algebra Subprograms

Related items

1	Research On LSM-tree Optimization Technology Based On Non-Volatile Memory
2	Research On New Indexing Technologies For Non-Volatile Memory
3	Research On Energy-Efficient Hybrid Main Memory Based On Non-Volatile Memory
4	Exploring Energy Optimization For Non-Volatile Memory
5	An Efficient Cache System For Hybrid Memory
6	Exploring The Durability And Security Issues Of NVM-based Main Memory In Mobile Devices
7	Energy-Efficient Management For Memory System Based On Access Patterns Of Applications
8	Efficient Mechanisms For Supporting Crash Consistency In Persistent Memory Systems
9	Research On NVM Based Main Memory Key Technology
10	Research On Key Techniques Of Hybrid Memory Management For Big-Data Application