Investigation of the performance of LU decomposition method using CUDA

Ozcan, Caner; Sen, Baha

Investigation of the performance of LU decomposition method using CUDA

dc.contributor.author	Ozcan, Caner
dc.contributor.author	Sen, Baha
dc.date.accessioned	2024-09-29T16:00:37Z
dc.date.available	2024-09-29T16:00:37Z
dc.date.issued	2012
dc.department	Karabük Üniversitesi	en_US
dc.description	1st World Conference on Innovation and Software Development (INSODE) -- OCT 02-10, 2011 -- Bahcesehir Univ, Istanbul, TURKEY	en_US
dc.description.abstract	In recent years, parallel processing has been widely used in the computer industry. Software developers, have to deal with parallel computing platforms and technologies to provide novel and rich experiences. We present a novel algorithm to solve dense linear systems using Compute Unified Device Architecture (CUDA). High-level linear algebra operations require intensive computation. In this study Graphics Processing Units (GPU) accelerated implementation of LU linear algebra routine is implemented. LU decomposition is a decomposition of the form A=LU where A is a square matrix. The main idea of the LU decomposition is to record the steps used in Gaussian elimination on A in the places where the zero is produced. L and U are lower and upper triangular matrices respectively. This means that L has only zeros above the diagonal and U has only zeros below the diagonal. We have worked to increase performance with proper data representation and reducing row operations on GPU. Because of the high arithmetic throughput of GPUs, initial results from experiments promised a bright future for GPU computing. It has been shown useful for scientific computations. GPUs have high memory bandwidth and more floating point units as compared to the CPU. We have tried our study on different systems that have different GPUs and CPUs. The computation studies were also evaluated for different linear systems. When we compared the results obtained from both systems, a better performance was obtained with GPU computing. According to results, GPU computation approximately worked 3 times faster than the CPU computation. Our implementation provides significant performance improvement so we can easily use it to solve dense linear system. (C) 2011 Published by Elsevier Ltd.	en_US
dc.identifier.doi	10.1016/j.protcy.2012.02.011
dc.identifier.endpage	54	en_US
dc.identifier.issn	2212-0173
dc.identifier.startpage	50	en_US
dc.identifier.uri	https://doi.org/10.1016/j.protcy.2012.02.011
dc.identifier.uri	https://hdl.handle.net/20.500.14619/5238
dc.identifier.volume	1	en_US
dc.identifier.wos	WOS:000318909900009	en_US
dc.identifier.wosquality	N/A	en_US
dc.indekslendigikaynak	Web of Science	en_US
dc.language.iso	en	en_US
dc.publisher	Elsevier Science Bv	en_US
dc.relation.ispartof	First World Conference On Innovation and Computer Sciences (Insode 2011)	en_US
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	LU decomposition	en_US
dc.subject	dense linear systems	en_US
dc.subject	gpu computing	en_US
dc.subject	cuda	en_US
dc.title	Investigation of the performance of LU decomposition method using CUDA	en_US
dc.type	Conference Object	en_US

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu

Investigation of the performance of LU decomposition method using CUDA

Dosyalar

Koleksiyon