Accelerating MATLAB with GPU Computing: A Primer with Examples

Ebook420 pages3 hours

Accelerating MATLAB with GPU Computing: A Primer with Examples

Name: Accelerating MATLAB with GPU Computing: A Primer with Examples
Brand: Elsevier Science
Rating: 3.0 (1 reviews)

By Jung W. Suh and Youngmin Kim

Rating: 3 out of 5 stars

3/5

()

Read preview

About this ebook

Beyond simulation and algorithm development, many developers increasingly use MATLAB even for product deployment in computationally heavy fields. This often demands that MATLAB codes run faster by leveraging the distributed parallelism of Graphics Processing Units (GPUs). While MATLAB successfully provides high-level functions as a simulation tool for rapid prototyping, the underlying details and knowledge needed for utilizing GPUs make MATLAB users hesitate to step into it. Accelerating MATLAB with GPUs offers a primer on bridging this gap.

Starting with the basics, setting up MATLAB for CUDA (in Windows, Linux and Mac OS X) and profiling, it then guides users through advanced topics such as CUDA libraries. The authors share their experience developing algorithms using MATLAB, C++ and GPUs for huge datasets, modifying MATLAB codes to better utilize the computational power of GPUs, and integrating them into commercial software products. Throughout the book, they demonstrate many example codes that can be used as templates of C-MEX and CUDA codes for readers’ projects. Download example codes from the publisher's website: http://booksite.elsevier.com/9780124080805/

Shows how to accelerate MATLAB codes through the GPU for parallel processing, with minimal hardware knowledge
Explains the related background on hardware, architecture and programming for ease of use
Provides simple worked examples of MATLAB and CUDA C codes as well as templates that can be reused in real-world projects

Skip carousel

LanguageEnglish

PublisherElsevier Science

Release dateNov 18, 2013

ISBN9780124079168

Author

Jung W. Suh

Jung W. Suh is a senior algorithm engineer and research scientist at KLA-Tencor. Dr. Suh received his Ph.D. from Virginia Tech in 2007 for his 3D medical image processing work. He was involved in the development of MPEG-4 and Digital Mobile Broadcasting (DMB) systems in Samsung Electronics. He was a senior scientist at HeartFlow, Inc., prior to joining KLA-Tencor. His research interests are in the fields of biomedical image processing, pattern recognition, machine learning and image/video compression. He has more than 30 journal and conference papers and 6 patents.

Related authors

Skip carousel

Related to Accelerating MATLAB with GPU Computing

Related ebooks

Skip carousel

CUDA Application Design and Development
Ebook
CUDA Application Design and Development
byRob Farber
Rating: 0 out of 5 stars
0 ratings
Practical Scientific Computing
Ebook
Practical Scientific Computing
byMuhammad Ali
Rating: 0 out of 5 stars
0 ratings
Using HPC for Computational Fluid Dynamics: A Guide to High Performance Computing for CFD Engineers
Ebook
Using HPC for Computational Fluid Dynamics: A Guide to High Performance Computing for CFD Engineers
byShamoon Jamshed
Rating: 0 out of 5 stars
0 ratings
Deploy Machine Learning Models to Production: With Flask, Streamlit, Docker, and Kubernetes on Google Cloud Platform
Ebook
Deploy Machine Learning Models to Production: With Flask, Streamlit, Docker, and Kubernetes on Google Cloud Platform
byPramod Singh
Rating: 0 out of 5 stars
0 ratings
Parallel Programming with OpenACC
Ebook
Parallel Programming with OpenACC
byRob Farber
Rating: 5 out of 5 stars
5/5
Build Your Own Distributed Compilation Cluster: A Practical Walkthrough
Ebook
Build Your Own Distributed Compilation Cluster: A Practical Walkthrough
byHunter Davis
Rating: 0 out of 5 stars
0 ratings
GPU Programming in MATLAB
Ebook
GPU Programming in MATLAB
byNikolaos Ploskas
Rating: 1 out of 5 stars
1/5
Practical MATLAB Deep Learning: A Project-Based Approach
Ebook
Practical MATLAB Deep Learning: A Project-Based Approach
byMichael Paluszek
Rating: 0 out of 5 stars
0 ratings
Introduction to Parallel Programming
Ebook
Introduction to Parallel Programming
bySteven Brawer
Rating: 0 out of 5 stars
0 ratings
Topological Data Structures for Surfaces: An Introduction to Geographical Information Science
Ebook
Topological Data Structures for Surfaces: An Introduction to Geographical Information Science
bySanjay Rana
Rating: 0 out of 5 stars
0 ratings
CUDA Fortran for Scientists and Engineers: Best Practices for Efficient CUDA Fortran Programming
Ebook
CUDA Fortran for Scientists and Engineers: Best Practices for Efficient CUDA Fortran Programming
byGregory Ruetsch
Rating: 0 out of 5 stars
0 ratings
Theories and Practices of Self-Driving Vehicles
Ebook
Theories and Practices of Self-Driving Vehicles
byQingguo Zhou
Rating: 0 out of 5 stars
0 ratings
Heterogeneous Computing with OpenCL 2.0
Ebook
Heterogeneous Computing with OpenCL 2.0
byDavid R. Kaeli
Rating: 0 out of 5 stars
0 ratings
Nature-Inspired Computation and Swarm Intelligence: Algorithms, Theory and Applications
Ebook
Nature-Inspired Computation and Swarm Intelligence: Algorithms, Theory and Applications
byXin-She Yang
Rating: 0 out of 5 stars
0 ratings
Boost.Asio C++ Network Programming Cookbook
Ebook
Boost.Asio C++ Network Programming Cookbook
byRadchuk Dmytro
Rating: 0 out of 5 stars
0 ratings
Advanced Python Development: Using Powerful Language Features in Real-World Applications
Ebook
Advanced Python Development: Using Powerful Language Features in Real-World Applications
byMatthew Wilkes
Rating: 0 out of 5 stars
0 ratings
CUDA Programming: A Developer's Guide to Parallel Computing with GPUs
Ebook
CUDA Programming: A Developer's Guide to Parallel Computing with GPUs
byShane Cook
Rating: 4 out of 5 stars
4/5
Exploring C++20: The Programmer's Introduction to C++
Ebook
Exploring C++20: The Programmer's Introduction to C++
byRay Lischner
Rating: 0 out of 5 stars
0 ratings
Objective-C Programming Nuts and bolts
Ebook
Objective-C Programming Nuts and bolts
byKeith Lee
Rating: 0 out of 5 stars
0 ratings
GPU Computing Gems Jade Edition
Ebook
GPU Computing Gems Jade Edition
byElsevier Books Reference
Rating: 5 out of 5 stars
5/5
Python How-To: 63 techniques to improve your Python code
Ebook
Python How-To: 63 techniques to improve your Python code
byYong Cui
Rating: 0 out of 5 stars
0 ratings
Optimizing Visual Studio Code for Python Development: Developing More Efficient and Effective Programs in Python
Ebook
Optimizing Visual Studio Code for Python Development: Developing More Efficient and Effective Programs in Python
bySufyan bin Uzayr
Rating: 0 out of 5 stars
0 ratings
Qt 5 Blueprints
Ebook
Qt 5 Blueprints
bySymeon Huang
Rating: 4 out of 5 stars
4/5
Mathematical Theory Of Rocket Flight
Ebook
Mathematical Theory Of Rocket Flight
byBarkley Rosser
Rating: 0 out of 5 stars
0 ratings
Shared Memory Application Programming: Concepts and Strategies in Multicore Application Programming
Ebook
Shared Memory Application Programming: Concepts and Strategies in Multicore Application Programming
byVictor Alessandrini
Rating: 0 out of 5 stars
0 ratings
GNU Octave Beginner's Guide
Ebook
GNU Octave Beginner's Guide
byJesper Schmidt Hansen
Rating: 3 out of 5 stars
3/5
Writing Compilers and Interpreters: A Software Engineering Approach
Ebook
Writing Compilers and Interpreters: A Software Engineering Approach
byRonald Mak
Rating: 3 out of 5 stars
3/5
Computer Programming and Architecture: The Vax
Ebook
Computer Programming and Architecture: The Vax
byHenry Levy
Rating: 0 out of 5 stars
0 ratings
Practical Neural Network Recipies in C++
Ebook
Practical Neural Network Recipies in C++
byMasters
Rating: 3 out of 5 stars
3/5
Computer Vision and Applications: A Guide for Students and Practitioners,Concise Edition
Ebook
Computer Vision and Applications: A Guide for Students and Practitioners,Concise Edition
byBernd Jahne
Rating: 5 out of 5 stars
5/5

Programming For You

Skip carousel

Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
Ebook
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
byKevin Clark
Rating: 5 out of 5 stars
5/5
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
Ebook
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
byTimothy C. Needham
Rating: 4 out of 5 stars
4/5
Python: Learn Python in 24 Hours
Ebook
Python: Learn Python in 24 Hours
byAlex Nordeen
Rating: 4 out of 5 stars
4/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code
Ebook
The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code
byJoseph Labrecque
Rating: 5 out of 5 stars
5/5
Python Machine Learning By Example
Ebook
Python Machine Learning By Example
byYuxi (Hayden) Liu
Rating: 4 out of 5 stars
4/5
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
Ebook
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
byi Code Academy
Rating: 5 out of 5 stars
5/5
Coding All-in-One For Dummies
Ebook
Coding All-in-One For Dummies
byNikhil Abraham
Rating: 4 out of 5 stars
4/5
Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning
Ebook
Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning
byAnthony Adams
Rating: 4 out of 5 stars
4/5
Java for Beginners: A Crash Course to Learn Java Programming in 1 Week
Ebook
Java for Beginners: A Crash Course to Learn Java Programming in 1 Week
byBrady Ellison
Rating: 5 out of 5 stars
5/5
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 0 out of 5 stars
0 ratings
Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps
Ebook
Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps
byJason Scotts
Rating: 4 out of 5 stars
4/5
Mastering Windows PowerShell Scripting
Ebook
Mastering Windows PowerShell Scripting
byBrenton J.W. Blawat
Rating: 4 out of 5 stars
4/5
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
Ebook
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
byMark Chan
Rating: 5 out of 5 stars
5/5
Grokking Algorithms: An illustrated guide for programmers and other curious people
Ebook
Grokking Algorithms: An illustrated guide for programmers and other curious people
byAditya Bhargava
Rating: 4 out of 5 stars
4/5
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
Ebook
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
byJames Tudor
Rating: 5 out of 5 stars
5/5
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Ebook
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
byWalter Shields
Rating: 4 out of 5 stars
4/5
HTML & CSS QuickStart Guide: The Simplified Beginners Guide to Developing a Strong Coding Foundation, Building Responsive Websites, and Mastering the Fundamentals of Modern Web Design
Ebook
HTML & CSS QuickStart Guide: The Simplified Beginners Guide to Developing a Strong Coding Foundation, Building Responsive Websites, and Mastering the Fundamentals of Modern Web Design
byDavid DuRocher
Rating: 4 out of 5 stars
4/5
HTML & CSS: Learn the Fundaments in 7 Days
Ebook
HTML & CSS: Learn the Fundaments in 7 Days
byMichael Knapp
Rating: 4 out of 5 stars
4/5
Learn JavaScript in 24 Hours
Ebook
Learn JavaScript in 24 Hours
byAlex Nordeen
Rating: 3 out of 5 stars
3/5
Learn SQL in 24 Hours
Ebook
Learn SQL in 24 Hours
byAlex Nordeen
Rating: 5 out of 5 stars
5/5
Modern C++ for Absolute Beginners: A Friendly Introduction to C++ Programming Language and C++11 to C++20 Standards
Ebook
Modern C++ for Absolute Beginners: A Friendly Introduction to C++ Programming Language and C++11 to C++20 Standards
bySlobodan Dmitrović
Rating: 0 out of 5 stars
0 ratings
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
Ebook
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
byGwendolyn Faraday
Rating: 5 out of 5 stars
5/5
The Absolute Beginner's Guide to Binary, Hex, Bits, and Bytes! How to Master Your Computer's Love Language
Ebook
The Absolute Beginner's Guide to Binary, Hex, Bits, and Bytes! How to Master Your Computer's Love Language
byGreg Perry
Rating: 5 out of 5 stars
5/5
SQL All-in-One For Dummies
Ebook
SQL All-in-One For Dummies
byAllen G. Taylor
Rating: 3 out of 5 stars
3/5
Problem Solving in C and Python: Programming Exercises and Solutions, Part 1
Ebook
Problem Solving in C and Python: Programming Exercises and Solutions, Part 1
byYana Kortsarts
Rating: 5 out of 5 stars
5/5
Python Essentials
Ebook
Python Essentials
bySteven F. Lott
Rating: 5 out of 5 stars
5/5
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
Ebook
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
byArthur T. Brooks
Rating: 0 out of 5 stars
0 ratings
HTML in 30 Pages
Ebook
HTML in 30 Pages
byU.Q. Magnusson
Rating: 5 out of 5 stars
5/5
TensorFlow in 1 Day: Make your own Neural Network
Ebook
TensorFlow in 1 Day: Make your own Neural Network
byKrishna Rungta
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

41. Bob Nystrom
Podcast episode
41. Bob Nystrom
byIt's All Widgets! Flutter Podcast
0 ratings
0% found this document useful
Computational Thinking & Learning Python During an AI Revolution
Podcast episode
Computational Thinking & Learning Python During an AI Revolution
byThe Real Python Podcast
0 ratings
0% found this document useful
Open Source TensorFlow with Yifei Feng: Yifei Feng, a TensorFlow software engineer, shares with Melanie and Mark about her work on the open source TensorFlow project and the tools she builds.
Podcast episode
Open Source TensorFlow with Yifei Feng: Yifei Feng, a TensorFlow software engineer, shares with Melanie and Mark about her work on the open source TensorFlow project and the tools she builds.
byGoogle Cloud Platform Podcast
100%
100% found this document useful
JIT Compilation and Exascale Computing with Hal Finkel: Rob and Jason are joined by Hal Finkel from the US Department of Energy. They first talk to Hal about the LLVM 13 release and why the release notes were lacking. Then they talk to Hal about his C++ JIT Proposal, the Clang prototype and how it could be...
Podcast episode
JIT Compilation and Exascale Computing with Hal Finkel: Rob and Jason are joined by Hal Finkel from the US Department of Energy. They first talk to Hal about the LLVM 13 release and why the release notes were lacking. Then they talk to Hal about his C++ JIT Proposal, the Clang prototype and how it could be...
byCppCast
0 ratings
0% found this document useful
#111 The Rise of the Julia Programming Language
Podcast episode
#111 The Rise of the Julia Programming Language
byDataFramed
0 ratings
0% found this document useful
Past, Present and Future of C++ with Bjarne Stroustrup: Rob and Jason are joined by Bjarne Stroustrup, designer and original implementer of C++ to discuss the current state of C++, his vision for the future as well as some discussion of the past. Bjarne Stroustrup is the designer and original implementer...
Podcast episode
Past, Present and Future of C++ with Bjarne Stroustrup: Rob and Jason are joined by Bjarne Stroustrup, designer and original implementer of C++ to discuss the current state of C++, his vision for the future as well as some discussion of the past. Bjarne Stroustrup is the designer and original implementer...
byCppCast
0 ratings
0% found this document useful
Graph Analytic Systems with Zachary Hanif - TWiML Talk #188: In this, the final episode of our Strata Data Conference series, we’re joined by Zachary Hanif, Director of Machine Learning at Capital One’s Center for Machine Learning. Zach led a session at Strata called “Network effects: Working with modern...
Podcast episode
Graph Analytic Systems with Zachary Hanif - TWiML Talk #188: In this, the final episode of our Strata Data Conference series, we’re joined by Zachary Hanif, Director of Machine Learning at Capital One’s Center for Machine Learning. Zach led a session at Strata called “Network effects: Working with modern...
byThe TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
0 ratings
0% found this document useful
Being Bayesian: This episode explores the root concept of what it is to be Bayesian: describing knowledge of a system probabilistically, having an appropriate prior probability, know how to weigh new evidence, and following Bayes's rule to compute the revised...
Podcast episode
Being Bayesian: This episode explores the root concept of what it is to be Bayesian: describing knowledge of a system probabilistically, having an appropriate prior probability, know how to weigh new evidence, and following Bayes's rule to compute the revised...
byData Skeptic
0 ratings
0% found this document useful
A Chaos Engineering & Jeli Sandwich with Nora Jones: Nora Jones is the founder and CEO at Jeli, makers of an incident analysis platform that leverages data to recommend productive solutions to the problems at hand. Before this role, she was Head of Chaos Engineering and Human Factors at Slack, a senior soft
Podcast episode
A Chaos Engineering & Jeli Sandwich with Nora Jones: Nora Jones is the founder and CEO at Jeli, makers of an incident analysis platform that leverages data to recommend productive solutions to the problems at hand. Before this role, she was Head of Chaos Engineering and Human Factors at Slack, a senior soft
byScreaming in the Cloud
0 ratings
0% found this document useful
Dart and Crafting Interpreters with Bob Nystrom: Rob and Jason are joined by Bob Nystrom from Google. They first discuss git commands explained via cats and an analysis of how Visual Studio 2022 could use all your RAM. Then they talk to Bob about some of the programming languages he's created, his...
Podcast episode
Dart and Crafting Interpreters with Bob Nystrom: Rob and Jason are joined by Bob Nystrom from Google. They first discuss git commands explained via cats and an analysis of how Visual Studio 2022 could use all your RAM. Then they talk to Bob about some of the programming languages he's created, his...
byCppCast
0 ratings
0% found this document useful
Make Database Performance Optimization A Playful Experience With Ottertune: An interview with Andy Pavlo about his work on Ottertune to automatically tune your database configuration for better performance.
Podcast episode
Make Database Performance Optimization A Playful Experience With Ottertune: An interview with Andy Pavlo about his work on Ottertune to automatically tune your database configuration for better performance.
byData Engineering Podcast
0 ratings
0% found this document useful
2: Pytest vs Unittest vs Nose: Choosing a test framework
Podcast episode
2: Pytest vs Unittest vs Nose: Choosing a test framework
byTest and Code
0 ratings
0% found this document useful
C# and IL2CPP with Josh Peterson: Rob and Jason are joined by Josh Peterson to talk about C# and some of the similarities and differences between the Managed language and C++, he also talks about his work at Unity 3D on IL2CPP. Josh is a programmer working at Unity Technologies, where...
Podcast episode
C# and IL2CPP with Josh Peterson: Rob and Jason are joined by Josh Peterson to talk about C# and some of the similarities and differences between the Managed language and C++, he also talks about his work at Unity 3D on IL2CPP. Josh is a programmer working at Unity Technologies, where...
byCppCast
0 ratings
0% found this document useful
One Shot and Metric Learning - Quadruplet Loss (Machine Learning Dojo)
Podcast episode
One Shot and Metric Learning - Quadruplet Loss (Machine Learning Dojo)
byMachine Learning Street Talk (MLST)
0 ratings
0% found this document useful
OpenTelemetry for Databases: Empowering DevOps through sqlcommenter with Nimesh Bhagat: Optimizing or debugging database calls has to become as easy as optimizing your application code based on logs, metrics or traces your observability platform provides to developers. It has to be doable by the development and DevOps teams who are...
Podcast episode
OpenTelemetry for Databases: Empowering DevOps through sqlcommenter with Nimesh Bhagat: Optimizing or debugging database calls has to become as easy as optimizing your application code based on logs, metrics or traces your observability platform provides to developers. It has to be doable by the development and DevOps teams who are...
byPurePerformance
0 ratings
0% found this document useful
Scalable Databases on Kubernetes
Podcast episode
Scalable Databases on Kubernetes
byThe Cloudcast
0 ratings
0% found this document useful
Powering your Copilot for Data – with Artem Keydunov of Cube.dev
Podcast episode
Powering your Copilot for Data – with Artem Keydunov of Cube.dev
byLatent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0
0 ratings
0% found this document useful
Eliminate The Overhead In Your Data Integration With The Open Source dlt Library: Cloud data warehouses and the introduction of the ELT paradigm has led to the creation of multiple options for flexible data integration, with a roughly equal distribution of commercial and open source options. The challenge is that most of those options are complex to operate and exist in their own silo. The dlt project was created to eliminate overhead and bring data integration into your full control as a library component of your overall data system. In this episode Adrian Brudaru explains how it works, the benefits that it provides over other data integration solutions, and how you can start building pipelines today.
Podcast episode
Eliminate The Overhead In Your Data Integration With The Open Source dlt Library: Cloud data warehouses and the introduction of the ELT paradigm has led to the creation of multiple options for flexible data integration, with a roughly equal distribution of commercial and open source options. The challenge is that most of those options are complex to operate and exist in their own silo. The dlt project was created to eliminate overhead and bring data integration into your full control as a library component of your overall data system. In this episode Adrian Brudaru explains how it works, the benefits that it provides over other data integration solutions, and how you can start building pipelines today.
byData Engineering Podcast
0 ratings
0% found this document useful
#08 - Tech stack: Metabase, Superset, Redash, Grafana
Podcast episode
#08 - Tech stack: Metabase, Superset, Redash, Grafana
byTOPP - The Open Podcast Podcast
0 ratings
0% found this document useful
How Data Platforms Affect ML & AI // Jake Watson // #207
Podcast episode
How Data Platforms Affect ML & AI // Jake Watson // #207
byMLOps.community
0 ratings
0% found this document useful
Building An Internal Database As A Service Platform At Cloudflare: Data persistence is one of the most challenging aspects of computer systems. In the era of the cloud most developers rely on hosted services to manage their databases, but what if you are a cloud service? In this episode Vignesh Ravichandran explains how his team at Cloudflare provides PostgreSQL as a service to their developers for low latency and high uptime services at global scale. This is an interesting and insightful look at pragmatic engineering for reliability and scale.
Podcast episode
Building An Internal Database As A Service Platform At Cloudflare: Data persistence is one of the most challenging aspects of computer systems. In the era of the cloud most developers rely on hosted services to manage their databases, but what if you are a cloud service? In this episode Vignesh Ravichandran explains how his team at Cloudflare provides PostgreSQL as a service to their developers for low latency and high uptime services at global scale. This is an interesting and insightful look at pragmatic engineering for reliability and scale.
byData Engineering Podcast
0 ratings
0% found this document useful
Potluck - How to Pick a Tech Stack × useEffect × setTimeout × Staying Focused: In this episode of Syntax, Wes and Scott answer your questions about picking the right tech stack, whether useEffect is still useful, benefit to use uses setTimeout, and more! Linode - Sponsor Whether you’re working on a personal project or...
Podcast episode
Potluck - How to Pick a Tech Stack × useEffect × setTimeout × Staying Focused: In this episode of Syntax, Wes and Scott answer your questions about picking the right tech stack, whether useEffect is still useful, benefit to use uses setTimeout, and more! Linode - Sponsor Whether you’re working on a personal project or...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts: We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs. Our system is motivated by the limitations of current frameworks, which restrict the dynamic routing in MoE layers to satisfy the constraints of existing softwar...
Podcast episode
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts: We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs. Our system is motivated by the limitations of current frameworks, which restrict the dynamic routing in MoE layers to satisfy the constraints of existing softwar...
byPapers Read on AI
0 ratings
0% found this document useful
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts: We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs. Our system is motivated by the limitations of current frameworks, which restrict the dynamic routing in MoE layers to satisfy the constraints of existing softwar...
Podcast episode
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts: We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs. Our system is motivated by the limitations of current frameworks, which restrict the dynamic routing in MoE layers to satisfy the constraints of existing softwar...
byPapers Read on AI
0 ratings
0% found this document useful
The State of Kubernetes
Podcast episode
The State of Kubernetes
byThe Cloudcast
0 ratings
0% found this document useful
081 Mastering Memory Aware .NET Software Development with Konrad Kokosa: The .NET Runtime – whether .NET Framework or .NET Core – provides many ways to optimize memory management. But they don’t come in the form of configuration switches as we know if from Java. While there are a handful of settings, the .NET Runtime...
Podcast episode
081 Mastering Memory Aware .NET Software Development with Konrad Kokosa: The .NET Runtime – whether .NET Framework or .NET Core – provides many ways to optimize memory management. But they don’t come in the form of configuration switches as we know if from Java. While there are a handful of settings, the .NET Runtime...
byPurePerformance
0 ratings
0% found this document useful
The Rise of MLOps - Theofilos Papapanagiotou
Podcast episode
The Rise of MLOps - Theofilos Papapanagiotou
byDataTalks.Club
0 ratings
0% found this document useful
MLOps Coffee Sessions #10 Analyzing the Article “Continuous Delivery and Automation Pipelines in Machine Learning" // Part 2
Podcast episode
MLOps Coffee Sessions #10 Analyzing the Article “Continuous Delivery and Automation Pipelines in Machine Learning" // Part 2
byMLOps.community
0 ratings
0% found this document useful
LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML
Podcast episode
LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML
byLatent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0
0 ratings
0% found this document useful
Hasty Treat - Desktop Apps + New Tech We Love: In this Hasty Treat, Scott and Wes talk about the hottest new tech they love! Linode - Sponsor Whether you’re working on a personal project or managing enterprise infrastructure, you deserve simple, affordable, and accessible cloud computing...
Podcast episode
Hasty Treat - Desktop Apps + New Tech We Love: In this Hasty Treat, Scott and Wes talk about the hottest new tech they love! Linode - Sponsor Whether you’re working on a personal project or managing enterprise infrastructure, you deserve simple, affordable, and accessible cloud computing...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful

Skip carousel

Fast And Easy Image Processing
Linux Format
Article
Fast And Easy Image Processing
Apr 4, 2023
Credit: https://github.com/oguzhaninan/korkut Shashank Sharma is a trial lawyer in New Delhi and an avid Arch user. He’s been writing about open source software for 20 years and lawyering for 10. You wouldn’t think of the command line as the go-to re
4 min read
Tensor Flow 101
APC
Article
Tensor Flow 101
Jan 27, 2020
4 min read
Why There Are No Nuclear Airplanes
The Atlantic
Article
Why There Are No Nuclear Airplanes
Jan 20, 2019
5 min read
Mucking About With AI
APC
Article
Mucking About With AI
May 22, 2023
2 min read
Why ‘Killer Robots’ Are Becoming a Real Threat – and an Ethics Test
The Christian Science Monitor
Article
Why ‘Killer Robots’ Are Becoming a Real Threat – and an Ethics Test
Aug 31, 2017
Dozens of CEOs from firms with a hand in artificial intelligence have issued a joint warning that autonomous weapons pose the risk of warfare that is cheaper, and can occur faster, than ever.
5 min read
Why Are We Stuck With M.2 When U.2 Is So Much Better?
APC
Article
Why Are We Stuck With M.2 When U.2 Is So Much Better?
May 22, 2023
4 min read
Scikit-Learn: The Ultimate Python Library
APC
Article
Scikit-Learn: The Ultimate Python Library
Jul 15, 2019
4 min read
QEMU, KVM And The Other Ones
Linux Format
Article
QEMU, KVM And The Other Ones
Feb 9, 2021
4 min read
Boost Your 3d Skills With Our Biggest Ever Tips Guide!
3D World
Article
Boost Your 3d Skills With Our Biggest Ever Tips Guide!
Jul 17, 2019
2 min read
Open Source Processors
Linux Format
Article
Open Source Processors
Jun 2, 2020
8 min read
Rokoko Studio 2.0
3D World
Article
Rokoko Studio 2.0
Feb 23, 2021
1 min read
AI See You…
Linux Format
Article
AI See You…
Jun 27, 2023
5 min read
Computer-aided Design
Linux Format
Article
Computer-aided Design
Jul 25, 2023
13 min read
Observability Of The Kernel And Containers
Linux Format
Article
Observability Of The Kernel And Containers
Apr 4, 2023
Mihalis Tsoukalos is currently working on Time Series. You can reach him at: @mactsouk. For our final delve into eBPF, we’re tackling applications, the kernel and Docker containers. At the end of the day, all Linux machines execute code for applicat
10 min read
How To Simulate Electronic Circuits
Linux Format
Article
How To Simulate Electronic Circuits
Jun 2, 2020
9 min read
Text Docs To Rich Docs
Linux Format
Article
Text Docs To Rich Docs
Dec 17, 2019
6 min read
Tweak And Tune Your Own Kernel Scheduler
Linux Format
Article
Tweak And Tune Your Own Kernel Scheduler
Nov 14, 2023
SCHEDTOOL Credit: https://github.com/freequaos/schedtool OUR EXPERT QUICK TIP The first time you compile your own kernel, prepare the disk for handling up to 12GB of new data. Also reserve a good chunk of time and your favourite brew. A compile runs
11 min read
Build A Static Analysis Development Pipeline
Linux Format
Article
Build A Static Analysis Development Pipeline
Jul 27, 2021
9 min read
Overall Usefulness
Linux Format
Article
Overall Usefulness
Sep 22, 2020
3 min read
RapidWeaver Classic
MacFormat
Article
RapidWeaver Classic
Aug 23, 2022
3 min read
Fast And Easy Image Processing
APC
Article
Fast And Easy Image Processing
Jun 19, 2023
5 min read
MacOS
MacFormat
Article
MacOS
Nov 19, 2019
Can I update Adobe App Manager to 64-bit? > Not directly, but you should ensure that all other Adobe apps, Acrobat Reader in particular, are 64-bit. When you upgrade to Catalina, old components like this won’t run, are disabled, and easy to remove. W
3 min read
The Verdict Static Site Generators
Linux Format
Article
The Verdict Static Site Generators
Oct 19, 2021
2 min read
Alternative Circuit Simulators
Linux Format
Article
Alternative Circuit Simulators
Jun 2, 2020
You may find the way gEDA works to be a little clumsy, especially if you find the command-line cumbersome. An alternative solution is Qucs-S. This project started as an independent project, but some designers then forked it to include Ngspice, XYCE a
1 min read
How To Add EBPF Observability To Your Project
Techfastly
Article
How To Add EBPF Observability To Your Project
Apr 1, 2022
3 min read
FLASK Web Frameworks
Linux Format
Article
FLASK Web Frameworks
Jun 4, 2019
The main focus of Python has always been to get you cracking on with your coding – the language was never made for web programming. However, this has just made it more interesting to extend the language for the web, or to create an interface to web-b
9 min read
Exploring The Hyper Interface
Linux Format
Article
Exploring The Hyper Interface
Jul 2, 2019
12 min read
Adding Libraries
Linux Format
Article
Adding Libraries
Jul 28, 2020
In both your compiler settings and the Makefile you can choose where to look for supporting libraries. Your IDE can sometimes write this file for you. However, when you add a framework or library to your project, you have to import it yourself. You u
1 min read
One-day Projects To Improve Your Business Network
PC Pro Magazine
Article
One-day Projects To Improve Your Business Network
Apr 10, 2022
8 min read
Chatty AI man
Linux Format
Article
Chatty AI man
Jun 27, 2023
At the time of writing, the official way to get access to the model data involves visiting A https://bit.ly/lxf304form and filling out the form. Sadly, practical experience teaches that non-edu email addresses usually don’t get a positive result. The
4 min read

Related categories

Skip carousel

Reviews for Accelerating MATLAB with GPU Computing

Rating: 3 out of 5 stars

3/5

1 rating0 reviews

Book preview

Accelerating MATLAB with GPU Computing - Jung W. Suh

Preface

MATLAB is a widely used simulation tool for rapid prototyping and algorithm development. Many laboratories and research institutions face growing demands to run their MATLAB codes faster for computationally heavy projects after simple simulations. Since MATLAB uses a vector/matrix representation of data, which is suitable for parallel processing, it can benefit a lot from GPU acceleration.

Target Readers and Contents

This book is aimed primarily at the graduate students and researchers in the field of engineering, science, and technology who need huge data processing without losing the many benefits of MATLAB. However, MATLAB users come from various backgrounds and do not necessarily have much programming experience. For those whose backgrounds are not from programming, GPU acceleration for MATLAB may distract their algorithm development and introduce unnecessary hassles, even when setting the environment. This book targets the readers who have some or a lot of experience on MATLAB coding but not enough depth in either C coding or the computer architecture for parallelization. So readers can focus more on their research and work by avoiding non-algorithmic hassles in using GPU and CUDA in MATLAB.

As a primer, the book will start with the basics, walking through the process of setting MATLAB for CUDA (in Windows and Mac OSX), creating c-mex and m-file profiling, then guide the users through the expert-level topics such as third-party CUDA libraries. It also provides many practical ways to modify users’ MATLAB codes to better utilize the immense computational power of graphics processors.

This book guides the reader to dramatically maximize the MATLAB speed using NVIDIA’s Graphics Processing Unit (GPU). NVIDIA’s Compute Unified Device Architecture (CUDA) is a parallel computing architecture originally designed for computer games but is getting a reputation in the general science and technology fields for its efficient massive computation power. From this book, the reader can take advantage of the parallel processing power of GPU and abundant CUDA scientific libraries for accelerating MATLAB code with no or less effort and time, and bring readers’ researches and works to a higher level.

Directions of this Book

GPU Utilization Using c-mex Versus Parallel Computing Toolbox

This book deals with Mathworks’s Parallel Computing Toolbox in Chapter 5. Although Mathworks’s Parallel Computing Toolbox is a useful tool for speeding up MATLAB, the current version still has its limitation in making the Parallel Computing Toolbox a general speeding-up solution, in addition to the extra cost of purchasing the toolbox. Especially, since the Parallel Computing Toolbox targets distributed computing over multicore, multiple computers and/or cluster machines as well as GPU processing, GPU optimization for speeding up the user’s code is comparatively limited both in speeding-up and supporting MATLAB functions. Furthermore, if we limit to Mathworks’s the Parallel Computing Toolbox only, then it is difficult to find an efficient way to utilize the abundant CUDA libraries to their maximum. In this book, we address both the strengths and the limitations of the current Parallel Computing Toolbox in Chapter 5. For the purpose of general speeding up, GPU-utilization through c-mex proves a better approach and provides more flexibility in current situation.

Tutorial Approach Versus Case Study Approach

As the book’s title says, we take more of a tutorial approach. MATLAB users may come from many different backgrounds, and web resources are scattered over Mathworks, NVIDIA, and private blogs as fragmented information. The tutorial approach from setting the GPU environment to acquiring critical (but compressed) hardware knowledge for GPU would be beneficial to prospective readers over a wide spectrum. However, this book also has two chapters (Chapters 7 and 8) that include case examples with working codes.

CUDA Versus OpenCL

When we prepared the proposal of this book, we also considered OpenCL as a topic, because the inclusion of OpenCL would attract a wider range of readers. However, while CUDA is more consistent and stable, because it is solely driven by NVIDIA, the current OpenCL has no unified development environment and is still unstable in some areas, because OpenCL is not governed by one company or institution. For this reason, installing, profiling, and debugging OpenCL are not yet standardized. As a primer, this may distract the focus of this book. More importantly, for some reason Mathworks is very conservative in its support of OpenCL, unlike CUDA. Therefore, we decided not to include OpenCL in this edition of our book. However, we will again consider whether to include OpenCL in future editions if increased needs come from market or Mathworks’ direction changes.

After reading this book, the reader, in no time, will experience an amazing performance boost in utilizing reader’s MATLAB codes and be better equipped in research to enjoy the useful open-source resources for CUDA. The features this book covers are available on Windows and Mac.

1 Accelerating MATLAB without GPU

This chapter deals with basic accelerating methods for MATLAB codes in an intrinsic way, which means simple code optimization without using GPU or C-MEX. This chapter covers vectorization for parallel processing, preallocation for efficient memory management, tips to increase your MATLAB codes, and step-by-step examples that show the code improvements.

Keywords

Vectorization; elementwise operation; vector/matrix operation; memory preallocation; sparse matrix form

1.1 Chapter Objectives

In this chapter, we deal with the basic accelerating methods for MATLAB codes in an intrinsic way – a simple code optimization without using GPU or C-MEX. You will learn about the following:

• The vectorization for parallel processing.

• The preallocation for efficient memory management.

• Other useful tips to increase your MATLAB codes.

• Examples that show the code improvements step by step.

1.2 Vectorization

Since MATLAB has the vector/matrix representation of its data, vectorization can help to make your MATLAB codes run faster. The key for vectorization is to minimize the usage of a for-loop.

Consider the following two m files, which are functionally the same:

The left nonVec1.m has a for-loop to calculate the sum, while the right Vec1.m has no for-loop in the code.

>> nonVec1

Elapsed time is 0.944395 seconds.

y =

−1.3042e+48

>> Vec1

Elapsed time is 0.330786 seconds.

y =

−1.3042e+48

The results are same but the elapsed time for Vec1.m is almost three times less than that for nonVec1.m. For better vectorization, utilize the elementwise operation and vector/matrix operation.

1.2.1 Elementwise Operation

The * symbol is defined as matrix multiplication when it is used on two matrices. But the .* symbol specifies an elementwise multiplication. For example, if x = [1 2 3] and v = [4 5 6],

>> k = x .* v

k =

4 10 18

Many other operations can be performed elementwise:

>> k = x .^2

k =

1 4 9

>> k = x ./ v

k =

0.2500 0.4000 0.5000

Many functions also support this elementwise operation:

>> k = sqrt(x)

k =

1.0000 1.4142 1.7321

>> k = sin(x)

k =

0.8415 0.9093 0.1411

>> k = log(x)

k =

0 0.6931 1.0986

>> k = abs(x)

k =

1 2 3

Even the relational operators can be used elementwise:

>> R = rand(2,3)

R =

0.8147 0.1270 0.6324

0.9058 0.9134 0.0975

>> (R > 0.2) & (R < 0.8)

ans =

0 0 1

0 0 0

>> x = 5

x =

>> x >= [1 2 3; 4 5 6; 7 8 9]

ans =

1 1 1

1 1 0

0 0 0

We can do even more complicated elementwise operations together:

>> A = 1:10;

>> B = 2:11;

>> C = 0.1:0.1:1;

>> D = 5:14;

>> M = B ./ (A .* D .* sin(C));

1.2.2 Vector/Matrix Operation

Since MATLAB is based on a linear algebra software package, employing vector/matrix operation in linear algebra can effectively replace the for-loop, and result in speeding up. Most common vector/matrix operations are matrix multiplication for combining multiplication and addition for each element.

If we consider two column vectors, a and b, the resulting dot product is the 1×1 matrix, as follows:

If two vectors, a and bto get the 1×1 matrix, resulting from the combination of multiplication and addition, as follows.

as the dot products of x with the rows of A:

1.2.3 Useful Tricks

In many applications, we need to set upper and lower bounds on each element. For that purpose, we often use if and elseif statements, which easily break vectorization. Instead of if and elseif statements for bounding elements, we may use min and max built-in functions:

>> ifExample

Elapsed time is 0.878781 seconds.

y =

5.8759e+47

>> nonifExample

Elapsed time is 0.309516 seconds.

y =

5.8759e+47

Similarly, if you need to find and replace some values in elements, you can also avoid if and elseif statements by using the find function to keep vectorization.

Enjoying the preview?

Page 1 of 1

Accelerating MATLAB with GPU Computing: A Primer with Examples

About this ebook

Jung W. Suh

Related authors

Related to Accelerating MATLAB with GPU Computing

Related ebooks

Programming For You

Related podcast episodes

Related articles

Related categories

Reviews for Accelerating MATLAB with GPU Computing

What did you think?

Book preview

Accelerating MATLAB with GPU Computing - Jung W. Suh

Preface

Target Readers and Contents

Directions of this Book

GPU Utilization Using c-mex Versus Parallel Computing Toolbox

Tutorial Approach Versus Case Study Approach

CUDA Versus OpenCL

1

Accelerating MATLAB without GPU

Keywords

1.1 Chapter Objectives

1.2 Vectorization

1.2.1 Elementwise Operation

1.2.2 Vector/Matrix Operation

1.2.3 Useful Tricks