Dear Readers,
In the last article, we looked at the so-called "embarrassingly parallel" operations, which can easily take advantage of multicore systems. In this article, let us look at one more way of getting performance improvements on multicore systems: OpenMP.
The OpenMP specification was originally defined in 1997 by industry vendors such as Sun and Intel, and became popular on Symmetric Multiprocessing (SMP) systems. A typical SMP system is a multiprocessor computer in which two or more identical processors are connected to a single shared memory, and all processors run the same OS instance.
Surprisingly, today's multicore systems are similar to the SMP architecture. Instead of multiple processors, we have multiple cores; all cores access the common shared memory and run the same OS instance. That is why a solution like OpenMP, which dates from the SMP era, is suddenly finding renewed interest on today's multicore systems.
The OpenMP specification is defined for the C, C++ and Fortran languages. It consists of three parts: compiler directives, a runtime library and environment variables. The code is instrumented with directives and compiled with an OpenMP-aware compiler, then linked with the runtime library to generate the executable. A set of environment variables controls the execution at run time.
An OpenMP program works like this: the program starts as a single master thread and runs serially until it reaches a region marked with a parallel directive. At that point the runtime forks a team of threads that execute the region concurrently and join back into the master thread at the end of the region (the fork-join model).
Since the process of creating, starting and joining threads is done automatically, programmers are relieved of these complexities. The model also allows variables to be locked or shared between the threads, and supports fairly advanced features.
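To make the fork-join model concrete, here is a minimal sketch (not from the original article) that uses one compiler directive and two calls from the OpenMP runtime library, omp_get_thread_num() and omp_get_num_threads():

#include <stdio.h>
#include <omp.h>

int main(void) {
    /* Serial part: only the master thread runs here. */
    printf("Before the parallel region\n");

    /* Fork: the runtime creates a team of threads for this block. */
    #pragma omp parallel
    {
        int id = omp_get_thread_num();      /* this thread's ID within the team */
        int total = omp_get_num_threads();  /* number of threads in the team    */
        printf("Hello from thread %d of %d\n", id, total);
    } /* Join: the threads synchronize here and only the master continues. */

    printf("After the parallel region\n");
    return 0;
}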
Here is an example code from Wikipedia:
int main(int argc, char **argv) {
const int N = 100000;
int i, a[N];
#pragma omp parallel for /* compiler directive */
for (i = 0; i < N; i++)
a[i] = 2 * i;
return 0;
}
As a first step, the code is compiled with an OpenMP-enabled compiler. An environment variable, OMP_NUM_THREADS, is set to the desired number of threads, and the program is executed. Suppose OMP_NUM_THREADS is set to 4. The code starts normally, but when it reaches the for loop it creates 4 threads, and each thread fills 100000/4 = 25000 different entries of the array. This speeds up the processing, as the four threads work in parallel on different cores.
As we keep increasing the value of OMP_NUM_THREADS, one can see a decrease in execution time and an improvement in performance, until system bus bottlenecks start showing up.
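As a rough sketch of how one could observe this (the loop body is just the earlier example; omp_get_wtime() and omp_get_max_threads() are part of the OpenMP runtime library):

#include <stdio.h>
#include <omp.h>

#define N 100000

int main(void) {
    static int a[N];                 /* static: keeps the large array off the stack */
    double start = omp_get_wtime();  /* wall-clock time before the loop */

    #pragma omp parallel for         /* iterations are split among the threads */
    for (int i = 0; i < N; i++)
        a[i] = 2 * i;

    double elapsed = omp_get_wtime() - start;
    printf("%d threads (max), loop took %f seconds\n",
           omp_get_max_threads(), elapsed);
    return 0;
}

Compiled with an OpenMP-aware compiler (for example, gcc -fopenmp on the GNU tool chain) and run as OMP_NUM_THREADS=2 ./a.out, OMP_NUM_THREADS=4 ./a.out and so on, the reported times can be compared across thread counts. For a loop this small, thread start-up overhead may dominate, so larger workloads show the scaling more clearly.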
The advantages of OpenMP include: parallelism can be added incrementally with directives, without restructuring the serial code; the same source still compiles and runs as an ordinary serial program when OpenMP is disabled; and the specification is portable across compilers and platforms.
The main disadvantage of OpenMP is that it needs specific tool chains (compilers and runtime). Not all compiler tool chains support OpenMP; popular ones that do include the Sun Studio tool chain and GCC 4.3.1.
OpenMP can give a big performance improvement on multicore systems, as each thread can run on a separate core.
Then why is OpenMP not so well known in the mainstream? It is because OpenMP gives big performance gains mainly to mathematical and scientific computing workloads, such as large matrix multiplications. For a typical desktop or server application, OpenMP may not be of great help unless the application logic contains such code.