如何使用python找出CPU的数量

https://stackoverflow.com/questions/1006289

06-07-2019
|

题

我想知道使用Python的本地机器上的CPU数量。结果应该是 user/real 作为输出 time(1) 当使用最佳缩放的仅用户空间程序调用时。

解决方案 2

如果您对当前流程的可用处理器数量感兴趣，则必须检查 cpuset 。否则（或者如果cpuset未使用），请 multiprocessing.cpu_count（） 是Python 2.6及更高版本的出路。以下方法可以回溯到旧版Python中的几种替代方法：

import os
import re
import subprocess


def available_cpu_count():
    """ Number of available virtual or physical CPUs on this system, i.e.
    user/real as output by time(1) when called with an optimally scaling
    userspace-only program"""

    # cpuset
    # cpuset may restrict the number of *available* processors
    try:
        m = re.search(r'(?m)^Cpus_allowed:\s*(.*),
                      open('/proc/self/status').read())
        if m:
            res = bin(int(m.group(1).replace(',', ''), 16)).count('1')
            if res > 0:
                return res
    except IOError:
        pass

    # Python 2.6+
    try:
        import multiprocessing
        return multiprocessing.cpu_count()
    except (ImportError, NotImplementedError):
        pass

    # https://github.com/giampaolo/psutil
    try:
        import psutil
        return psutil.cpu_count()   # psutil.NUM_CPUS on old versions
    except (ImportError, AttributeError):
        pass

    # POSIX
    try:
        res = int(os.sysconf('SC_NPROCESSORS_ONLN'))

        if res > 0:
            return res
    except (AttributeError, ValueError):
        pass

    # Windows
    try:
        res = int(os.environ['NUMBER_OF_PROCESSORS'])

        if res > 0:
            return res
    except (KeyError, ValueError):
        pass

    # jython
    try:
        from java.lang import Runtime
        runtime = Runtime.getRuntime()
        res = runtime.availableProcessors()
        if res > 0:
            return res
    except ImportError:
        pass

    # BSD
    try:
        sysctl = subprocess.Popen(['sysctl', '-n', 'hw.ncpu'],
                                  stdout=subprocess.PIPE)
        scStdout = sysctl.communicate()[0]
        res = int(scStdout)

        if res > 0:
            return res
    except (OSError, ValueError):
        pass

    # Linux
    try:
        res = open('/proc/cpuinfo').read().count('processor\t:')

        if res > 0:
            return res
    except IOError:
        pass

    # Solaris
    try:
        pseudoDevices = os.listdir('/devices/pseudo/')
        res = 0
        for pd in pseudoDevices:
            if re.match(r'^cpuid@[0-9]+, pd):
                res += 1

        if res > 0:
            return res
    except OSError:
        pass

    # Other UNIXes (heuristic)
    try:
        try:
            dmesg = open('/var/run/dmesg.boot').read()
        except IOError:
            dmesgProcess = subprocess.Popen(['dmesg'], stdout=subprocess.PIPE)
            dmesg = dmesgProcess.communicate()[0]

        res = 0
        while '\ncpu' + str(res) + ':' in dmesg:
            res += 1

        if res > 0:
            return res
    except OSError:
        pass

    raise Exception('Can not determine number of CPUs on this system')

其他提示

如果你有一个版本＆gt; = 2.6的python，你可以简单地使用

import multiprocessing

multiprocessing.cpu_count()

http://docs.python.org/library/multiprocessing.html# multiprocessing.cpu_count

另一种选择是使用 psutil 库，在这些情况下很有用：

>>> import psutil
>>> psutil.cpu_count()
2

这应该适用于 psutil （Unix和Windows）支持的任何平台。

请注意，在某些情况下 multiprocessing.cpu_count 可能会引发 NotImplementedError ，而 psutil 将能够获取CPU的数量。这只是因为 psutil 首先尝试使用 multiprocessing 所使用的相同技术，如果失败，它还会使用其他技术。

在Python 3.4+中： os.cpu_count（）。

multiprocessing.cpu_count（）是根据此函数实现的，但如果 os.cpu_count（）返回，则引发 NotImplementedError （“无法确定CPU的数量”）。

import os

print(os.cpu_count())

平台独立：

psutil.cpu_count（逻辑=假）

https://github.com/giampaolo/psutil/blob/master/INSTALL.rst

multiprocessing.cpu_count（）将返回逻辑CPU的数量，因此如果你有一个具有超线程的四核CPU，它将返回 8 。如果需要物理CPU的数量，请使用python绑定到hwloc：

#!/usr/bin/env python
import hwloc
topology = hwloc.Topology()
topology.load()
print topology.get_nbobjs_by_type(hwloc.OBJ_CORE)

hwloc旨在跨操作系统和体系结构移植。

len（os.sched_getaffinity（0））是您通常想要的

https://docs.python.org/3/library /os.html#os.sched_getaffinity

os.sched_getaffinity（0）（在Python 3中添加）返回可用的CPU集合，考虑到 sched_setaffinity Linux系统调用，它限制了一个进程及其进程的CPU孩子们可以继续活下去。

0 表示获取当前进程的值。该函数返回允许的CPU的 set（），因此需要 len（）。

另一方面，

multiprocessing.cpu_count（）只返回物理CPU的总数。

差异尤其重要，因为某些群集管理系统（如 Platform LSF ）限制了作业CPU使用 sched_getaffinity 。

因此，如果您使用 multiprocessing.cpu_count（），您的脚本可能会尝试使用比可用内核更多的内核，这可能会导致过载和超时。

我们可以通过限制 taskset 实用程序的亲和力来具体看到差异。

例如，如果我在我的16核心系统中将Python限制为仅1核（核心0）：

taskset -c 0 ./main.py

使用测试脚本：

main.py

#!/usr/bin/env python3

import multiprocessing
import os

print(multiprocessing.cpu_count())
print(len(os.sched_getaffinity(0)))

然后输出是：

16
1

默认情况下，

nproc 尊重关联：

taskset -c 0 nproc

输出：

和 man nproc 非常明确：

打印可用的处理单元数

nproc 具有 - all 标志，用于获取物理CPU数量的不太常见的情况：

taskset -c 0 nproc --all

这种方法的唯一缺点是它似乎只是UNIX。我认为Windows必须有一个类似的亲和力API，可能是 SetProcessAffinityMask ，所以我想知道它为什么没有被移植。但我对Windows一无所知。

在Ubuntu 16.04，Python 3.5.2中测试。

无法弄清楚如何添加到代码或回复消息，但这里支持jython，你可以在放弃之前加入：

# jython
try:
    from java.lang import Runtime
    runtime = Runtime.getRuntime()
    res = runtime.availableProcessors()
    if res > 0:
        return res
except ImportError:
    pass

你也可以使用“joblib”。以此目的。

import joblib
print joblib.cpu_count()

此方法将为您提供系统中的cpus数量。但是需要安装joblib。有关joblib的更多信息，请访问 https://pythonhosted.org/joblib/parallel.html

或者你可以使用python的numexpr包。它有很多简单的函数，有助于获取有关系统cpu的信息。

import numexpr as ne
print ne.detect_number_of_cores()

这些为您提供超线程 CPU 数量

multiprocessing.cpu_count()
os.cpu_count()

这些为您提供虚拟机 CPU 计数

psutil.cpu_count()
numexpr.detect_number_of_cores()

仅当您在虚拟机上工作时才重要。

如果您没有Python 2.6，则另一个选择：

import commands
n = commands.getoutput("grep -c processor /proc/cpuinfo")

许可以下： CC-BY-SA 和归因

不隶属于 StackOverflow