Allowing GPU memory growth

By default, TensorFlow maps nearly all of the GPU memory of all GPUs visible to the process (subject to CUDA_VISIBLE_DEVICES). This is done to use the relatively precious GPU memory resources on the devices more efficiently by reducing memory fragmentation.
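If you want only some GPUs to be visible in the first place, you can set CUDA_VISIBLE_DEVICES before TensorFlow initializes CUDA. A minimal sketch (the device index "0" is an arbitrary illustrative choice):

import os

# Must be set before TensorFlow touches the driver, i.e. before
# the first `import tensorflow`.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"

import tensorflow as tf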

In some cases it is desirable for the process to only allocate a subset of the available memory, or to only grow the memory usage as is needed by the process. TensorFlow provides two Config options on the Session to control this.

The first is the allow_growth option, which attempts to allocate only as much GPU memory as the runtime allocations require: it starts out allocating very little memory, and as Sessions get run and more GPU memory is needed, we extend the GPU memory region used by the TensorFlow process. Note that we do not release memory, since that can lead to even worse memory fragmentation. To turn this option on, set it in the ConfigProto:

config = tf.ConfigProto()
config.gpu_options.allow_growth = True
session = tf.Session(config=config, ...)
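As a complete, runnable illustration (the 4096x4096 matmul is an arbitrary size chosen only to make the allocator grow):

import tensorflow as tf

config = tf.ConfigProto()
config.gpu_options.allow_growth = True

with tf.Session(config=config) as sess:
    # A reasonably large matmul forces the allocator to extend its
    # memory region; watching nvidia-smi during the run shows usage
    # starting small instead of claiming nearly all memory up front.
    a = tf.random_normal([4096, 4096])
    b = tf.random_normal([4096, 4096])
    sess.run(tf.matmul(a, b))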

The second method is the per_process_gpu_memory_fraction option, which determines the fraction of the overall amount of memory that each visible GPU should be allocated. For example, you can tell TensorFlow to only allocate 40% of the total memory of each GPU by:

config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.4
session = tf.Session(config=config, ...)

This is useful if you want to truly bound the amount of GPU memory available to the TensorFlow process.
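The two options can also be combined; assuming the fraction acts as a hard cap, allow_growth then makes allocations within that cap happen on demand rather than up front. On a 12 GB card, for example, a fraction of 0.4 bounds the process at roughly 4.8 GB. A sketch:

import tensorflow as tf

# Cap the process at 40% of each visible GPU's memory, and grow
# allocations lazily within that cap instead of reserving it eagerly.
config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.4
config.gpu_options.allow_growth = True

session = tf.Session(config=config)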
