Python + OpenCV2 系列：2 - 图片操作

这些相当于我的学习笔记，所以并没有很强的结构性和很全的介绍，请见谅。

1 读取、写入图像

下面是一个简短的载入图像、打印尺寸、转换格式及保存图像为.png的例子：

# -*- coding: utf-8 -*-
import cv2
import numpy as np
# 读入图像
im = cv2.imread(‘../data/empire.jpg‘)

# 打印图像尺寸
h, w = im.shape[:2]
print h, w 

# 保存原jpg格式的图像为png格式图像
cv2.imwrite(‘../images/ch10/ch10_P210_Reading-and-Writing-Images.png‘,im)

# 注：imread默认读取的是RGB格式，所以即使原图像是灰度图，读出来仍然是三个通道，所以，在imread之后可以添加参数

# 注：这里是相对路径: \与/是没有区别的，‘’ 和 “” 是没有区别的。 ../表示返回到上一级目录下，./表示与该源码文件同一级目录下。

# 注：函数imread()将图像返回为一个标准的NumPy数组。

1.1 相关注释

cv2.imread

Python: cv2.imread(filename[, flags])

Parameters:

Parameters:	filename – Name of file to be loaded. flags – Flags specifying the color type of a loaded image: CV_LOAD_IMAGE_ANYDEPTH - If set, return 16-bit/32-bit image when the input has the corresponding depth, otherwise convert it to 8-bit. CV_LOAD_IMAGE_COLOR - If set, always convert image to the color one CV_LOAD_IMAGE_GRAYSCALE - If set, always convert image to the grayscale one >0 Return a 3-channel color image. Note In the current implementation the alpha channel, if any, is stripped from the output image. Use negative value if you need the alpha channel. =0 Return a grayscale image. 如果是灰度图就用这个就好了。例如：`cv2.imread`‘../data/empire.jpg‘，0) <0 Return the loaded image as is (with alpha channel).

filename – Name of file to be loaded.
flags –

Flags specifying the color type of a loaded image:
- CV_LOAD_IMAGE_ANYDEPTH - If set, return 16-bit/32-bit image when the input has the corresponding depth, otherwise convert it to 8-bit.
- CV_LOAD_IMAGE_COLOR - If set, always convert image to the color one
- CV_LOAD_IMAGE_GRAYSCALE - If set, always convert image to the grayscale one
- >0 Return a 3-channel color image.
  
  Note
  
  In the current implementation the alpha channel, if any, is stripped from the output image. Use negative value if you need the alpha channel.
- =0 Return a grayscale image. 如果是灰度图就用这个就好了。例如：cv2.imread‘../data/empire.jpg‘，0)
- <0 Return the loaded image as is (with alpha channel).

cv2.imwrite

Python: cv2.imwrite(filename, img[, params])

Parameters:

Parameters:	filename – Name of the file. image – Image to be saved. params – Format-specific save parameters encoded as pairs `paramId_1, paramValue_1, paramId_2, paramValue_2, ...` . The following parameters are currently supported: For JPEG, it can be a quality ( `CV_IMWRITE_JPEG_QUALITY` ) from 0 to 100 (the higher is the better). Default value is 95. For PNG, it can be the compression level ( `CV_IMWRITE_PNG_COMPRESSION` ) from 0 to 9. A higher value means a smaller size and longer compression time. Default value is 3. For PPM, PGM, or PBM, it can be a binary format flag ( `CV_IMWRITE_PXM_BINARY` ), 0 or 1. Default value is 1.

filename – Name of the file.
image – Image to be saved.
params –

Format-specific save parameters encoded as pairs paramId_1, paramValue_1, paramId_2, paramValue_2, ... . The following parameters are currently supported:
- For JPEG, it can be a quality ( CV_IMWRITE_JPEG_QUALITY ) from 0 to 100 (the higher is the better). Default value is 95.
- For PNG, it can be the compression level ( CV_IMWRITE_PNG_COMPRESSION ) from 0 to 9. A higher value means a smaller size and longer compression time. Default value is 3.
- For PPM, PGM, or PBM, it can be a binary format flag ( CV_IMWRITE_PXM_BINARY ), 0 or 1. Default value is 1.

2 图像RGB, HSV 通道分离

# Convert BGR to r,g,b
b,g,r = cv2.split(im)

# Convert BGR to HSV
image_hue_saturation_value = cv2.cvtColor(im, cv2.COLOR_BGR2HSV)
h,s,v=cv2.split(image_hue_saturation_value)

# Convert BGR to gray
image_gray = cv2.cvtColor(im, cv2.COLOR_BGR2GRAY)

# 注：RGB channels is indexed in B G R which is different from matlab。

# 注：Any channels could be split using cv2.split， pay attention to the sequence of channels

2.1 相关注释

Python: cv2.split(m[, mv]) → mv

Parameters:	src – input multi-channel array. mv – output array or vector of arrays; in the first variant of the function the number of arrays must match `src.channels()`; the arrays themselves are reallocated, if needed.

Python: cv2.cvtColor(src, code[, dst[, dstCn]]) → dst

Parameters:

Parameters:	src – input image: 8-bit unsigned, 16-bit unsigned ( `CV_16UC...` ), or single-precision floating-point. dst – output image of the same size and depth as `src`. code – color space conversion code (see the description below). dstCn – number of channels in the destination image; if the parameter is 0, the number of the channels is derived automatically from `src` and `code` .

src – input image: 8-bit unsigned, 16-bit unsigned ( CV_16UC... ), or single-precision floating-point.
dst – output image of the same size and depth as src.
code – color space conversion code (see the description below).
dstCn – number of channels in the destination image; if the parameter is 0, the number of the channels is derived automatically from src and code .

3 图像矩阵的操作（点乘，复制，截取，1到N维矩阵）

# mask seed 3D matrixseed_mask_single_channel_list = np.array([[[1,0,0],[0,0,0],[0,0,0]],[[0,1,0],[0,0,0],[0,0,0]],[[0,0,1],[0,0,0],[0,0,0]],　　　　　　　　　　　　　　　　　　　[[0,0,0],[1,0,0],[0,0,0]],[[0,0,0],[0,1,0],[0,0,0]],[[0,0,0],[0,0,1],[0,0,0]],　　　　　　　　　　　　　　　　　　　[[0,0,0],[0,0,0],[1,0,0]],[[0,0,0],[0,0,0],[0,1,0]],[[0,0,0],[0,0,0],[0,0,1]]])

# cut image
image_new_sample = image_source[:200,:200] #取前200个行和列的元素，python是从0开始的，所以0:200表示的是0-199这200个元素，取不到200.而初始位置0可以省略

#separate channel
mask_singel_channel = np.tile(seed_mask_single_channel_list[1],(70,70))[:200,:200] #第一个3*3的mask作为一个单元进行复制成为70行,70列，截取前200行,200列
single_channel_image = mask_singel_channel * image_new_sample #表示点乘

# 注：矩阵的操作用Numpy这个类库进行。

3.1 相关注释

numpy.array(object, dtype=None, copy=True, order=None, subok=False, ndmin=0)

Parameters:	object : array_like An array, any object exposing the array interface, an object whose __array__ method returns an array, or any (nested) sequence. dtype : data-type, optional The desired data-type for the array. If not given, then the type will be determined as the minimum type required to hold the objects in the sequence. This argument can only be used to ‘upcast’ the array. For downcasting, use the .astype(t) method. copy : bool, optional If true (default), then the object is copied. Otherwise, a copy will only be made if __array__ returns a copy, if obj is a nested sequence, or if a copy is needed to satisfy any of the other requirements (dtype, order, etc.). order : {‘C’, ‘F’, ‘A’}, optional Specify the order of the array. If order is ‘C’ (default), then the array will be in C-contiguous order (last-index varies the fastest). If order is ‘F’, then the returned array will be in Fortran-contiguous order (first-index varies the fastest). If order is ‘A’, then the returned array may be in any order (either C-, Fortran-contiguous, or even discontiguous). subok : bool, optional If True, then sub-classes will be passed-through, otherwise the returned array will be forced to be a base-class array (default). ndmin : int, optional Specifies the minimum number of dimensions that the resulting array should have. Ones will be pre-pended to the shape as needed to meet this requirement.
Returns:	out : ndarray An array object satisfying the specified requirements.

Parameters:

object : array_like

An array, any object exposing the array interface, an object whose __array__ method returns an array, or any (nested) sequence.

dtype : data-type, optional

The desired data-type for the array. If not given, then the type will be determined as the minimum type required to hold the objects in the sequence. This argument can only be used to ‘upcast’ the array. For downcasting, use the .astype(t) method.

copy : bool, optional

If true (default), then the object is copied. Otherwise, a copy will only be made if __array__ returns a copy, if obj is a nested sequence, or if a copy is needed to satisfy any of the other requirements (dtype, order, etc.).

order : {‘C’, ‘F’, ‘A’}, optional

Specify the order of the array. If order is ‘C’ (default), then the array will be in C-contiguous order (last-index varies the fastest). If order is ‘F’, then the returned array will be in Fortran-contiguous order (first-index varies the fastest). If order is ‘A’, then the returned array may be in any order (either C-, Fortran-contiguous, or even discontiguous).

subok : bool, optional

If True, then sub-classes will be passed-through, otherwise the returned array will be forced to be a base-class array (default).

ndmin : int, optional

Specifies the minimum number of dimensions that the resulting array should have. Ones will be pre-pended to the shape as needed to meet this requirement.

Returns:

out : ndarray

An array object satisfying the specified requirements.

e.g. 最外层始终都是[],所以如果是1维就一个[],2维就2个，N维就N个

>>> np.array([1, 2, 3])
array([1, 2, 3])
>>> np.array([[1, 2], [3, 4]])
array([[1, 2],
       [3, 4]])
>>> np.array([1, 2, 3], ndmin=2)
array([[1, 2, 3]])

numpy.tile(A, reps)

Parameters:	A : array_like The input array. reps : array_like The number of repetitions of A along each axis.
Returns:	c : ndarray The tiled output array

Parameters:

A : array_like

The input array.

reps : array_like

The number of repetitions of A along each axis.

Returns:

c : ndarray

The tiled output array

e.g.

>>> b = np.array([[1, 2], [3, 4]])
>>> np.tile(b, 2)
array([[1, 2, 1, 2],
       [3, 4, 3, 4]])
>>> np.tile(b, (2, 1))
array([[1, 2],
       [3, 4],
       [1, 2],
       [3, 4]])

时间： 2024-12-29 23:25:30

Python + OpenCV2 系列：2 - 图片操作的相关文章

Python + OpenCV2 系列：1 - 配置

Python+OpenCV2+Eclipse+Windos 8.1(32bits): 最初的目的是做图像处理,opencv强大的社区支持,让我想从matlab转到opencv框架下进行试验,而Python和Eclipse的优缺点就不罗列了.下面是 https://www.python.org/about/主页的概括 Python is powerful... and fast; plays well with others; runs everywhere; is friendly & easy

python入门系列：文件操作

操作模式 r """以只读方式打开文件(默认),文件的指针会放在文件的开头若文件不存在,会报错"""f = open("a.txt", "r") # 在同级目录下打开文件content = f.read()print(content)w """以只写方式打开文件,文件的指针也在文件开头,进行覆盖写入(先清空后写入)若文件不存在,会创建一个新的文件""&quo

Python + OpenCV2 系列：3 - python 字符串，类，编码规范

首先,强烈推荐<<简明 Python 教程>> Swaroop, C. H. 著沈洁元译其实,这本书里已经把python的最基本的用法,编码等等介绍的很好,这里把我用到的进行一下总结. 1. 字符串连接字符串数组转义字符 Excellent Reference Link :http://www.w3cschool.cc/python/python-strings.html 字符串 ● 使用单引号(')你可以用单引号指示字符串,就如同'Quote me on this'这

Python学习系列之文件操作

Pyhton文件打开方式 with= open('文件路径','打开模式') as f:#PS:python3提供了with语句来帮我们自动调用close方法,所以说无论打开文件是否出错都能自动正确的关闭文件 Python打开文件的模式基本模式带'+'的模式带'b'的模式 #提示:以b方式打开时,读取到的内容是字节类型,写入时也需要提供字节类型带'+'和带'b'的模式 Python文件读取方式 Pyhton文件写入方式 Python文件操作所提供的方法 close(self) 关闭已经打

[js高手之路] html5 canvas系列教程 - 图片操作(drawImage,clip,createPattern)

接着上文[js高手之路] html5 canvas系列教程 - 文本样式(strokeText,fillText,measureText,textAlign,textBaseline)继续,本文介绍的内容是canvas开发,特别是游戏中,比较常用的内容:图片处理.在游戏中的资源大多数都是加载图片. 一.canvas怎么加载图片? canvas提供3种图片加载的API调用方式. drawImage( image, dx, dy ) drawImage( image, dx, dy, dw, dh

基于python：opencv简单图片操作

一.主要函数 1. cv2.imread():读入图片,共两个参数,第一个参数为要读入的图片文件名,第二个参数为如何读取图片,包括cv2.IMREAD_COLOR:读入一副彩色图片:cv2.IMREAD_GRAYSCALE:以灰度模式读入图片:cv2.IMREAD_UNCHANGED:读入一幅图片,并包括其alpha通道. 2.cv2.imshow():创建一个窗口显示图片,共两个参数,第一个参数表示窗口名字,可以创建多个窗口中,但是每个窗口不能重名:第二个参数是读入的图片. 3.cv2.wai

Python学习系列（五）(文件操作及其字典)

Python学习系列(五)(文件操作及其字典) Python学习系列(四)(列表及其函数) 一.文件操作 1,读文件在以'r'读模式打开文件以后可以调用read函数一次性将文件内容全部读出,也可以指定每次read读多少字节,例如: 1 #coding:utf-8 2 fn='test1.py' 3 fp=open(fn,'r') #以读的方式打开文件,文件必须首先存在和,.文件在同一目录下py 4 print 'reading pos:',fp.tell() 5 r=fp.read(20) #

python selenium系列（三）常用操作类型及方法

一前言开展WEB UI自动化的核心思路,无非就是找到元素,然后操作元素这两个内容.在python selenium系列(二)元素定位方式一文中,已经介绍了如何找到元素这项技能,本文将介绍第二项内容,即如何操作已经找到的元素. 二操作方法分类总体来说,可以将操作大体分成四类,即浏览器操作.键盘操作.鼠标操作.js脚本. 1. 浏览器常用操作方法: 方法描述 driver.maximize_window() 窗口最大化 driver.back() 页面返回 driver.forward(

Python实战：美女图片下载器，海量图片任你下载

Python应用现在如火如荼,应用范围很广.因其效率高开发迅速的优势,快速进入编程语言排行榜前几名.本系列文章致力于可以全面系统的介绍Python语言开发知识和相关知识总结.希望大家能够快速入门并学习Python这门语言. 本文是在前一部分Python基础之上Python实战:Python爬虫学习教程,获取电影排行榜,再次升级的Python网页爬虫实战课程. 1.项目概述. 利用XPath和requests模块进行网页抓取与分析,达到网页图片下载的效果. 抓爬图片地址:http://www.2c