XIV(4)--Flash caching

1,Flash Cache简介

Flash cache 本身是Facebook的一个开源项目,(准确的说是一个Linux的模块),可以动态加载。Flashcache通过在文件系统(VFS)和设备驱动之间新增了一次缓存层,来实现对热门的缓存。Flashcache是另一种缓存,一般用SSD作为介质的缓存(一般的缓存用的是内存),通过将传统硬盘上的热门数据缓存到SSD上,然后利用SSD优秀的读性能,来加速系统。这个方法较之内存缓存,没有内存快,但是空间可以比内存大很多。如下图:

现在很多硬件厂商也会在存储设备中增加这个功能来,例如IBM XIV的Flash cache和 EMC的VFCache

2, XIV Flash Cache

从11.1.0开始,XIV Gen3开始支持可选的Flash caching feature,极大的提高小数据块,随机读的性能,适用于Data patterns一直改变的环境。

XIV flash caching is implemented as an extension of the primary cache layer.   每个模块400GB Cache,那一个系统就是6TB(400GB*15),从11.4开始,每个Module能支持800GB。Flash cache 只用于读操作,当不再需要Cache中的数据时,直接drop掉就行了。

XIV flash caching overview

Flash caching算法是嵌入在XIV Firmware中的,能自动适应相应的IO类型,对用户透明,不需要管理员手动的做performance turning。

XIV系统中有2种类型的Cache:main 和 extended  
--The main cache handles host write I/Os and then destages them directly to the disk drive. //Main cache主要处理主机写IO然后直接Flush到Disk    
--The extended cache handles the caching of random read miss operations less than 64 KB. //Extended cache主要处理小于64KB的随机读操作    

Sequential read prefetches (larger than 64 KB) are handled in main dynamic random access memory (DRAM) cache. //大于64KB的读操作是没main cache处理的


3,Flash Cache learning

A flash cache map is built as read misses occur in the DRAM cache. The process, known as flash cache learning


关于XIV中Main cache和Extended Cache(Flash Cache)是如何扮演各自角色的,请看下图:

XIV flash cache learning

The cache node immediately checks the extended cache for the requested I/O. If the requested I/O exists in the extended flash cache, it is served to the host through the main cache. The I/O operation is now complete and is recorded as a flash cache read hit.

If the operation results in a true read miss (not in the DRAM cache and not in extended flash cache), the request is forwarded in an unmodified state to the disk drive (SAS layer). The I/O is retrieved from the disk drive and served to the host through the main cache. From a host perspective, the I/O operation is now complete and is recorded as a read miss. The related pages are copied into reserved buffers in the main cache.

Important: Any read larger than 64 KB bypasses the extended flash cache.

当Buffer中的data达到512KB时,会顺序地写到Flash cache中。这种方式延长了Flash cache中的寿命。

Note: XIV在系统重启和Firmware升级中能保存Flash Cache中的数据

XIV Storage System software Version 11.2 introduced improved flash caching algorithms,providing a performance boost of up to 4.5 times over systems without flash cache for random database-type workloads. This boost is accomplished by storing and computing all flash cache-related data integrity checking tasks in DRAM rather than on the flash cache.


4,Approachesfor Using SSDs in a Storage System

Approach
of the tier. With policies that moved the data … or As
a caching layer. This is the approach of XIV.

5,SSD failure

  • No redistribution in case of failure/phase-out
  • SSD Failure:

-Reinitialize the metadata of the SSD (so it is not used)
  -The degraded module continue to server reads from its DRAM cache and large sequential reads from its disks
  -Small read misses are redirected to the secondary

  • SSD Phase-out and not failed

- Its data is invalidated (on writes)
=> if phased-in not all data is lost


During
a rebuild, following a SAS disk failure in a module, the data on SSD in that
module is

not
invalidated. Rather, it is gradually updated to contain the new data blocks
(the same way

the
DRAM does).

------------------------------------------------------------------------------

后记:关于EMC VFCache和IBM Flash Cache,感兴趣的可以看看比特网的下图对比。

参考文章:http://storage.chinabyte.com/223/12261223.shtml

时间: 2024-11-08 08:02:07

XIV(4)--Flash caching的相关文章

XIV(1)—Hardware Overview Cont.

接第一篇<XIV(1)- Hardware Overview>讲述了XIV系统的大体的硬件组成,本篇继续讲XIV系统的Hardware Architecture. 1,Data Module/Interface Module XIV Model A14 Data Module/Interface Module Data Module和Interface Module的区别在于Interface Module有主机IO适配器和G网卡适配器,还有就是Data Module是一个Low voltag

XIV(3)--Read/Write Operations

XIV系列: <XIV(1)-Hardware Overview> <XIV (2)--Logical system concepts> 之前的文章曾经说过HOST发过来的data会在XIV上存2份,即Primary Copy和Secondary Copy.只有当这2份都同时存在时,系统才是Full Redundancy状态.那主机在XIV上读写数据分别是怎么进行的呢?请看本篇: -Each write is written to the cache of two data mod

XIV (2)--Logical system concepts

先来看看几个XIV中最基本的概念: Partitions     The fundamental building block of a logical volume is known as a partition. Partition是1MB(1024KB),包括a primary copy 或者 secondary copy of data,每个Partition都对应一个单独的物理磁盘的某个区域,存储管理员不知道也无法控制一个Partition对应到哪个disk,它是由系统算法控制的.Se

Flask(十一)flash与APScheduler 实现定时任务

from flask import Flask from flask_apscheduler import APScheduler # 引入APScheduler class Config(object): # 创建配置,用类 JOBS = [ # 任务列表 { # 任务字典(细节) 'id': 'job1', 'func': '__main__:job_1', 'args': (1, 2), 'trigger': 'cron', 'hour': 19, 'minute': 27 }, { #

XIV(1)&mdash;Hardware Overview

IBM XIV被业内称为革命性的新一代存储架构,它是采用了大规模并行的分布式网格存储技术,使用了"Scale out"(横向扩展)的存储架构,利用多路网格模块并行分担存储负荷,并通过细粒度数据分布算法保证数据的恒定均衡分布,它不但提高了系统的可靠性.可用性和存取效率,还易于扩展. 1,System models and components Machine type 2812-A14    Machine type 2810-A14     Machine type 2812-114

XIV(1)- Hardware Overview

IBM XIV被业内称为革命性的新一代存储架构,它是采用了大规模并行的分布式网格存储技术,使用了"Scale out"(横向扩展)的存储架构,利用多路网格模块并行分担存储负荷,并通过细粒度数据分布算法保证数据的恒定均衡分布,它不但提高了系统的可靠性.可用性和存取效率,还易于扩展. 1,System models and components Machine type 2812-A14 Machine type 2810-A14 Machine type 2812-114 Machine

Flex开发实战(二)--Flash Builder 4.7下载和安装及破解方法

摘要:开发Flex我们需要使用Flash Builder等IDE,但是Flash Builder又不是开源的.目前Flash Builder开发工具最新的版本是4.14.1,本文章主要是讲述Flash Builder 4.7下载,Flash Builder 4.7安装,以及Flash Builder 4.7破解的方法(破解方案都是通用的),帮助一些想更好地学习Flex开发的朋友快速入门. 本内容是针对一些买不起正版,又想学习 Flex 开发的吊丝们.本教程是由本程序员亲测,下面也会讲到在安装的时

(转)flash的Socket通讯沙箱和安全策略问题

一.沙箱和安全策略问题 1.此问题发生在连接时,准确地说是连接前,分别两种情况: 1.本地播放 本地播放时,默认情况下Flash Player将不允许swf访问任何网络. 访问http://www.macromedia.com/support/documentation/en/flashplayer/help/settings_manager04.html,将swf加入到许可列表,即可解除限制. 2.WEB发布 发布在WEB上的swf, 将可能面临跨域的问题. 2.Flash中的通信方式有两种:

XIV(5)-- Data Recovery Protection (XDRP)

和大多数存储系统一样,XIV也提供多地容灾的解决方案.XIV Data Recovery Protection (XDRP)有三种实现方式, Synchronous Mirroring; ASynchronous Mirroring; Data Migration.除此之外,当然也支持Flashcopy,VolumeCopy 一.Synchronous Mirroring XDRP是在两个或多个XIV系统之间做real-time copy,支持 Fiber Channel 或iSCSI link