pugixml库之xml解析库

前言：

本文介绍c++编写的xml解析库——pugixml，能解析xml内容，支持xpath解析，同时能够跨linux平台，非常方便。

总结一下使用步骤和简单的使用方法：

使用pugixml库需要三个文件：pugiconfig.h/pugixml.h/pugixml.cpp，可直接从gugixml官网下载，将其加入工程，使用处包含头文件pugiconfig.h/pugixml.h即可。

加载xml文件，使用xml_document类的load_file接口:

    std::strFile = "../test.xml";
    pugi::xml_document doc;
    if (!doc.load_file(strFile.c_str()))
    {  return ;}

加载xml格式的字符串，使用xml_document类的load接口:

  std::strText = "testing";
    pugi::xml_document doc;
    if (!doc.load(strText.c_str()))
    {  return ;}

xml节点读取，如：xml文件params.xml：

 <?xml version="1.0" encoding="utf-8" ?>
   <root>
    <!-- 输入参数配置 -->
    <form ip="10.2.134.243" port="80" action="sisserver.php">
    <input name="data_type" value="POI" />
    <input name="query_type" value="TQUERY" />
    <input name="category" value="" /> 

    <!-- 查询词的返回结果xpath配置 -->
    <xpath poiroot="//list/poi" idfield="pguid" namefield="name"/>
    <!-- 评分权重配置 r1~r4-期望结果的权重，n1~n10-实际查询结果的排名权重-->
    <weight>
     <!-- 查询词正常得分阀值 -->
     <threshold>3</threshold>
     <!-- 计算分数分布情况的步长值 -->
     <step>0.5</step>
    </weight>
  </root>

读取代码：

std::string strFile = "/bak/workspace/test/src/params.xml";
    pugi::xml_document doc;
    if (!doc.load_file(strFile.c_str()))
    {return 0;}
    pugi::xml_node form = doc.child("root").child("form");
    std::string ip = form.attribute("ip").value();
    std::string port = form.attribute("port").value();

    char cBuf[2083];
    sprintf(cBuf, "http://%s:%s/%s?", ip.c_str(), port.c_s());
    std::string strTemp(cBuf);
    std::string m_strURLBase = strTemp;

    for (pugi::xml_node input = form.first_child(); input;input = input.next_sibling())
    {
        std::string strValue = input.attribute("value").value();
        if (!strValue.empty())
        {
            std::string strName = input.attribute("name").value();
            sprintf(cBuf, "%s=%s&", strName.c_str(), strValue.c_str());
            std::string strTemp(cBuf);
            m_strURLBase += strTemp;
        }
    }

    //读取xpath
    pugi::xml_node xpath = doc.child("root").child("xpath");
    std::string m_strPOIRoot = xpath.attribute("poiroot").value();
    std::string m_strPOIID = xpath.attribute("idfield").value();

    //读取评分权重
    pugi::xml_node weight = doc.child("root").child("weight");
    float m_fThred = atof(weight.child_value("threshold"));
    float m_fStep = atof(weight.child_value("step"));

xpath解析，如xml格式的字符串strWebContent:

<?xml version="1.0" encoding="utf-8" ?>
   <root>
    <list count="3" time"10">
    <poi>
       <pguid>123</pguid>
       <name>xx1</name>
    </poi>
    <poi>
       <pguid>456</pguid>
       <name>xx2</name>
    </poi>
    <poi>
       <pguid>789</pguid>
       <name>xx3</name>
    </poi>
    </list>
  </root>

其中，xpath根路径：m_strPOIRoot="//list/poi",

需要取值的项：strPOIID=“pguid”，strPOINam=“name”。

读取代码：

 //从strWebContent内容中解析出pguid和name
  pugi::xml_document doc;
  pugi::xml_parse_result result = doc.load(strWebContent.c_str());
  if (!result)
  {return -1;}
  pugi::xpath_node_set tools = doc.select_nodes(m_strPOIRoot.c_str());
  for (pugi::xpath_node_set::const_iterator it = tools.begin();
      it !=  tools.end(); ++it)
  {
     pugi::xpath_node node = *it;
     string strPOI = node.node().child_value(m_strPOIID.c_str());
     string strName = node.node().child_value(m_strPOIName.c_str());
  }

时间： 2024-12-20 16:30:07

pugixml库之xml解析库的相关文章

IOS学习：常用第三方库（GDataXMLNode：xml解析库）

IOS学习:常用第三方库(GDataXMLNode:xml解析库) 分类: ios开发学习2013-05-30 09:45 4443人阅读评论(0) 收藏举报 iOSXMLGDataXMLNode 一.GDataXMLNode说明 GDataXMLNode是Google提供的用于XML数据处理的类集.该类集对libxml2--DOM处理方式进行了封装,能对较小或中等的xml文档进行读写操作且支持XPath语法. 使用方法: 1.获取GDataXMLNode.h/m文件,将GDataXMLNo

EXPAT(XML解析库)

一.简介 expat是一个由C语言编写的XML解析库.James Clark创建了这个库,现在是制定XML标准的W3组织的技术leader.现在的版本是2.0.2.0开始就由Clark Cooper领导的开发组在sourceforge.net负责开发. expat是一个基于SAX模型的.非验证(默认,v1.2开始提供验证接口,需要用户手动处理)的轻量级XML解释器.目前XML的解析主要有两大模型:SAX和DOM.其中SAX(Simple API for XML)是基于事件的解析方法.基本工作原理

Swift XML解析库 - SwiftyXMLParser

经过在CocoaPods中筛选以后,发这个这个比较好用,整理出来如果有需要可以在Pods命令端输入: pod search xml 这样会搜索出很多相关Xml的第三方库 SwiftyXMLParser是在Swift下的一个xml解析第三方库,支持索引,支持keyValue形式访问, 环境和语言支持:iOS 8.0+ , Swift 2.0+ 使用: let urlStr = ApiUrl + "/" + xmlFile(newsType) Alamofire.request(.GE

JSP标准标签库(JSTL)--XML标签库 x

³在开发中XML解析的操作是非常烦琐的,幸运的是在JSTL中专门提供了用于XML解析的操作,这样用户就可以不用费力的去研究SAX或DOM等操作的使用,就可以轻松的进行XML文件的解析处理. XML标签: No. 功能分类标签名称描述 1 核心操作 <x:out> 输出XPath指定的内容 2 <x:parse> 进行XML解析 3 <x:set> 将内容保存在属性范围之中 4 流程控制 <x:if> 判断XPath指定的内容是否满足条件 5 <x:

Python的html和xml解析库Beautiful Soup

93、解析库之re，Beautifulsoup

本篇导航: 介绍基本使用遍历文档树搜索文档树总结 re模块在之前的python进阶中有讲过不再做过多的阐述,本篇为BeautifulSoup库的分析 20.collections模块和re模块(正则表达式详解) 一.介绍 Beautiful Soup 是一个可以从HTML或XML文件中提取数据的Python库.它能够通过你喜欢的转换器实现惯用的文档导航,查找,修改文档的方式.Beautiful Soup会帮你节省数小时甚至数天的工作时间.你可能在寻找 Beautiful Soup3 的文

Python_爬虫_BeautifulSoup网页解析库

BeautifulSoup网页解析库 from bs4 import BeautifulSoup 0.BeautifulSoup网页解析库包含的几个解析器 Python标准库[主要,系统自带;] 使用方法: BeautifulSoup(markup,"html.parser")[注:markup是html文档] Python的内置标准库案例: ` from bs4 import BeautifulSoup ` soup = BeautifulSoup.(html,'html.pa

python爬虫之解析库Beautiful Soup

Beautiful Soup4操作为何要用Beautiful Soup Beautiful Soup是一个可以从HTML或XML文件中提取数据的Python库.它能够通过你喜欢的转换器实现惯用的文档导航,查找,修改文档的方式, 是一个标签的形式,来进行查找的,有点像jquery的形式.提升效率,我们在进行爬虫开发的时候,进程会用到正则来进行查找过滤的操作,纯手动会及其浪费时间. Beautiful Soup示例摘自官网 html_doc = """ <html>

解析库之beautifulsoup模块

一介绍 Beautiful Soup 是一个可以从HTML或XML文件中提取数据的Python库.它能够通过你喜欢的转换器实现文档导航,查找,修改文档的方式,Beautiful Soup会帮你节省数小时甚至数天的工作时间,你可能在寻找 Beautiful Soup3 的文档,Beautiful Soup 3 目前已经停止开发,官网推荐在现在的项目中使用Beautiful Soup 4, 移植到BS4 #安装 Beautiful Soup pip install beautifulsoup4 #