Create and format Word documents using R software and Reporters package

Word document is one of the file formats widely used by many people, including researchers, for editing their text and tracking changes between different authors.

There are different solutions to generate a Word document from R software :

  • R2wd package : unfortunately it depends on statconnDCOM or RDCOMClient servers, and works on Windows only.
  • knitr + rmarkdown + pandoc : this is an easy way to quickly create a Word report but the output is very basic.
  • ReporteRs package, by David Gohel, provides easy to use functions to write and format Word documents. It can be also used to generate Word document from a template file with logos, fonts, etc. ReporteRs is Java-based solution, so it works on Windows, Linux and Mac OS systems.

The aim of this R tutorial is to show you how to easily and quickly export outputs (including data table, plots, paragraphs of text and R scripts) from R statistical software to a Microsoft Word document (.docx file format) using ReporteRs package.

Install and load the ReporteRs R package

ReporteRs can be installed and loaded as follow :

install.packages(‘ReporteRs‘) # Install
library(‘ReporteRs‘) # Load

Note that ReporteRs requires Java (>= 1.6) ; make sure you have an installed JRE

The version of Java installed on your computer, can be checked with this R code :

system("java -version")

Create a simple Word document

Word document can be created using the docx() function. Then, contents can be added to the document using the functions below:

  • addTitle: Add a title
  • addParagraph: Add paragraphs of text
  • addFlexTable: Add a table
  • addPlot: Add a plot generated in R
  • addImage: Add external images
  • addTOC: Add a table of contents
  • addRScript: highlight and add R code
  • addMarkdown: Add markdown
  • addPageBreak: Add a page break

After adding contents, you have to use the writeDoc() function to write the Word document into a .docx file.

Add texts : title and paragraphs of texts

Texts can be added as a title or a paragraph using the R functions addTitle() or addParagraph(). A simplified format of these functions are :

addTitle(doc, value, level=1) # Add titles
addParagraph(doc, value) # Add a paragraph of texts
  • doc : docx object.
  • value : the text to add as a title or a paragraph.
  • level : an integer specifying the heading level. e.g : 1 for title1, 2 for title2, 3 for title3, etc.

addTitle() and addParagraph() functions can be used as follow :

library(ReporteRs)
# Create a word document to contain R outputs
doc <- docx()
# Add a title to the document
doc <- addTitle(doc, "Simple Word document", level=1)
# Add a paragraph of text into the Word document
doc <- addParagraph(doc, "This simple Word document is created using R software and ReporteRs package. It contains just a simple text.")
# Add a sub title
doc <- addTitle(doc, "What is R language?", level = 2)
doc <- addParagraph(doc, "R is a language and environment for statistical computing and graphics. R provides a wide variety of statistical and graphical techniques, and is highly extensible.")
doc <- addTitle(doc, "What is ReporteRs", level = 2)
doc <- addParagraph(doc, "ReporteRs is a package to write and format easily a Word document from R software.")
# Write the Word document to a file
writeDoc(doc, file = "r-reporters-simple-word-document.docx")

Format the text of a Word document using R software

Text properties : font, color, size

Among the text properties available in a Word document, there are :

font family (e.g : “Arial”), font size (e.g : 11), font weight (e.g: bold), font style (e.g : italic), underlined text, vertical align(superscript, subscript) and color (e.g: “blue”).

These properties are shown in the figure below :

.

This section describes how to format and write a Word document using R software

Change the global font size and font family

The default font size and font family can be changed as follow :

options( "ReporteRs-fontsize" = 12,
         "ReporteRs-default-font" = "Arial")

Note that, this changing will be applied to the whole content of the Word document. An example is shown in the next section.

Change the appearance of a “Piece Of Text” (pot)

.

The function pot() [Pieces Of Text] is used to format text chunks before adding into the document. pot() function can be also used to create a hyperlink and a footnote. A simplified format is :

pot(value="", format = textProperties())
  • value : the text to be formatted
  • format : the properties to use for formatting the text

The possible values for the parameter format are the following functions :

  • textProperties() : the text formatting properties
  • textBold() : shortcut for bold text
  • textItalic() : shortcut for italic text
  • textBoldItalic() : shortcut for bold-italic text
  • textNormal() : shortcut for normal text

The arguments for these functions are :

  • color : font color; e.g : color=“#000000” or color = “black”.
  • font.size : a integer indicating the font size.
  • font.weight : the font weight. Possible values are “normal” or “bold”.
  • font.style : the font style. Possible values are “normal” or “italic”.
  • underlined : a logical value specifying if the text should be underlined.
  • font.family : the font family; e.g : “Arial”.
  • vertical.align : a character indicating font vertical alignments. Expected values are “baseline”" or “subscript” or “superscript”. Default value is baseline.
  • shading.color : background color of the text (e.g “#000000” or “black”)

pot() function can be used as follow :

# Formatted text
my_text <- ‘This is a ‘ +
         pot(‘formatted text‘, textProperties(color="blue")) +
        ‘ created using‘ + pot(‘ReporteRs‘, textBold()) + ‘package‘
# Create a hyperlynk
my_link <- pot( ‘Click here to visit STHDA website!‘,
          hyperlink = ‘http://www.sthda.com/english‘,
          format=textBoldItalic(color = ‘#428BCA‘, underline = TRUE ))

The R code below creates a Word document containing a formatted text and a hyperlink :

doc <- docx()
# Change the default font size and font family
options(‘ReporteRs-fontsize‘=12, ‘ReporteRs-default-font‘=‘Arial‘)
# Add a formatted paragraph of texts
#++++++++++++++++++++++++++++++
doc <- addTitle(doc, "Formatted text", level=1)
# Define a style to highlight a text
highlight_style <- textProperties(color=‘#1163A5‘,font.size = 20,
                font.weight = ‘bold‘, font.family = ‘Courier New‘ )
my_text = ‘This ‘ + pot(‘Word document‘, highlight_style) +
  ‘ is created using‘ +
  pot(‘ R software‘, textProperties(color="red", font.size=18)) +
  ‘ and‘+
  pot(‘ ReporteRs‘, textBoldItalic(color="#F0A91B", underlined=TRUE)) + ‘ package.‘
doc <- addParagraph(doc, my_text)
# Add a hyperlink
doc <- addTitle(doc, "STHDA Web site", level=1)
my_link <- pot(‘Click here to visit STHDA web site!‘,
          hyperlink = ‘http://www.sthda.com/english‘,
          format=textBoldItalic(color = ‘blue‘, underline = TRUE ))
doc <- addParagraph(doc, my_link)
# Write the Word document to a file
writeDoc(doc, file = "r-reporters-formatted-word-document.docx")

Add plots and images

The functions addPlot() and addImage() can be used for adding a plot or an external image to the document. addPlot() works with all R plots (base graphics, lattice, ggplot2 and grid).

The format of these 2 functions are :

# Add plots
# fun : R plotting function
# ... : other arguments to pass to the plotting function
addPlot(doc, fun, ...)
# Add images
# filename : path to the external image
addImage(doc, filename)

The R code below creates a Word document containing an introduction, a box plot, a histogram and an image (downloaded from STHDA web site).

doc <- docx() # Create a Word document
# Add a title
doc <- addTitle(doc, "Word document with plots and Images",
                level = 1)
# Add an introduction
doc <- addTitle(doc, "Introduction", level = 2)
doc = addParagraph(doc, value ="This Word document is created using R software and ReporteRs package. The goal of this section is to show you how to add plots and images into a Word document. This can be done easily using the functions addPlot() and addImages().")
# Add a box plot
doc <- addTitle(doc, "Box plot using R software", level = 2)
boxplotFunc<-function(){
  boxplot(len ~ dose, data = ToothGrowth,
      col = rainbow(3), main = "Guinea Pigs‘ Tooth Growth",
      xlab = "Vitamin C dose mg", ylab = "tooth length")
}
doc <- addPlot(doc, boxplotFunc)

doc <- addPageBreak(doc) # Go to the next page
# Add a histogram
doc <- addTitle(doc, "Histogram plot", level = 2)
doc <- addPlot(doc, function() hist(iris$Sepal.Width, col="lightblue"))
# Change point size of plotted text (in pixels, default = 12)
doc <- addTitle(doc, "Histogram with pointsize = 18", level = 2)
doc <- addPlot(doc, function() hist(iris$Sepal.Width,col="lightblue"), pointsize=18)
# Add an image
# +++++++++++++++++++++++++++
# download an image from STHDA web site
download.file(url="http://www.sthda.com/sthda/RDoc/figure/easy-ggplot2/ggplot2-histogram-demo.png",
              destfile="ggplot2-histogram-demo.png", quiet=TRUE)
doc <- addTitle(doc, "Image from STHDA web site", level = 2)
doc <- addImage(doc, "ggplot2-histogram-demo.png")
# Write the Word document to a file
writeDoc(doc, file = "r-reporters-word-document-with-plot.docx")

The Word document created by the R code above is available here : R software and ReporteRs package - Word document containing plots

  1. Note that, for addPlot() function, you can use the arguments width and height to specify the plot width and height in inches (default values are 6 X 6)
  2. Another optional argument of addPlot() function is par.properties which value can be parRight()parLeft()parJustify() for the plot alignments:
doc <- addPlot( doc = doc, fun = function(){
    barplot( 1:5, par.properties = parCenter() )}
  )
  1. For addImage() function, the allowed file formats are PNG, WMF, JPEG and GIF images..

Add a table

The function addFlexTable() is used to add a simple or customized table.

  1. The first step is to create a table using one of the functions below :
  • FlexTable() to create a ‘flexible’ table which can be easily formatted
  • vanilla.table() which is shortcut to quickly produce a nice FlexTable
  1. The second step is to add the created table into the Word document using addFlexTable() function as follow :
# doc : docx object
# flextable : FlexTable object
addFlexTable(doc, flextable)

Add a simple table

doc <- docx()
data<-iris[1:5, ]
# Add a first table : Default table
doc <- addTitle(doc, "Default table")
doc <- addFlexTable( doc, FlexTable(data))
doc <- addParagraph(doc, c("", "")) # 2 line breaks
# Add a second table, theme : vanilla table
doc <- addTitle(doc, "Vanilla table")
doc <- addFlexTable( doc, vanilla.table(data))
writeDoc(doc, file = "r-reporters-word-document-add-table.docx")

Add a zebra striped table

setZebraStyle() function can be used to color odd and even rows differently; for example, odd rows in gray color and even rowsin white color.

doc <- docx()
data<-iris[1:5, ]
# Zebra striped tables
doc <- addTitle(doc, "Zebra striped tables")
MyFTable <- vanilla.table(data)
MyFTable <- setZebraStyle(MyFTable, odd = ‘#eeeeee‘, even = ‘white‘)
doc <- addFlexTable( doc, MyFTable)
writeDoc(doc, file = "r-reporters-word-document-zebra.docx")

Add lists : ordered and unordered lists

Ordered and unordered lists can be added using addParagraph() function as follow :

doc = addParagraph(doc,
  value = c(‘Item 1‘, "Item 2", "Item 3")
  par.properties = parProperties(list.style = ‘ordered‘, level = 1 )
  • value : a set of texts to be added as a list
  • par.properties : the paragraph formatting properties
    • list.style : possible values are ‘unordered’ and ‘ordered’
    • level : a numeric value indicating the level of the item to be added in the list

Add simple lists

doc <- docx()
# Ordered list
doc <- addTitle(doc, "Ordered List")
doc <- addParagraph(doc, value= c("Item 1", "Item 2", "Item 3"),
          par.properties =  parProperties(list.style = ‘ordered‘))
# Unordered list
doc <- addTitle(doc, "Unordered List")
doc <- addParagraph(doc, value= c("Item 1", "Item 2", "Item 3"),
          par.properties =  parProperties(list.style = ‘unordered‘))
writeDoc(doc, file = "r-reporters-word-document-lists.docx")

Add multi-level lists

To simplify the code, we’ll first define some levels to be used for creating multi-level lists.

doc <- docx()
# Define some levels for ordered lists (ol)
ol1 = parProperties(list.style = "ordered", level = 1)
ol2 = parProperties(list.style = "ordered", level = 2)
# Define some levels for unordered lists (ul)
ul1 = parProperties(list.style = "unordered", level = 1)
ul2 = parProperties(list.style = "unordered", level = 2)
# Multi-lvel ordered list
doc <- addTitle(doc, "Ordered List")
doc <- addParagraph(doc, value= "Item 1", par.properties =  ol1)
doc <- addParagraph(doc, value= "Item 1.1", par.properties =  ol2)
doc <- addParagraph(doc, value= "Item 1.2", par.properties =  ol2)
doc <- addParagraph(doc, value= "Item 2", par.properties =  ol1)
# Multi-lvel unordered list
doc <- addTitle(doc, "Unordered List")
doc <- addParagraph(doc, value= "Item 1", par.properties =  ul1)
doc <- addParagraph(doc, value= "Item 1.1", par.properties =  ul2)
doc <- addParagraph(doc, value= "Item 1.2", par.properties =  ul2)
doc <- addParagraph(doc, value= "Item 2", par.properties =  ul1)
writeDoc(doc, file = "r-reporters-word-document-multilevel-lists.docx")

Add a footnote to a Word document

footnote is a set of paragraphs written at the bottom of the Word document.

The steps for adding a footnote into a document are :

  1. Create a footnote object using the function Footnote()
  2. Add a text to the footnote using the function addParagraph()
library(ReporteRs)
doc <- docx()
# create footnotes
footnote1 <- Footnote( ) # footnote1 about R
footnote1 <- addParagraph(footnote1, "R is a free software for statistical computing and graphics.")
footnote2 <- Footnote( ) # footnote2 about ReporteRs package
footnote2 <- addParagraph(footnote2, "ReporteRs is an R package to write and format easily a Word document.")
# Use the footnotes when writing a paragraph into the document
doc <- addTitle(doc, "Word document with a footnote", level=1)
doc <- addParagraph(doc,
      "This Word document is created using " +
      pot("R software", footnote=footnote1)+ " and "+
      pot("ReporteRs", footnote=footnote2) +" package."
      )
writeDoc(doc, file = "r-reporters-word-document-footnote.docx")

 

Add R scripts

The function addRScript() can be used as follow :

doc <- docx()
r_code <- ‘summary(cars$dist)
x <- rnorm(100)
hist(x)
‘
# Change the backgroud-color
doc <- addRScript(doc, text = r_code,
    par.properties= parProperties(shading.color = ‘gray90‘))
writeDoc(doc, file = "r-reporters-word-add-r-code.docx")

Add a table of contents into a Word document

To add a table of contents, the function addTOC() can be used.

Note that, when a table of content (TOC) is added into a Word document, a message box is displayed when opening the file for the first time. This message asks if you want to update TOC entries. This is not an error and you should click ‘Yes’ to update TOC entries. You should save the document to avoid this to happen the next time.

doc <- docx() # Create a Word document
# Add a title
doc <- addTitle(doc, "Create a Word document with TOC", level = 1)
# Add a table of contents
doc <- addTOC(doc)
doc <- addPageBreak(doc) # go to the next page
# Add an introduction
doc <- addTitle(doc, "Introduction", level = 2)
doc = addParagraph(doc, value ="This Word document is created using R software and ReporteRs package. The goal of this section is to show you how to add a table of contents into a Word document. This can be done easily using the function addTOC.")
# Add a box plot
doc <- addTitle(doc, "Box plot using R software", level = 2)
boxplotFunc<-function(){
  boxplot(len ~ dose, data = ToothGrowth,
      col = rainbow(3), main = "Guinea Pigs‘ Tooth Growth",
      xlab = "Vitamin C dose mg", ylab = "tooth length")
}
doc <- addPlot(doc, boxplotFunc)

doc <- addPageBreak(doc) # go to the next page
# Add plots
#+++++++++++++++++++++
doc <- addTitle(doc, "Basic plots using R software", level = 2)
doc <- addParagraph(doc, "R is a free software for plotting and data analysis. This chapter contains examples of graphs generated using R.")
# Add a histogram
doc <- addTitle(doc, "Histogram", level = 3)
doc <- addPlot(doc, function() hist(iris$Sepal.Width,
                                  col="lightblue"))
doc <- addPageBreak(doc) # go to the next page
# Add a bar plot
doc <- addTitle(doc, "Bar plot", level = 3)
doc <- addPlot(doc, function() barplot(VADeaths))
# Write the Word document to a file
writeDoc(doc, file = "r-reporters-word-document-toc.docx")

The Word document created by the R code above is available here : R software and ReporteRs package - Word document with a table of contents

Infos

This analysis has been performed using R (ver. 3.1.0).

You can read more about ReporteRs and download the source code at the following link :

GitHub (David Gohel): ReporteRs

Enjoyed this article? I’d be very grateful if you’d help it spread by emailing it to a friend, or sharing it on Twitter, Facebook or Linked In.

Show me some love with the like buttons below... Thank you and please don‘t forget to share and comment below!!

原文地址:https://www.cnblogs.com/nkwy2012/p/9056940.html

时间: 2024-08-27 03:57:17

Create and format Word documents using R software and Reporters package的相关文章

Microsoft Word Documents from ASP.NET

原网页链接: http://www.codeproject.com/Articles/3959/Microsoft-Word-Documents-from-ASP-NET 官网demo链接;可以不用注册下载; http://download.csdn.net/detail/sat472291519/7794739 在ASP.NET中创建和打开Microsoft Word文档 简介 在一个ASP.NET项目中需要创建和打开来操作Microsoft Word文档. 文章演示了如何使用Microsof

Interop.Word Documents.Open is null

问题描述 程序在Windows Server 2012 R2调用Word组件正常,但是换到Windows Server 2008 R2之后,程序异常. 代码 Microsoft.Office.Interop.Word.Application word = new Microsoft.Office.Interop.Word.Application(); Document doc = word.Documents.Open(filename); //doc is null 解决 step1 创建文件

a note of R software write Function

Functionals “To become significantly more reliable, code must become more transparent. In particular, nested conditions and loops must be viewed with great suspicion. Complicated control flows confuse programmers. Messy code often hides bugs.” — Bjar

IIS 调用Microsoft.Office.Interop.Word.Documents.Open 返回为null

控制面板->管理工具->组件服务->计算机->我的电脑->DCom配置->找到Microsoft Word文档 之后 单击属性打开此应用程序的属性对话框. 2. 单击标识选项卡,然后选择交互式用户. 3.单击"安全"选项卡,分别在"启动和激活权限"和"访问权限"组中选中"自定义",然后 自定义->编辑->添加ASP.NET账户和IUSER_计算机名 * 这些帐户仅在计算机上安装有

数据库CRUD操作:C:create创建(添加)、R:read读取、U:update:修改、D:delete删除

1.注释语法:--,#2.后缀是.sql的文件是数据库查询文件3.保存查询4.在数据库里面 列有个名字叫字段   行有个名字叫记录 CRUD操作:create 创建(添加)read 读取update 修改delete 删除 1.添加数据insert into Info values('p009','张三',1,'n001','2016-8-30 12:9:8') ; 给特定的列添加数据insert into Info (code,name) values('p010','李四');自增长列的处理

python处理word文件:win32com用法详解

目标:用python处理doc文件 方法:引入win32com模块 ************************************************************************** 一.安装 ************************************************************************** 首先要先下载安装win32com模块(起先在linux下装不成功,后在windows下面成功了...) 下载地址:http

python读取word表格内容(1)

1.首页介绍下word表格内容,实例如下: 每两个表格后面是一个合并的单元格 2.引入win32com模块 3.具体代码 1 #http://www.jb51.net/article/70318.htm 2 #http://www.cnblogs.com/graphics/articles/2953665.html 3 #http://shouce.jb51.net/python/ 4 import win32com,re 5 from win32com.client import Dispat

《R in Nutshell》 读书笔记(连载)

R in Nutshell 前言 例子(nutshell包) 本书中的例子包括在nutshell的R包中,使用数据,需加载nutshell包 install.packages("nutshell") 第一部分:基础 第一章 批处理(Batch Mode) R provides a way to run a large set of commands in sequence and save the results to a file. 以batch mode运行R的一种方式是:使用系统

VSTO:使用C#开发Excel、Word【7】

                        第2章Office解决方案介绍 Office解决方案的三种基本模式现在,您了解Office对象模型的基本模式,本章将介绍开发人员如何模拟和构建其Office解决方案. 使用Office构建的大多数解决方案都遵循三种模式之一 Office 自动化执行 Office 加载项 Office 文档背后的代码 一个自动化可执行文件是一个独立于Office控制和自动化Office应用程序的程序.可以使用Visual Studio等开发工具创建自动化可执行文件.