Aspose.Words可以通过使用[Document]构造函数和其他文档格式一样输入纯文本数据。
Example
输入一个纯文本文件到一个Aspose.Words文档对象里面。
C#
using System; using System.IO; using System.Reflection;using System.Text; using Aspose.Words; namespace LoadTxt { class Program { public static void Main(string[] args) { // Sample infrastructure. string exeDir = Path.GetDirectoryName(Assembly.GetExecutingAssembly().Location) + Path.DirectorySeparatorChar;string dataDir = new Uri(new Uri(exeDir), @"../../Data/").LocalPath; // The encoding of the text file is automatically detected.Document doc = new Document(dataDir + "LoadTxt.txt"); // Save as any Aspose.Words supported format, such as DOCX. doc.Save(dataDir + "LoadTxt Out.docx"); } } }
Visual Basic
Imports Microsoft.VisualBasic Imports System Imports System.IO Imports System.ReflectionImports System.Text Imports Aspose.Words Namespace LoadTxt Friend Class Program Public Shared Sub Main(ByVal args() As String) ‘ Sample infrastructure. Dim exeDir As String = Path.GetDirectoryName(System.Reflection.Assembly.GetExecutingAssembly().Location) + Path.DirectorySeparatorCharDim dataDir As String = New Uri(New Uri(exeDir), "../../Data/").LocalPath ‘ The encoding of the text file is automatically detected. Dim doc As New Document(dataDir & "LoadTxt.txt") ‘ Save as any Aspose.Words supported format, such as DOCX. doc.Save(dataDir & "LoadTxt Out.docx") End Sub End Class End Namespace
文本导入功能
纯文本格式是一种基本的格式,不需要高级的文本处理器查看或编辑,然而一些纯文本文件试图证明更复杂的格式例如列表和缩进。例如列表可以表示为一系列每个从相同的字符开始的线。
Aspose.Words试图检测和加载一些特性进入一个新文档例如等价的Microsoft word功能而不是纯文本。
下表显示了文本导入引擎的关键特性:
Feature | Details |
Text encoding | The following encoding are supported: Latin1 BigEndianUnicode UTF-16 UTF-7 UTF-8 |
Import of ordered lists | 1.Arabic number with dot or right parenthesis e.g 1. or 2). Multilevel list are supported only supported when using dot. 2.Uppercase or lowercase Latin letter with dot or right parenthesis e.g a. or b). |
Import of unordered lists | Unordered lists are imported from consecutive lines which start with any of the following characters: *,--, o, . |
Paragraph indentation | Left indent and first line indent are detected and imported for paragraphs using appropriate number space characters at the beginning of the paragraph. |
Paragraph detection | Rules for detecting a new paragraph start: 1.If next line left indent isn’t equal with the current paragraph’s left indent. 2.An empty line starts a new paragraph. 3.Any list detected starts a new paragraph. |
样本转换
样本输入(纯文本文件)
输出文档
文本文件加载到Aspose的结果,保存为如下文档。
注意,前面的空间解释为缩进,列表被加载适当的列表功能。
查看更多Aspose.Words使用教程
时间: 2024-11-13 15:00:58