Combining Lexical and Grammatical Features to Improve Readability Measures for First and Second Language Texts.-paper

http://www.aclweb.org/anthology/N07-1058

Volume:Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference
Authors:Michael Heilman | Kevyn Collins-Thompson | Jamie Callan | Maxine Eskenazi
Month:April
Year:2007
Venues:NAACL | HLT

数据不公开

1、introduction

L1英语学习者而言，英语水平很高的时候的语法能力其实和开始学英语的时候差不多，因为他们的语法是在使用中互动中学会的，而L2是在课本中学会的，所以L2高级学习者的语法可能不可强。所以grammer对于L2的readability的预测和评估很重要，比如动词时态、被动时态等。

2、language model readability prediction for first language texts

统计语言模型比传统公式的好处：

1）短文本和web文本上的准确率更高

2）给出概率分布而不是一个预测值

3）语言模型可以提供更多关于文本中单词相对难度的数据

我们的统计模型用的是多项式贝叶斯分布（就跟上一篇paper一样）

虽然unigram是weak model，但是会比tri、bi这种更复杂的模型要求更少的数据集

3、grammatical construction readability prediction for second language texts

3.1 features for grammer-based prediction

斯坦福parser用来产生constituent structure trees

PCFG scores可以用来过滤掉预料中有问题的文本

默认训练集是Penn Treebank来parser，因为该文本和L2学习者的阅读材料是相近的

原文地址：https://www.cnblogs.com/rosyYY/p/10164809.html

时间： 2024-11-13 10:58:16

Combining Lexical and Grammatical Features to Improve Readability Measures for First and Second Language Texts.-paper的相关文章

Learning JavaScript Design Patterns -- A book by Addy Osmani

Learning JavaScript Design Patterns A book by Addy Osmani Volume 1.6.2 Tweet Copyright © Addy Osmani 2015. Learning JavaScript Design Patterns is released under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 unported license. It

10 Easy Steps to a Complete Understanding of SQL

原文出处:http://tech.pro/tutorial/1555/10-easy-steps-to-a-complete-understanding-of-sql(已经失效,现在收集如下) Too many programmers think SQL is a bit of a beast. It is one of the few declarative languages out there, and as such, behaves in an entirely different w

Java开源框架推荐（全）

Build Tool Tools which handle the buildcycle of an application. Apache Maven - Declarative build and dependency management which favors convention over configuration. It's preferable to Apache Ant which uses a rather procedural approach and can be di

My ECMAScript 7 wishlist

With ECMAScript 6 now feature complete, any further changes to the core of JavaScript will happen in ECMAScript 7. I’m pretty excited about the changes coming in ECMAScript 6 and there are already some great ECMAScript 7 features such as Object.obser

Shell Style Guide

Shell Style Guide Revision 1.26 Paul ArmstrongToo many more to mention Each style point has a summary for which additional information is available by toggling the accompanying arrow button that looks this way: ▽. You may toggle all summaries with th

Linux Shell常用函数

Functions for Transforming Text Functions allow you to do text processing in the makefile to compute the files to operate on or the commands to use. You use a function in a function call, where you give the name of the function and some text (the arg

2006-7有价值的Kean博客——Calling ObjectARX functions from a .NET Application（PInvoke）

One of the really compelling features of .NET is its ability to call "legacy" unmanaged C++ APIs. I say "legacy", but we use this facility regularly to call APIs that are far from being considered defunct (the C++ version of ObjectARX

PLSQL Coding Standard

Naming and Coding Standards for SQL and PL/SQL "The nice thing about standards is that you have so many to choose from." - Andrew S Tanenbaum Introduction This document is mentioned in a discussion on the OTN forums. One of the first comments be

Make a Website

Introduction Web pages are created using HTML and CSS. HTML is used to establish a page's structure. It also lets us add text, links and images. CSS is used to control the design and layout of the page. HTML elements HTML elements are the building bl

猜你喜欢

solr5.5.4单机版安装

Solr 编辑 Solr是一个独立的企业级搜索应用服务器,它对外提供类似于Web-service的API接口.用户可以通过http请求,向搜索引擎服务器提交一定格式的XML文件,生成索引:也可以通过H ...

js实现关于数据字典的使用和数据存放的策略

项目中的页面经常会和数据字典的值进行查询,一个一个去用ajax去请求,无疑很浪费时间,当时我的想法是做一个js的工具类,里面放这么几个方法, 1.getAll() ...

4、HQL

1.基本查询 1.不带条件的查询 2.带条件的查询 3.通过参数进行查询 4. 通过命名参数进行查询 5.查询空元素 2.常用查询 1.列表查询(in ()) 2.投影查询 3.投影一个元素查询(注意 ...

上喂恫再纳吠内嚎迂焕鼻韧琅透再

www.ebay.com/cln/188v1052v1051-pblhrnjhj/20141130/138340777013 www.ebay.com/cln/188v1052v1051-prbrxt ...

TeamViewer 的早期版本下载

对于10及上以的:https://www.teamviewer.com/zhcn/download/previous-versions/ 5~9的版本下载:https://community.team ...

洛谷OJ P1379 八数码难题解题报告

洛谷OJ P1379 八数码难题解题报告 by MedalPluS 题目描述在3×3的棋盘上,摆有八个棋子,每个棋子上标有1至8的某一数字.棋盘中留有一个空格,空格用0来表示.空格周围的棋子可 ...

日期提取函数EXTRACT

EXTRACT extracts and returns the value of a specified datetime field from a datetime or interval exp ...

子类继承父类后想要扩展父类方法

1 >>> class PClass(object): 2 def setInfo(self,sex='Male'): 3 self.gender = sex 4 5 6 >& ...

GBK UTF-16 UTF-8 编码表

GBK UTF-16 UTF-8 ================== D2BB 4E00 E4 B8 80 一 B6A1 4E01 E4 B8 81 丁 C6DF 4E03 E4 ...

用定时器在某个时间点删除文件夹

package cn.idcast8; import java.io.File; import java.text.ParseException; import java.text.SimpleDat ...

一九爱心系统定制开发

一九爱心系统定制平台开发 150-1305-3356 不久前,上海长宁区茅台路旁一家便利店也设置过一个共享冰箱,每天晚上7点,冰箱前都会排起长长的队伍,并有专门的店员维持秩序,分发免费食物. [一九爱 ...

Lucene5学习之使用MMSeg4j分词器

分类:程序语言|标签:C|日期: 2015-05-01 02:00:24 MMSeg4j是一款中文分词器,详细介绍如下: 1.mmseg4j 用 Chih-Hao Tsai 的 MMSeg 算法(ht ...

指令篇：连接文件（软连接和硬链接）___ ln

软连接:相当于Windows里面的快捷方式,删除了原文件之后,会影响连接文件.软连接可以在磁盘上面跨分区把文件aa 软连接到文件aa1,软连接之后查看改文件,里面有一个箭头:aa1 -> aa ...

.ashx datatable转excel

using System;using System.Collections;using System.Collections.Generic;using System.Data;using Syste ...

zabbix3.0.3 自定义 agent rpm 包

由于大家的系统环境不同,以及想对agent做一些基础的配置更改,这时候就要用到自定义rpm了,下面是我自己在生产环境使用的 SPEC 文件(通俗一点的,实用版) 打包的流程就无需多言,不熟悉的请自行g ...

Android 开源框架

不推荐使用UltimateAndroid.KJFrameForAndroid.ThinkAndroid.Afinal.xUtil等这种集成网络请求.图片加载.数据库ORM.视图依赖注入.UI框架等的集 ...

HDU 4336——Card Collector——————【概率dp】

Card Collector Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 32768/32768 K (Java/Others)To ...

用Vmware安装centos5

Vmware安装过程就不详述了,这里从创建虚拟机开始记录. 选择创建虚拟机 ? 下一步 ? 选择稍后安装 ? 选择安装的操作系统版本,需要说明的是,CentOs 5 就是RHEL 5 ? 设置虚拟机名 ...

用 PowerShell收集服务器日检报告，并发邮件给管理员

-----提供AD\Exchange\Lync\Sharepoint\CRM\SC\O365等微软产品实施及外包,QQ:185426445.电话18666943750 博文Powershell程序及部 ...

前端性能优化（DOM篇）

原文链接:https://segmentfault.com/a/1190000000490322 缓存DOM对象 JavaScript的DOM操作可以说是JavaScript最重要的功能,我们经常要根 ...

专题

随机推荐

© 2024 憋错料 | info#biecuoliao.com | 11 q. 0.025 s.