[1] The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide, to varying degrees, the different explanatory factors of variation behind the data.
[2] For that reason, much of the actual effort in deploying machine learning algorithms goes into the design of preprocessing pipelines and data transformations that result in a representation of the data that can support effective machine learning.
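To make [2] concrete, here is a minimal sketch of such a preprocessing pipeline using scikit-learn; the library, the estimator choices, and the toy data are my own illustration, not from the quoted paper:

```python
# Illustrative only: chaining a data transformation with a learner, so the
# model sees a better-conditioned representation of the same data.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=200, n_features=10, random_state=0)

pipe = Pipeline([
    ("scale", StandardScaler()),    # transformation: zero mean, unit variance
    ("clf", LogisticRegression()),  # the actual learner
])
pipe.fit(X, y)
print(pipe.score(X, y))
```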
[3] However, the one-hot representation of a word suffers from data sparsity: for words that are rare in the labeled training data, the corresponding model parameters will be poorly estimated. Moreover, at test time, the model cannot handle words that do not appear in the labeled training data. These limitations of one-hot word representations have prompted researchers to investigate unsupervised methods for inducing word representations over large unlabeled corpora.
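A minimal sketch of the sparsity/OOV problem described in [3]; the vocabulary and sentences are made-up assumptions for illustration:

```python
# One-hot word vectors are indexed by the training vocabulary, so any word
# unseen in training has no index at all: the model cannot represent it.
import numpy as np

train_words = ["the", "cat", "sat", "on", "the", "mat"]
vocab = {w: i for i, w in enumerate(sorted(set(train_words)))}

def one_hot(word):
    """Map a word to a |V|-dimensional indicator vector."""
    if word not in vocab:
        raise KeyError(f"OOV word: {word!r}")
    vec = np.zeros(len(vocab))
    vec[vocab[word]] = 1.0
    return vec

print(one_hot("cat"))        # works: "cat" was seen during training
try:
    print(one_hot("dog"))    # "dog" never appeared in the training data
except KeyError as e:
    print("cannot represent:", e)
```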
[4] With the increase in available data, parallel machine learning has become an increasingly pressing problem.
[5] Given that the bandwidth of storage and network per computer has not been able to keep up with the growth in data, it has become ever more pressing to design data analysis algorithms that can perform most steps in a distributed fashion, without tight constraints on communication.
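One communication-light scheme in the spirit of [5] is one-shot parameter averaging: each worker runs SGD on its own data shard with no communication during training, and the local models are averaged in a single round at the end. The sketch below is illustrative only; the data, learning rate, and shard sizes are assumptions, and it is not presented as the method of any specific paper quoted here:

```python
# A minimal sketch of one-shot parameter averaging for linear regression:
# each worker fits its own shard with local SGD; one averaging step follows.
import numpy as np

rng = np.random.default_rng(0)
w_true = rng.normal(size=5)  # hypothetical ground-truth weights

def make_shard(n=1000):
    """Generate one worker's private data shard (synthetic)."""
    X = rng.normal(size=(n, 5))
    y = X @ w_true + 0.1 * rng.normal(size=n)
    return X, y

def local_sgd(X, y, epochs=5, lr=0.01):
    """Plain SGD on the squared loss, entirely local to one worker."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for i in rng.permutation(len(y)):
            grad = (X[i] @ w - y[i]) * X[i]
            w -= lr * grad
    return w

# Four workers train independently; communication is a single averaging round.
local_models = [local_sgd(*make_shard()) for _ in range(4)]
w_avg = np.mean(local_models, axis=0)
print("parameter error:", np.linalg.norm(w_avg - w_true))
```

The design point is that the only communication cost is one vector per worker at the end, which is why schemes of this shape tolerate high latency between machines.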
[6] Three recent papers attempted to break this parallelization barrier, each of them with mixed success.
[7] Unfortunately, these algorithms are not applicable to a MapReduce setting, since the latter is fraught with considerable latency and bandwidth constraints between computers.