LUBM (Lehigh University Benchmark) tutorial

This short tutorial shows, step by step, how to use LUBM to benchmark your Semantic Web system.

1. Here is the official site of LUBM: http://swat.cse.lehigh.edu/projects/lubm/

2. Download the following files:

- the Univ-Bench ontology: http://swat.cse.lehigh.edu/onto/univ-bench.owl

- the data generator (UBA) v1.7: http://swat.cse.lehigh.edu/projects/lubm/uba1.7.zip

- the tester (UBT) v1.1: http://swat.cse.lehigh.edu/projects/lubm/ubt1.1.zip

3. LUBM provides a small ontology of the university domain (departments, courses, professors). A data generator (UBA) generates ABox statements of arbitrary size. 14 queries, available in SPARQL 1.0, are provided to test the system's response time.

4. If you are using Linux, you also need to download a fix for the UBA data generator: http://swat.cse.lehigh.edu/projects/lubm/GeneratorLinuxFix.zip

This fix makes sure the generator works with Linux paths.

The reason is that UBA builds its output paths with "\", but Linux uses "/" as the path separator.

Unzip the fix. The file inside is also called "Generator.java".

Unzip UBA v1.7, go to the src/edu/lehigh/swat/bench/uba directory, and replace the original Generator.java with the fixed Generator.java.

If you don't apply the fix, the generator reports output paths like this:

/home/rui/Desktop\University0_18.daml generated

But when you look in Desktop, the file is not there. That's because Linux does not treat "\" as a path separator; it is an ordinary character in a file name.
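To see where the unpatched output actually ends up, here is a minimal Python sketch. It assumes the generator simply joins the working directory and the file name with a literal "\", which is what the message above suggests; the variable names are illustrative only.

```python
from pathlib import PurePosixPath, PureWindowsPath

# Path as printed by the unpatched generator (copied from the message above).
reported = "/home/rui/Desktop\\University0_18.daml"

# On Linux only "/" separates directories, so the backslash stays in the file name.
posix = PurePosixPath(reported)
print(posix.parent)  # /home/rui
print(posix.name)    # Desktop\University0_18.daml

# Under Windows rules the same string splits the way the generator intended.
windows = PureWindowsPath(reported)
print(windows.name)  # University0_18.daml
```

Under that assumption the data is not lost: it would sit one level above the directory you ran from, in files whose names start with "Desktop\".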

5. Compile the Java source files.

My unzipped UBA v1.7 sources are in the /home/rui/Desktop/UBA1.7/src/edu/lehigh/swat/bench/uba directory.

There is also a classes directory at /home/rui/Desktop/UBA1.7/classes ..., but we are not going to use it.

Go to /home/rui/Desktop/UBA1.7/src/edu/lehigh/swat/bench/uba,

then run the compile command:

javac -d /home/rui/Desktop *.java

This command compiles all the Java files and stores the compiled classes under /home/rui/Desktop, in subdirectories that follow the package name.

Afterwards, go to Desktop and you will see a folder called "edu".

Open that folder and you will find the class files at edu/lehigh/swat/bench/uba/*.class. These class files are what we want.
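The mapping from package name to folder can be sketched in a few lines of Python. The helper name class_file_for is hypothetical; it only mirrors the rule that javac -d creates one directory level per package component.

```python
from pathlib import PurePosixPath


def class_file_for(output_dir: str, class_name: str) -> PurePosixPath:
    """Return where `javac -d output_dir` puts the class file of a fully qualified class."""
    # Each dot-separated package component becomes one directory level.
    parts = class_name.split(".")
    return PurePosixPath(output_dir).joinpath(*parts[:-1], parts[-1] + ".class")


print(class_file_for("/home/rui/Desktop", "edu.lehigh.swat.bench.uba.Generator"))
# /home/rui/Desktop/edu/lehigh/swat/bench/uba/Generator.class
```

This is also why the generator is started from Desktop: unless a classpath is set, java looks up edu.lehigh.swat.bench.uba.Generator relative to the current directory.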

6. Now we can run the data generator.

From the Desktop directory, run the command:

java edu.lehigh.swat.bench.uba.Generator [-univ <value>] [-index <value>] [-seed <value>] [-daml] -onto <value>

options:
      -univ   number of universities to generate; 1 by default
      -index  starting index of the universities; 0 by default
      -seed   seed used for random data generation; 0 by default
      -daml   generate DAML+OIL data; OWL data by default
      -onto   URL of the univ-bench ontology: http://swat.cse.lehigh.edu/onto/univ-bench.owl (or http://www.lehigh.edu/~zhp2/2004/0401/univ-bench.owl, which is easier for query tests)
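If you want to script the generator run, the options above can be assembled into an argument list. The following is only a sketch: build_generator_command and DEFAULT_ONTO are hypothetical names, and it assumes that -daml is a bare switch without a value, as its description suggests.

```python
import shlex

# One of the two ontology URLs listed in the options above.
DEFAULT_ONTO = "http://swat.cse.lehigh.edu/onto/univ-bench.owl"


def build_generator_command(univ=1, index=0, seed=0, daml=False, onto=DEFAULT_ONTO):
    """Assemble the argument list for the UBA generator (hypothetical helper)."""
    cmd = [
        "java", "edu.lehigh.swat.bench.uba.Generator",
        "-univ", str(univ),
        "-index", str(index),
        "-seed", str(seed),
    ]
    if daml:
        # Assumption: -daml takes no value and only switches the output format.
        cmd.append("-daml")
    cmd += ["-onto", onto]
    return cmd


cmd = build_generator_command(univ=5)
print(shlex.join(cmd))
# java edu.lehigh.swat.bench.uba.Generator -univ 5 -index 0 -seed 0 -onto http://swat.cse.lehigh.edu/onto/univ-bench.owl
```

To actually start the generator, the list could be passed to subprocess.run(cmd, cwd="/home/rui/Desktop", check=True), so that java runs from the directory that contains the "edu" folder.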

7. You now have the generated data files.

8. The 14 queries are available in SPARQL 1.0 here: http://swat.cse.lehigh.edu/projects/lubm/queries-sparql.txt
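Measuring response time for these queries can be sketched as follows. This is not the UBT tester, only a minimal illustration: time_query and fake_run_query are hypothetical names, and fake_run_query stands in for whatever call sends a SPARQL query to your own system.

```python
import time
from statistics import median


def time_query(run_query, query_text, repeats=5):
    """Run one query several times and return the median wall-clock time in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        run_query(query_text)  # placeholder for the call into the system under test
        timings.append(time.perf_counter() - start)
    return median(timings)


def fake_run_query(query_text):
    # Stand-in for a real SPARQL engine: it only pretends to evaluate the query.
    return len(query_text)


elapsed = time_query(fake_run_query, "SELECT ?x WHERE { ?x ?p ?o }")
print(f"median response time: {elapsed:.6f} s")
```

Taking the median over several runs keeps a single slow run (for example, a cold cache) from dominating the result.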

Time: 2024-12-28 03:14:18
