Predictive Analytics for Business 2

Methodology Map:

The methodology map helps us determine which method to use to solve a business problem:

1. Non-predictive problems:
  a. Geospatial analysis: this type of problem uses location-based data (coordinates, distance, geographic location, etc.) to drive conclusions.

  b. Segmentation: grouping similar data together.

  c. Aggregation: calculating a value across a group or dimension; commonly used in data analysis.

  d. Descriptive statistics: using statistics to describe the data (mean, median, mode, standard deviation, interquartile range).
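As a rough illustration of the aggregation and descriptive statistics items above, here is a minimal Python sketch using only the standard library. The function names (`aggregate_sum`, `describe`) and the sample data are made up for this example and are not from the course material.

```python
import statistics
from collections import defaultdict


def aggregate_sum(rows, key, value):
    """Aggregation: sum `value` across each group identified by `key`."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


def describe(values):
    """Descriptive statistics for a list of numbers."""
    # quantiles(n=4) returns the three quartile cut points (Q1, Q2, Q3)
    q1, _, q3 = statistics.quantiles(values, n=4)
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "iqr": q3 - q1,                     # interquartile range
    }


# Hypothetical sample data, for illustration only
rows = [
    {"region": "North", "sales": 10.0},
    {"region": "South", "sales": 5.0},
    {"region": "North", "sales": 7.5},
]
totals = aggregate_sum(rows, "region", "sales")

sales = [12, 15, 15, 18, 22, 30, 41]
summary = describe(sales)
```

In practice a library such as pandas would usually handle both steps, but the plain-Python version shows what each term means.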

2. Data rich vs. data poor:

  How do we determine whether a problem is data rich or data poor?

  It depends on whether we have data on what we are trying to predict.

If data poor:

  Set up an experiment (A/B test).
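The notes do not say how to evaluate an A/B test. One common approach, sketched here as an assumption and not as the course's prescribed method, is a two-proportion z-test on the conversion rates of the two groups. The function name and the counts below are hypothetical.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Compare conversion rates of groups A and B.

    Returns the z statistic and a two-sided p-value.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both groups convert equally
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical experiment: 100/1000 conversions in A, 150/1000 in B
z, p_value = two_proportion_z_test(100, 1000, 150, 1000)
```

A small p-value suggests the difference between the two variants is unlikely to be due to chance alone; the threshold (often 0.05) is a choice made before running the experiment.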


3. Regression model vs. classification model

4. Regression (numeric) model:

  a. Continuous model.

  b. Time-based model.

5. Classification (non-numeric) model:

  a. Binary (yes or no)

  b. Non-binary (multiple options)
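The whole map can be read as a short decision procedure. The sketch below encodes the branches listed in these notes; the function name `choose_method` and its parameters are hypothetical, introduced only for illustration.

```python
def choose_method(predictive, data_rich=True, target_numeric=True,
                  time_based=False, n_classes=2):
    """Walk the methodology map and return the suggested approach."""
    if not predictive:
        # Step 1: non-predictive problems
        return ("non-predictive analysis (geospatial, segmentation, "
                "aggregation, descriptive statistics)")
    if not data_rich:
        # Step 2: no data on the outcome we want to predict
        return "set up an experiment (A/B test)"
    if target_numeric:
        # Step 4: regression models predict a numeric outcome
        if time_based:
            return "time-based regression model"
        return "continuous regression model"
    # Step 5: classification models predict a non-numeric outcome
    if n_classes == 2:
        return "binary classification model"
    return "non-binary classification model"


# Example: predicting whether a customer will churn (yes or no)
suggestion = choose_method(predictive=True, data_rich=True,
                           target_numeric=False, n_classes=2)
```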

Time: 2024-08-26 09:21:36
