pkb contents > bi | just under 3693 words | updated 12/30/2017

1. What is BI?

Business intelligence (BI) systems are a type of management information system (MIS) that supports managerial (strategic and operational) decision-making. Sharda et al. (2014) offer the “business pressures-responses-support” model of BI, in which:

1.1. Does BI have value?

In theory, BI adds value by improving decisions. BI may enable organizations to answer their questions faster, or to pose new questions and gain new insights. Faster answers may support faster actions; new insights may indicate and/or support new courses of action. But very clearly, the value of BI depends on (1) the quality of the data being fed into the system; (2) the quality of the analysis performed on the data; (3) the capacity to turn analysis into decisions, and most fundamentally, (4) the capacity to act on decisions. (Senge's concept of a learning organization is relevant here---it is an organization that can not only make and act on decisions, but also reflect on outcomes and improve them through iteration.)

1.2. History of BI

Per Sharda et al. (2014):

Also:

2. BI systems

2.1. Generic BI system architecture

2.1.1. Food system metaphor

2.2. BI system capabilities

Per Sharda et al. (2004), BI tools provide varying combinations of the following broad functionalities:

Gartner's Magic Quadrant for BI and Analytics Platforms (Sallam et al., 2017) compares software across the following capabilities:

2.3. Implementing BI systems

2.3.1. Common problems with BI initiatives

Per Sharda et al. (2014, pp. 73-74):

2.3.2. Scoping BI systems

2.3.2.1. Maturity models

A Big Data maturity model, per Zhu et al. (2014, p. 26):

The Data Warehousing Institute (TDWI) also has maturity models for:

2.3.2.2. Big Data

See notes on big data.

2.3.2.3. Real-time BI

Per Sharda et al. (2014, p. 81):

Traditional DW Environment Active DW Environment
Strategic decisions only Strategic and tactical decisions
Results sometimes hard to measure Results measured with operations
Daily, weekly, monthly data currency acceptable; summaries often appropriate Only comprehensive detailed data available within minutes is acceptable
Moderate user concurrency High number (1000 or more) of users accessing and querying the system simultaneously
Highly restrictive reporting used to confirm or check existing processes and patterns; often uses predeveloped summary tables or data marts Flexible ad hoc reporting, as well as machine-assisted modeling (e.g., data mining) to discover new hypotheses and relationships
Power users, knowledge workers, internal users Operational staffs, call centers, external users

3. BI technologies

3.1. ETL

Data must be extracted from operational systems; transformed so that it is clean, conformant with data quality standards, and aligned with the logical structure of the data warehouse; and finally loaded into the data warehouse. Per Sharda et al. (2014), important factors to consider in selecting ETL tools:

3.2. Data warehousing

A data warehouse is a data store that is used to

Per Sharda et al., many data warehouses have the following characteristics:

3.2.1. Why a data warehouse?

Per Sharda et al. (2014, p. 47):

3.2.2. Dimensional modeling

Dimensional modeling is data modeling to optimize retrieval (read rather than write). Star schema (denormalized) and snowflake schema (normalized) are common.

3.2.3. Various data warehouse architectures

3.2.3.1. Choosing an architecture

Per Sharda et al. (2014):

More factors, from Ariyachandra and Watson (2005) qtd in Sharda et al. (2014, p. 55):

3.2.4. Data warehouse performance

Per Sharda et al. (2014, pp. 304-305):

3.3. OLAP

"Simply, OLAP is an approach to quickly answer ad hoc questions by executing multidimensional analytic queries against organizational data repositories" (Sharda et al., 2014, p. 69). The disctinction between transaction and analytics databases arises from the current state of computer science, viz., you must optimize for either reads or writes. In addition to this basic distinction, there are subtypes of OLAP databases (HTAP, MOLAP, ROLAP, etc.) with varying functionality.

Name AKA Function Goals
OLTP operational database captures each record: emails, credit card transactions, webpage views, … efficiency, control
OLAP data warehouse ops --> data warehouse --> OLAP --> UI/dashboard aggregation, efficiency, accuracy, access

To enable OLAP, data is stored in multidimensional cubes. These cubes can be efficiently sliced on a single dimension or diced on several; a user can drill down or up for different levels of detail; a user can roll-up a dimension, running calculations on it and its relationships; and a user may pivot to "change the dimensional orientation of a report or ad hoc query-display page" (Sharda et al., 2014, p. 71).

3.4. Interpreting the data

3.4.1. Analytic roles

3.4.1.1. Business analysts

BA is a field, and its practitioners---business analysts---are perhaps the frontline users of data from BI systems; they play a major role in translating this data into action by relating it back to business processes and decisions. BABOK is the gold standard description of BA skills, but Brandenberg (n.d.) offers the following short version:

3.4.1.2. Data scientists

(see notes on data science for an overview of data science skills)

Versus BI and BA:

3.4.2. Analytic deliverables

Per Sharda et al. (2014):

According to Sharda et al. (2014) and summarized here, SAS published a white paper describing different "levels" of analytics:

Standard Reporting
  • Historical perspective
  • Standard KPI or data parameters
  • Focused on short-term goals and objectives
Customized Reporting
  • Flexible reporting
  • Focused on problem solving
  • Historical perspective
Drill down analysis
Alerts & Notifications
  • Management by exception
  • Pre-defined business process
  • Real-time feedback
Statistical Analysis
  • Correlation analysis
  • Discriminant analysis
  • Regression analysis
Forecasting
  • Trends
  • Pattern recognition
  • Decision-making capability
Predictive Modeling
  • Prognostics
  • Data-driven decisions
Optimization
  • Enable innovation
  • Continuous improvement
  • Adaptive feedback

3.4.2.1. Performance management

Business performance management (BPM) entails measuring and improving actual performance versus KPIs and goals that have been established in correspondance with managerial strategy or local appetite; see notes on performance management for further discussion. BPM intersects with BI because it is one of the primary intended uses of BI data---to understand and improve operations. The content of BI dashboards may be stongly determined by performance management goals and techniques.

3.4.2.2. Data mining

See notes on data mining for discussion of the business applications and implementation of prediction, association, and clustering techniques.

3.4.2.3. Text & web analytics

See notes on text analytics.

3.4.2.4. Data viz & visual analytics

Per Sharda et al. (2014, pp. 114-116), data visualization is increasingly important capability of BI software because it makes meaning in the data more accessible to more users---and visual analytics is a new term meant to describe data viz that goes beyond description to the realm of business analytics, i.e. "diagnostic ... prescriptive and predictive". See notes on data visualization regarding the conversion of data into informative visuals; see notes on dashboard design regarding the presentation of multiple data visuals, combined to facilitate insights.

3.5. Reporting

Sharda et al. call reporting "an essential part of the larger drive toward improved managerial decision making and organizational knowledge management" and credit it with serving multiple internal functions:

3.5.1. Types of reports

Per Sharda et al. (2014, pp. 99-100), a report is "any communication artifact prepared with the specific intention of conveying information in a presentable form to whoever needs it"---which, in a business context, includes "memos, minutes, lab reports, sales reports, progress reports, justification reports, compliance reports, annual reports, and policies and procedures" (I object to the inclusion of policies and procedures in this list, because I would call them documentation).

Reports can be for internal or external audiences, and prepared on a periodic or ad hoc basis. Regarding external reporting, the Data Foundation's Standard Business Reporting effort is an attempt to reduce reporting costs by standardizing national and international governmental reporting requirements.

Type Purpose Length Timing Audience Tone
Informal ... < 10 pgs. Periodic/routine Internal Personal pronouns & contractions OK
Formal Communicate results of deeper research/analysis 10-100 pgs. ... Varies Formal language; table of contents, executive summary
Short "investigative, compliance, and situational focused" (short) Often periodic Varies Factual

Sharda et al. (2014) cite Hill's (2013) typology of business reports:

3.5.1.1. Dashboards

See notes on dashboard design.

3.5.2. Best practices for reporting

3.5.2.1. Data storytelling

From Sharda et al. (2014, p. 117), to present data as a story, ask: "Who are the characters? What is the drama or challenge? What hurdles have to be overcome? And at the end of your story, what do you want your audience to do as a result? (connect your 'call to action' with existing managerial conversations, if possible). They also cite Fink and Moore (2012):

4. Sources

4.1. Cited

Brandenberg, L. (n.d.) What Business Analyst skills are important for a new BA? Retrieved from http://www.bridging-the-gap.com/business-analyst-skills-important/

Fink, E., & Moore, S. J. (2012). Five best practices for telling great stories with data. Tableau Software, Inc. Retrieved from http://tableausoftware.com/whitepapers/telling-stories-with-data

Sallam, R, L., Howson, C., Idoine, C. J., Oestreich, T. W., & Laurence, J. (2017). Magic Quadrant for Business Intelligence and Analytics Platforms. Gartner. Retrieved from https://cdn2.hubspot.net/hubfs/2172371/Q1%202017%20Gartner.pdf?t=149626062

Sharda, R., Delen, D., & Turban, E. (2014). Business intelligence: A managerial perspective on analytics (3rd ed.). New York City, NY: Pearson.

4.2. References

4.3. Read

4.4. Unread