Exact matches only

Search in title

Search in content

Filter by Categories

101 concepts level I

101 concepts level II

2021 Level I Corporate Finance Full Videos

2021 Level I Economics Full Videos

2021 Level I Ethics Full Videos

2021 Level I FRA Full Videos

2021 Level I Portfolio Management Full Videos

2021 Level I Quantitative Methods Full Videos

Advice and How to Study Videos

All-Levels

Alternative Investments

Alternative Investments (AI)

BookLet Top Level

Corporate Issuers

Corporate Issuers (CI)

Demystified Videos

Derivatives

Derivatives (DV)

Economics

Economics (EC)

Equity

Equity Investments (EI)

Ethical and Professional Standards (ES)

Ethics

featured

Financial Reporting and Analysis

Financial Statement Analysis (FSA)

Fixed Income

Fixed Income (FI)

Level I

Level II

Level III

LM01 Alternative Investment Features, Methods, and Structures

LM01 Categories, Characteristics, and Compensation Structures of Alternative Investments

LM01 Corporate Structures and Ownership

LM01 Derivative Instrument and Derivative Market Features

LM01 Ethics and Trust in the Investment Profession

LM01 Fixed-Income Instrument Features

LM01 Fixed-Income Securities: Defining Elements

LM01 Introduction to Financial Statement Analysis

LM01 Market Organization & Structure

LM01 Market Organization and Structure

LM01 Organizational Forms, Corporate Issuer Features, and Ownership

LM01 Portfolio Management Overview

LM01 Portfolio Management: An Overview

LM01 Rates and Returns

LM01 The Firm & Market Structures

LM01 Time Value of Money

LM01 Topics in Demand and Supply Analysis

LM02 Alternative Investment Performance and Returns

LM02 Analyzing Income Statements

LM02 Code of Ethics and Standards of Professional Conduct

LM02 Code of Ethics and Standards of Professional Conduct Profession

LM02 Financial Reporting Standards

LM02 Fixed Income Markets - Issuance Trading and Funding

LM02 Fixed-Income Cash Flows and Types

LM02 Forward Commitment and Contingent Claim Features and Instruments

LM02 Introduction to Corporate Governance and Other ESG Considerations

LM02 Investors and Other Stakeholders

LM02 Organizing, Visualizing, and Describing Data

LM02 Performance Calculation and Appraisal of Alternative Investments

LM02 Portfolio Risk & Return: Part I

LM02 Portfolio Risk and Return Part I

LM02 Security Market Indexes

LM02 The Firm and Market Structures

LM02 Time Value of Money in Finance

LM02 Understanding Business Cycles

LM03 Aggregate Output, Prices and Economic Growth

LM03 Analyzing Balance Sheets

LM03 Business Models & Risks

LM03 Corporate Governance: Conflicts, Mechanisms, Risks, and Benefits

LM03 Derivative Benefits, Risks, and Issuer and Investor Uses

LM03 Fiscal Policy

LM03 Fixed-Income Issuance and Trading

LM03 Guidance for Standards I-VII

LM03 Guidance for Standards I–VII

LM03 Introduction to Fixed Income Valuation

LM03 Investments in Private Capital: Equity and Debt

LM03 Market Efficiency

LM03 Portfolio Risk & Return: Part II

LM03 Portfolio Risk and Return Part II

LM03 Private Capital, Real Estate, Infrastructure, Natural Resources, and Hedge Funds

LM03 Probability Concepts

LM03 Statistical Measures of Asset Returns

LM03 Understanding Income Statements

LM04 An Introduction to Asset-Backed Securities

LM04 Analyzing Statements of Cash Flows I

LM04 Arbitrage, Replication, and the Cost of Carry in Pricing Derivatives

LM04 Basics of Portfolio Planning & Construction

LM04 Basics of Portfolio Planning and Construction

LM04 Capital Investments

LM04 Common Probability Distributions

LM04 Fixed-Income Markets for Corporate Issuers

LM04 Introduction to the Global Investment Performance Standards (GIPS)

LM04 Monetary Policy

LM04 Overview of Equity Securities

LM04 Probability Trees and Conditional Expectations

LM04 Real Estate and Infrastructure

LM04 Understanding Balance Sheets

LM04 Understanding Business Cycles

LM04 Working Capital and Liquidity.

LM05 Analyzing Statements of Cash Flows II

LM05 Capital Investments and Capital Allocation

LM05 Company Analysis: Past and Present

LM05 Fixed-Income Markets for Government Issuers

LM05 Introduction to Geopolitics

LM05 Introduction to Industry and Company Analysis

LM05 Monetary and Fiscal Policy

LM05 Natural Resources

LM05 Portfolio Mathematics

LM05 Pricing and Valuation of Forward Contracts and for an Underlying with Varying Maturities

LM05 Pricing and Valuation of Forward Contracts.

LM05 Sampling and Estimation

LM05 The Behavioral Biases of Individuals

LM05 Understanding Cash Flow Statements

LM05 Understanding Fixed-Income Risk and Return

LM05 Working Capital & Liquidity

LM06 Analysis of Inventories

LM06 Capital Structure

LM06 Cost of Capital-Foundational Topics

LM06 Equity Valuation: Concepts and Basic Tools

LM06 Financial Analysis Techniques

LM06 Fixed-Income Bond Valuation: Prices and Yields

LM06 Fundamentals of Credit Analysis

LM06 Hedge Funds

LM06 Hypothesis Testing

LM06 Industry and Competitive Analysis

LM06 International Trade

LM06 Introduction to Geopolitics

LM06 Introduction to Risk Management

LM06 Pricing and Valuation of Futures Contracts

LM06 Simulation Methods

LM07 Analysis of Long-Term Assets

LM07 Business Models

LM07 Capital Flows and the FX Market

LM07 Capital Structure

LM07 Company Analysis: Forecasting

LM07 Estimation and Inference

LM07 International Trade and Capital Flows

LM07 Introduction to Digital Assets

LM07 Introduction to Linear Regression

LM07 Inventories

LM07 Pricing and Valuation of Interest Rate and Other Swaps

LM07 Pricing and Valuation of Interest Rates and Other Swaps

LM07 Technical Analysis

LM07 Yield and Yield Spread Measures for Fixed-Rate Bonds.

LM08 Currency Exchange Rates

LM08 Equity Valuation: Concepts and Basic Tools

LM08 Exchange Rate Calculations

LM08 Fintech in Investment Management

LM08 Hypothesis Testing

LM08 Long Lived Assets

LM08 Measures of Leverage

LM08 Pricing and Valuation of Options

LM08 Topics in Long-Term Liabilities and Equity

LM08 Yield and Yield Spread Measures for Floating-Rate Instruments

LM09 Analysis of Income Taxes

LM09 Income Taxes

LM09 Option Replication Using Put-Call Parity

LM09 Option Replication Using Put–Call Parity

LM09 Parametric and Non-Parametric Tests of Independence

LM09 The Term Structure of Interest Rates: Spot, Par, and Forward Curves

LM10 Financial Reporting Quality

LM10 Interest Rate Risk and Return

LM10 Non-current (Long-Term) Liabilities

LM10 Simple Linear Regression

LM10 Valuing a Derivative Using a One-Period Binomial Model

LM11 Financial Analysis Techniques

LM11 Financial Reporting Quality

LM11 Introduction to Big Data Techniques

LM11 Yield-Based Bond Duration Measures and Properties

LM12 Applications of Financial Statement Analysis

LM12 Introduction to Financial Statement Modeling

LM12 Yield-Based Bond Convexity and Portfolio Properties

LM13 Curve-Based and Empirical Fixed-Income Risk Measures

LM14 Credit Risk

LM15 Credit Analysis for Government Issuers

LM16 Credit Analysis for Corporate Issuers

LM17 Fixed-Income Securitization

LM18 Asset-Backed Security (ABS) Instrument and Market Features

LM19 Mortgage-Backed Security (MBS) Instrument and Market Features

New Booklet Top level

Portfolio Management

Portfolio Management (PM)

Quantitative Methods

Quantitative Methods (QM)

Uncategorized

Please select your exam.

IFT Notes for Level I CFA^® Program

LM02 Organizing, Visualizing, and Describing Data

Part 2

5. Summarizing Data Using a Contingency Table

A contingency table is a tabular format that displays the frequency distributions of two or more categorical variables simultaneously. It can be used to find patterns between the variables.

Contingency tables are constructed by listing all levels of one variable as rows and all the levels of the other variables as columns in a table. For example, consider a contingency table created for a portfolio of 500 stocks based on two variables – sector and market capitalization.

	Market Capitalization Variable (3 Levels)
Sector Variable (4 Levels)	Small	Mid	Large	Total
Financial	44	38	20	102
FMCG	130	54	46	230
Information Technology	57	34	21	112
Real estate	30	16	10	56
Total	261	142	97	500

Key points to note from the table are:

Each cell shows the number of stocks of each sector with a given market cap level. For example, there are 130 small-cap FMCG stocks. This count is also called joint frequencies.
The joint frequencies are added across rows and across columns, and the corresponding sub-totals are called marginal frequencies. For example, the marginal frequency of FMCG sector is 230, and the marginal frequency of small cap stocks is 261

Contingency tables can also be created using relative frequencies based on total count. Each number is expressed as percentage of the total number of stocks. For example, small cap FMCG stocks are 130 / 500 = 26% of the portfolio.

Applications

One application of contingency tables is for evaluating the performance of a classification model (using a confusion matrix). Suppose we have a model for classifying companies into two groups: those that default on their bond payments and those that do not default. The table below shows a confusion matrix for a sample of 1,000 non-investment-grade bonds.

Predicted Default	Actual Default		Total
Predicted Default	Yes	No	Total
Yes	150	10	160
No	6	834	840
Total	156	844	1,000

The table shows that the classification model incorrectly predicts default in 10 cases where an actual default did not occur. It also incorrectly predicts no default in 6 cases where a default did actually occur.

Another application of contingency tables is to investigate a potential association between two categorical variables. One way to test the potential association is to follow a three-step process:

Add the marginal frequencies and overall total to the contingency table.
Use the marginal frequencies to construct a table with expected values of the observations.
Compare with chi-square value for a given level of significance.

These steps are demonstrated in the following example.

Example: Contingency Tables and Association between Two Categorical Variables

Suppose we randomly pick 200 mutual funds and classify them based on two parameters:

Fund style – Growth versus Value
Risk level – Low risk versus High risk.

This data is summarized in a 2 x 2 contingency table shown below.

	Low Risk	High Risk
Growth	67	19
Value	98	16

Calculate the number of growth funds and the number of value funds.
Calculate the number of low-risk and high-risk funds.
Describe how the contingency table is used to set up a test for independence between fund style and risk level.

Solution to 1:

The marginal frequency for growth is 67 + 19 = 86

The marginal frequency for value is 98 + 16 = 114

Solution to 2:

The marginal frequency for low risk is 67 + 98 = 165

The marginal frequency for high risk is 19 + 16 = 35

Solution to 3:

To conduct a chi-square test of independence, we perform the following three steps.

Step 1: Add the marginal frequencies and overall total to the contingency table. We also show the relative frequency table for observed values.

Observed Values				Observed Values
	Low Risk	High Risk			Low Risk	High Risk
Growth	67	19	86	Growth	78%	22%	100%
Value	98	16	114	Value	86%	14%	100%
	165	35	200

Step 2: Use the marginal frequencies to construct a table with expected values of the observations.

Expected Value_i,j = (Total Row _i × Total Column _j)/Overall Total

For example,

Expected value for Growth / Low Risk is: (86 x 165) / 200 = 70.95

Expected value for Value / High Risk is: (114 x 35) / 200 = 19.95`1 qA

The table of expected values and the corresponding relative frequency table is presented below:

Observed Values				Observed Values
	Low Risk	High Risk			Low Risk	High Risk
Growth	70.95	15.05	86	Growth	82.5%	17.5%	100%
Value	94.05	19.95	114	Value	82.5%	17.5%	100%
	165	35	200

Step 3: The actual values and the expected values are used to derive the chi-square test statistic. This is then compared to a value from the chi-square distribution table for a given level of significance. If the test statistic is greater than the chi-square distribution value, then we can conclude that there is significant association between the categorical variables.

Instructor’s Note: You will understand this step better when you go over the reading on ‘Hypothesis Testing’.

6. Data Visualization

Visualization refers to the presentation of data in pictorial or graphical format to aid understanding of the data and for gaining insights into the data. There are multiple data visualization techniques, which are covered in the following sub-sections.

6.1 Histogram and Frequency Polygon

Histogram: A histogram presents the distribution of numerical data by using the height of a bar to represent the absolute frequency of each bin. The advantage of the visual display is that we can quickly see where most of the observations lie.

Suppose we are evaluating 200 stocks presented in the following frequency distribution table.

Price Range	Number of Stocks
46.00 – 51.00	20
51.00 – 56.00	60
56.00 – 61.00	100
61.00 – 65.00	20

We can depict this data graphically through a histogram.

Frequency polygon:

A frequency polygon plots the midpoints of each interval on the X-axis and the absolute frequency of that interval on the Y-axis. Each point is then connected with a straight line.

Cumulative frequency distribution

Another graphical tool is the cumulative frequency distribution chart. Such a graph can plot either the cumulative frequency or cumulative relative frequency against the upper interval limit. The cumulative frequency distribution allows us to see how many or what percent of the observations lie below a certain value. The figure below is an example of a cumulative frequency distribution.

Notice that the slope is steep in the ‘51.00 -56.00’ to ’56.00 – 61.00’ segment because a large number of stocks (100) are added. The slope flattens out in the last segment because only 20 stocks are added in the last segment.

Example:

Which of the following statements is most likely to be inaccurate about histograms?

A histogram is the graphical equivalent of a frequency distribution.
A histogram is a form of a bar chart.
In a histogram, the height represents the relative frequency for each interval.

Solution:

C is correct. In a histogram, the height represents the absolute frequency for each interval.

6.2 Bar Chart

A bar chart is used to plot the frequency distribution of categorical data. Each bar represents a distinct category, and the bar’s height is proportional to the frequency of that category.

The bar chart below shows that the sector in which the portfolio holds the most stocks is FMCG, with 230 stocks, followed by the IT sector, with 112 stocks.

A grouped bar chart (also called a clustered bar chart) can be used to show the frequency distribution of multiple categorical variables simultaneously.

The chart below shows that small cap FMCG stocks have the highest frequency – 130. Also, we can easily observe that small cap stocks are the largest sub-group within each sector.

A stacked bar chart is an alternative form for presenting the frequency distribution of multiple categorical variables simultaneously.

Bar charts can also be presented vertically instead of horizontally as shown below. Normally, the height of each bar is proportional to the value it depicts. However, sometimes the y-axis may be truncated, in which case the heights may not be proportional to the depicted values. In such cases, the graph needs to be evaluated more carefully.