Exact matches only

Search in title

Search in content

Filter by Categories

101 concepts level I

101 concepts level II

2021 Level I Corporate Finance Full Videos

2021 Level I Economics Full Videos

2021 Level I Ethics Full Videos

2021 Level I FRA Full Videos

2021 Level I Portfolio Management Full Videos

2021 Level I Quantitative Methods Full Videos

Advice and How to Study Videos

All-Levels

Alternative Investments

Alternative Investments (AI)

BookLet Top Level

Corporate Issuers

Corporate Issuers (CI)

Demystified Videos

Derivatives

Derivatives (DV)

Economics

Economics (EC)

Equity

Equity Investments (EI)

Ethical and Professional Standards (ES)

Ethics

featured

Financial Reporting and Analysis

Financial Statement Analysis (FSA)

Fixed Income

Fixed Income (FI)

Level I

Level II

Level III

LM01 Alternative Investment Features, Methods, and Structures

LM01 Categories, Characteristics, and Compensation Structures of Alternative Investments

LM01 Corporate Structures and Ownership

LM01 Derivative Instrument and Derivative Market Features

LM01 Ethics and Trust in the Investment Profession

LM01 Fixed-Income Instrument Features

LM01 Fixed-Income Securities: Defining Elements

LM01 Introduction to Financial Statement Analysis

LM01 Market Organization & Structure

LM01 Market Organization and Structure

LM01 Organizational Forms, Corporate Issuer Features, and Ownership

LM01 Portfolio Management Overview

LM01 Portfolio Management: An Overview

LM01 Rates and Returns

LM01 The Firm & Market Structures

LM01 Time Value of Money

LM01 Topics in Demand and Supply Analysis

LM02 Alternative Investment Performance and Returns

LM02 Analyzing Income Statements

LM02 Code of Ethics and Standards of Professional Conduct

LM02 Code of Ethics and Standards of Professional Conduct Profession

LM02 Financial Reporting Standards

LM02 Fixed Income Markets - Issuance Trading and Funding

LM02 Fixed-Income Cash Flows and Types

LM02 Forward Commitment and Contingent Claim Features and Instruments

LM02 Introduction to Corporate Governance and Other ESG Considerations

LM02 Investors and Other Stakeholders

LM02 Organizing, Visualizing, and Describing Data

LM02 Performance Calculation and Appraisal of Alternative Investments

LM02 Portfolio Risk & Return: Part I

LM02 Portfolio Risk and Return Part I

LM02 Security Market Indexes

LM02 The Firm and Market Structures

LM02 Time Value of Money in Finance

LM02 Understanding Business Cycles

LM03 Aggregate Output, Prices and Economic Growth

LM03 Analyzing Balance Sheets

LM03 Business Models & Risks

LM03 Corporate Governance: Conflicts, Mechanisms, Risks, and Benefits

LM03 Derivative Benefits, Risks, and Issuer and Investor Uses

LM03 Fiscal Policy

LM03 Fixed-Income Issuance and Trading

LM03 Guidance for Standards I-VII

LM03 Guidance for Standards I–VII

LM03 Introduction to Fixed Income Valuation

LM03 Investments in Private Capital: Equity and Debt

LM03 Market Efficiency

LM03 Portfolio Risk & Return: Part II

LM03 Portfolio Risk and Return Part II

LM03 Private Capital, Real Estate, Infrastructure, Natural Resources, and Hedge Funds

LM03 Probability Concepts

LM03 Statistical Measures of Asset Returns

LM03 Understanding Income Statements

LM04 An Introduction to Asset-Backed Securities

LM04 Analyzing Statements of Cash Flows I

LM04 Arbitrage, Replication, and the Cost of Carry in Pricing Derivatives

LM04 Basics of Portfolio Planning & Construction

LM04 Basics of Portfolio Planning and Construction

LM04 Capital Investments

LM04 Common Probability Distributions

LM04 Fixed-Income Markets for Corporate Issuers

LM04 Introduction to the Global Investment Performance Standards (GIPS)

LM04 Monetary Policy

LM04 Overview of Equity Securities

LM04 Probability Trees and Conditional Expectations

LM04 Real Estate and Infrastructure

LM04 Understanding Balance Sheets

LM04 Understanding Business Cycles

LM04 Working Capital and Liquidity.

LM05 Analyzing Statements of Cash Flows II

LM05 Capital Investments and Capital Allocation

LM05 Company Analysis: Past and Present

LM05 Fixed-Income Markets for Government Issuers

LM05 Introduction to Geopolitics

LM05 Introduction to Industry and Company Analysis

LM05 Monetary and Fiscal Policy

LM05 Natural Resources

LM05 Portfolio Mathematics

LM05 Pricing and Valuation of Forward Contracts and for an Underlying with Varying Maturities

LM05 Pricing and Valuation of Forward Contracts.

LM05 Sampling and Estimation

LM05 The Behavioral Biases of Individuals

LM05 Understanding Cash Flow Statements

LM05 Understanding Fixed-Income Risk and Return

LM05 Working Capital & Liquidity

LM06 Analysis of Inventories

LM06 Capital Structure

LM06 Cost of Capital-Foundational Topics

LM06 Equity Valuation: Concepts and Basic Tools

LM06 Financial Analysis Techniques

LM06 Fixed-Income Bond Valuation: Prices and Yields

LM06 Fundamentals of Credit Analysis

LM06 Hedge Funds

LM06 Hypothesis Testing

LM06 Industry and Competitive Analysis

LM06 International Trade

LM06 Introduction to Geopolitics

LM06 Introduction to Risk Management

LM06 Pricing and Valuation of Futures Contracts

LM06 Simulation Methods

LM07 Analysis of Long-Term Assets

LM07 Business Models

LM07 Capital Flows and the FX Market

LM07 Capital Structure

LM07 Company Analysis: Forecasting

LM07 Estimation and Inference

LM07 International Trade and Capital Flows

LM07 Introduction to Digital Assets

LM07 Introduction to Linear Regression

LM07 Inventories

LM07 Pricing and Valuation of Interest Rate and Other Swaps

LM07 Pricing and Valuation of Interest Rates and Other Swaps

LM07 Technical Analysis

LM07 Yield and Yield Spread Measures for Fixed-Rate Bonds.

LM08 Currency Exchange Rates

LM08 Equity Valuation: Concepts and Basic Tools

LM08 Exchange Rate Calculations

LM08 Fintech in Investment Management

LM08 Hypothesis Testing

LM08 Long Lived Assets

LM08 Measures of Leverage

LM08 Pricing and Valuation of Options

LM08 Topics in Long-Term Liabilities and Equity

LM08 Yield and Yield Spread Measures for Floating-Rate Instruments

LM09 Analysis of Income Taxes

LM09 Income Taxes

LM09 Option Replication Using Put-Call Parity

LM09 Option Replication Using Put–Call Parity

LM09 Parametric and Non-Parametric Tests of Independence

LM09 The Term Structure of Interest Rates: Spot, Par, and Forward Curves

LM10 Financial Reporting Quality

LM10 Interest Rate Risk and Return

LM10 Non-current (Long-Term) Liabilities

LM10 Simple Linear Regression

LM10 Valuing a Derivative Using a One-Period Binomial Model

LM11 Financial Analysis Techniques

LM11 Financial Reporting Quality

LM11 Introduction to Big Data Techniques

LM11 Yield-Based Bond Duration Measures and Properties

LM12 Applications of Financial Statement Analysis

LM12 Introduction to Financial Statement Modeling

LM12 Yield-Based Bond Convexity and Portfolio Properties

LM13 Curve-Based and Empirical Fixed-Income Risk Measures

LM14 Credit Risk

LM15 Credit Analysis for Government Issuers

LM16 Credit Analysis for Corporate Issuers

LM17 Fixed-Income Securitization

LM18 Asset-Backed Security (ABS) Instrument and Market Features

LM19 Mortgage-Backed Security (MBS) Instrument and Market Features

New Booklet Top level

Portfolio Management

Portfolio Management (PM)

Quantitative Methods

Quantitative Methods (QM)

Uncategorized

Please select your exam.

IFT Notes for Level I CFA^® Program

LM05 Sampling and Estimation

Part 1

1. Introduction

A sample is a subset of a population. We can study a sample to infer conclusions about the population itself. For example, if all the stocks trading in the US are considered a population, then indices such as the S&P 500 are samples. We can look at the performance of the S&P 500 and draw conclusions about how all stocks in the US are performing. This process is known as sampling and estimation.

2. Sampling Methods

There are various methods for obtaining information on a population through samples. The information we obtain usually concerns a parameter, a quantity used to describe a population. To estimate a parameter, we use sample statistics. A statistic is a quantity used to describe a sample.

There are two reasons why sampling is used:

Time saving: In many cases it will be very time consuming to examine every member of the population.
Monetary saving: In some cases, examining every member of the population becomes economically inefficient.

There are two types of sampling methods:

Probability sampling: Every member of the population has an equal chance of being selected. Therefore, the sample created is representative of the population.
Non-probability sampling: Every member of the population may not have an equal chance of being selected. This is because sampling depends on factors such as the sampler’s judgement or the convenience to access data. Therefore, the sample created may not be representative of the population.

All else equal, the probability sampling method is more accurate and reliable as compared to the non-probability sampling method.

In the subsequent sections, we will discuss the following sampling methods:

Probability sampling
- Simple random sampling
- Systematic sampling
- Stratified random sampling
- Cluster sampling
Non-probability sampling
- Convenience sampling
- Judgement sampling

2.1 Simple Random Sampling

Simple random sampling is the process of selecting a sample from a larger population in such a way that each member of the population has the same probability of being included in the sample.

Sampling distribution

If we draw samples of the same size several times and calculate the sample statistic, the sample statistic will be different each time. The distribution of values of the sample statistic is called a sampling distribution.

For example, say you select 100 stocks from a universe of 10,000 stocks and calculate the average annual returns of these 100 stocks. Let’s say you get an average return of 15%. You repeat this process with a second sample of 100 stocks. This time, you get an average return of 14%. You keep repeating this process and each time you get a different average return. The distribution of these sample average returns is called a sampling distribution.

Sampling error

Sampling error is the difference between a sample statistic and the corresponding population parameter. The sampling error of the mean is given by:

$\rm Sampling \ error \ of \ the \ mean = \overline{x} - \mu$

For example, let’s say you want to estimate the average returns of 10,000 stocks. You draw a sample of 100 stocks and calculate the average return of these 100 stocks as 15%. However, the actual average of the 10,000 stocks was 12%. Then the sampling error = 15% – 12% = 3%.

Systematic sampling: In this technique, we select every kth member of the population until we have a sample of the desired size. Samples created using this technique should be approximately random.

Instructor’s Note: Researchers calculate the sampling interval ‘k’ by dividing the entire population size by the desired sample size.

2.2 Stratified Random Sampling

In stratified random sampling, the population is divided into subgroups based on one or more distinguishing characteristics. Samples are then drawn from each subgroup, with sample size proportional to the size of the subgroup relative to the population. Finally, samples from each subgroup are pooled together to form a stratified random sample.

The advantage of stratified random sampling is that the sample will have the same distribution of key characteristics as the overall population. This can help reduce the sampling error. Stratified random sampling therefore produces more precise parameter estimates than simple random sampling

For example, you divide the universe of 10,000 stocks as per their market capitalization such that you have 5,000 large cap stocks, 3,000 mid cap stocks, and 2,000 small cap stocks. In stratified random sampling, to select a total sample of 100 stocks, you will randomly select 50 large cap stocks, 30 mid cap stocks, and 20 small cap stocks and pool all these samples together to form a stratified random sample.

Example

Paul wants to categorize publicly listed stocks for his research project. He first divides the stocks into 15 industries. Then from each industry, he categorizes companies into three groups: small, medium, large. Finally, he divides these into value versus growth stocks. How many cells or strata does the sampling plan entail?

A. 20
B. 45
C. 90

Solution:

C is correct. This is an application of the multiplication rule of counting. The total number of cells is the product of 15, 3, and 2. Thus the answer is 90.

2.3 Cluster Sampling

Cluster sampling is similar to stratified random sampling as it also requires the population to be divided into subpopulation groups, called clusters. Each cluster is essentially a mini-representation of the entire population. Then some random clusters are chosen as a whole for sampling.

Instructor’s Note: Clusters are generally based on natural groups separating the population. For example, you might be able to divide your data into natural groupings like city blocks, voting districts, or school districts.

The main difference between cluster sampling and stratified random sampling is that in cluster sampling, the whole cluster is selected; and not all clusters are included in the sample. In stratified random sampling, however, only a few members from each stratum are selected; but all strata are included in the sample.

The difference between simple random sampling, stratified random sampling, and cluster sampling is illustrated in the figure below:

As compared to SRS and stratified sampling, cluster sampling is less accurate because the chosen sample may be less representative of the entire population. However, this method is the most time-efficient and cost-efficient amongst the three.

2.4 Non-Probability Sampling

The two major types of non-probability sampling methods are:

Convenience sampling: In this method, the researcher selects members from a population based on how easy it is to access the member i.e., data is collected from a conveniently available pool of respondents. The disadvantage of this method is that the sample selected may not be representative of the entire population. The advantage is that data can be collected quickly and at a low cost. Hence this method is particularly suitable for small-scale pilot studies.
Judgmental sampling: In this method, the researcher uses his knowledge and professional judgment to selectively handpick members from the population. The disadvantage of this method is that the sampling may be impacted by the researcher’s bias and the results may be skewed. The advantage of this method is that it allows the researcher to directly go to the target population of interest. For example, when auditing financial statements, experienced auditors can use their professional judgment to select important accounts or transactions that can provide sufficient audit coverage.

2.5 Sampling from Different Distributions

Instructor’s Note: This section does not contain testable concepts. The core point is presented below.

In addition to selecting an appropriate sampling method, researchers also need to be careful when sampling from a population that is not under one single distribution. In such cases, the larger population should be divided into smaller parts, and samples should be drawn from the smaller parts separately.

3. The Central Limit Theorem and Distribution of the Sample Mean

The sample mean is a random variable with a probability distribution known as the statistic’s sampling distribution. To understand this concept, consider the following population: last year’s returns on every stock traded in the United States. We are interested in the mean return of all stocks but do not have time to calculate the population mean. Hence, we draw a sample of 50 stocks and compute the sample mean. We then draw another sample of 50 stocks and compute the sample mean. This exercise can be repeated several times giving us a distribution of sample means. This distribution is called the statistic’s sampling distribution. The central limit theorem, explained below, helps us understand the sampling distribution of the mean.

3.1 The Central Limit Theorem

According to the central limit theorem, if we draw a sample from a population with a mean µ and a variance σ², then the sampling distribution of the sample mean:

will be normally distributed (irrespective of the type of distribution of the original population).
will have a mean of µ.
will have a variance of σ²/n.

For example, suppose the average return of the universe of 10,000 stocks is 12% and its standard deviation is 10%. Through central limit theorem we can conclude that if we keep drawing samples of 100 stocks and plot their average returns, we will get a sampling distribution that will be normally distributed with mean = 12% and variance of 10²/100 = 1%.

3.2 Standard Error of the Sample Mean

The standard deviation of the distribution of the sample means is known as the standard error of the sample mean.

When we know the population standard deviation, the standard error of the sample mean can be calculated as:

$\sigma_{\overline{X}} = \frac{\sigma}{\sqrt{n}}$

When we do not know the population standard deviation (σ) we can use the sample standard deviation (s) to estimate the standard error of the sample mean:

$s_{\overline{X}} = \frac{s}{\sqrt{n}}$

Example

The mean of a population is 12 and the standard deviation is 3. Given that the population comprises of 64 observations, what is the standard error of the sample mean?