MBA 202 — Research Methodology | Unit 3 & 4 Visual Notes

P.R. Pote Patil College of Engineering & Management, Amravati · Sem II 2025–26

Sampling Design · Hypothesis Testing · Z / t / F / χ² Tests · Research Report Writing · Data Analysis Software
Ref: Beri · Bajpai · Cooper · Gupta · Bryman

Unit 3 Sampling Design & Hypothesis Testing

Big Picture: What is Sampling?

🎯 SAMPLING

📦 Population (Universe) 🔬 Sample (Subset) 📋 Sampling Frame 📐 Sample Size (n) ⚠️ Sampling Error 📊 Parameter vs Statistic

"Sampling is a process of learning about the population on the basis of a sample drawn from it." — G.C. Beri, Marketing Research

🏠 Real Example

Amul wants to know if Indians prefer its new flavoured milk. Surveying all 140 crore Indians is impossible. So Amul picks 2000 consumers from different cities → those 2000 = Sample. All Indians = Population.

Key Terminology

📚

Must-Know Terms at a Glance

Term	One-Line Definition	Example
Population	All elements sharing a common characteristic	All MBA students in Maharashtra
Sample	A smaller selected group from population	200 MBA students from 5 Amravati colleges
Sampling Frame	Complete list from which sample is drawn	Registered MBA students list
Sampling Unit	Single element selected in the sample	Each individual student
Parameter (μ, σ)	Describes the population — usually unknown	Avg income of ALL Infosys employees
Statistic (x̄, s)	Describes the sample — calculated from data	Avg income of 500 surveyed employees
Sampling Error	Difference: sample statistic vs population parameter	Reduced by ↑ sample size
Non-Sampling Error	Mistakes in collection / measurement — not sampling	Respondent misunderstands a question
Sampling Bias	Systematic error making sample unrepresentative	Over/under-representing a group
Census	Study every single element of the population	India's Population Census every 10 years

🧠 Memory Trick

Parameter → Population (both P) · Statistic → Sample (both S)
Greek letters (μ, σ) for Population · English letters (x̄, s) for Sample

Probability Sampling Methods

Every unit has a known, non-zero chance of selection. Results CAN be statistically generalized. ✅ Preferred for quantitative, conclusive research. (Bryman & Bell)

🎰

1. Simple Random Sampling (SRS)

Every unit has an equal & independent probability of selection — like a lottery.

Methods
Lottery / Random Number Table

📖 Example

Teacher writes 60 roll numbers on chits, picks 10 blindly. Each student = 10/60 = 16.7% chance.

✅ Unbiased ✅ Simple ❌ Impractical for large pop.
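The lottery method can be sketched in a few lines of Python. This is only an illustration of the chit example: the roll numbers 1 to 60, the function name and the seed are assumptions made for the sketch, not part of the syllabus.

```python
import random

# Hypothetical class list: roll numbers 1..60, as in the chit example.
roll_numbers = list(range(1, 61))

def simple_random_sample(units, n, seed=None):
    """Draw n units without replacement; every unit has the same chance n/N."""
    rng = random.Random(seed)  # a fixed seed makes the draw repeatable
    return rng.sample(units, n)

chosen = simple_random_sample(roll_numbers, 10, seed=42)
selection_probability = 10 / len(roll_numbers)

print(sorted(chosen))
print(f"Chance per student: {selection_probability:.1%}")
```

Drawing without replacement mirrors picking chits: a roll number that has been picked cannot be picked again.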

📏

2. Systematic Random Sampling

Pick the first unit randomly, then every k-th unit after it.

k = N ÷ n (Interval = Population ÷ Sample Size)

📖 Example

Zomato: 10,000 delivery partners, need 500 → k = 20. Start at #7 → pick 7, 27, 47, 67...

✅ Quick & spread evenly ❌ Periodic pattern can cause bias
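A minimal sketch of the interval rule k = N ÷ n, using the Zomato numbers above. The function name and the 1-based numbering of delivery partners are assumptions for illustration.

```python
import random

def systematic_sample(population_size, sample_size, start=None, seed=None):
    """Return the 1-based positions picked by systematic sampling."""
    k = population_size // sample_size  # sampling interval k = N ÷ n
    if start is None:
        # Random start somewhere inside the first interval.
        start = random.Random(seed).randint(1, k)
    return [start + i * k for i in range(sample_size)]

picked = systematic_sample(10_000, 500, start=7)
print(picked[:4], "...", picked[-1])
```

With start = 7 and k = 20 the first picks are 7, 27, 47, 67, matching the example. If the list itself repeats every 20 entries, every pick lands on the same kind of unit, which is the periodic-pattern bias noted above.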

🥧

3. Stratified Random Sampling

Divide population into strata (non-overlapping subgroups), then randomly sample from each.

Proportionate

Sample from each stratum proportional to its size in population.

Disproportionate

Sample more from smaller/hard-to-reach strata for better representation.

📖 Example

Reliance Retail: Jio Mart 40%, Smart Bazaar 35%, Reliance Fresh 25%. Sample of 400 → 160 + 140 + 100.
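Proportionate allocation is simple arithmetic: each stratum receives its population share of the total sample. The sketch below reproduces the Reliance Retail split; the function name is an assumption, and simple rounding is used, which may not add up exactly to the target for other shares.

```python
def proportionate_allocation(shares, total_sample):
    """Split total_sample across strata in proportion to each stratum's share."""
    return {name: round(total_sample * share) for name, share in shares.items()}

# Stratum shares taken from the example above.
shares = {"Jio Mart": 0.40, "Smart Bazaar": 0.35, "Reliance Fresh": 0.25}
allocation = proportionate_allocation(shares, 400)

print(allocation)
print("Total:", sum(allocation.values()))
```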

🗺️

4. Cluster Sampling

Divide into clusters (geographic/natural). Randomly select clusters → study ALL units within selected clusters.

📖 Example

Govt. of India: randomly selects 10 districts → surveys ALL students in those 10 districts. Saves travel cost!

✅ Very cost-effective ❌ Higher sampling error

🪜

5. Multi-Stage Sampling

Extension of cluster sampling — sampling occurs in multiple stages, progressively narrowing down. Most practical for large national surveys. (Cooper & Schindler)

Select States

NASSCOM: randomly pick 10 states from 28

Select Cities

From each state, randomly select 5 cities

Select Companies

From each city, randomly select 10 companies

Select Employees

From each company, randomly select 20 employees
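To see how the stages multiply, the sketch below tracks how many units are in play after each stage of the NASSCOM example. It assumes every selected unit yields the full count at the next stage, which real surveys rarely achieve.

```python
# (stage description, units selected per unit of the previous stage)
stages = [
    ("states", 10),
    ("cities per state", 5),
    ("companies per city", 10),
    ("employees per company", 20),
]

running_total = 1
cumulative = []
for description, count in stages:
    running_total *= count  # units selected so far at this stage
    cumulative.append(running_total)
    print(f"After selecting {description}: {running_total} units")

final_sample_size = cumulative[-1]
```

Under these assumptions the final sample is 10 × 5 × 10 × 20 = 10,000 employees, drawn from 500 companies in 50 cities.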

Non-Probability Sampling Methods

Selection based on judgment, convenience, or criteria, NOT random chance. Results CANNOT be statistically generalized. ✅ Used in exploratory, qualitative research. (Bryman & Bell)

🛍️

1. Convenience Sampling

Also called Accidental Sampling. Select units that are most easily accessible.

📖 Example

Researcher stands outside Nagpur mall, stops shoppers who happen to be there.

✅ Fastest & cheapest ❌ High bias
Best for: Pilot studies, pre-testing questionnaires

🧠

2. Purposive (Judgmental) Sampling

Researcher deliberately selects units they believe are most relevant and informative.

📖 Example

Studying AI adoption → deliberately selects senior IT managers from TCS, Infosys, Wipro. Only 15 people but all experts.

📊

3. Quota Sampling

Non-probability version of stratified sampling. Researcher fills fixed quotas from each subgroup by any convenient means.

📖 Example

HUL survey: collect exactly 100 male + 100 female + 75 urban + 75 rural respondents — fill these quotas however possible.

❄️

4. Snowball Sampling

Also called Chain Referral Sampling. Start small → existing respondents refer others → sample grows like a snowball.

📖 Example

Studying drug addiction: start with 5 known addicts → each refers 3 more → sample snowballs. No sampling frame exists for this population!

Best for: Hidden groups, rare populations, social network analysis

Probability vs Non-Probability — Quick Comparison

Basis	🎲 Probability	🤔 Non-Probability
Selection	Random, based on chance	Non-random, judgment/convenience
Equal Chance?	Known probability	Unknown probability
Generalizability	Can generalize ✅	Cannot generalize statistically ❌
Bias	Less biased	More prone to bias
Cost	More expensive, time-consuming	Cheaper, faster
Use Case	Quantitative, conclusive research	Qualitative, exploratory research
Examples	SRS, Stratified, Cluster, Systematic	Convenience, Purposive, Quota, Snowball

Hypothesis Testing — The Core Idea

⚖️

Think of it Like a Court Case

🏛️ In a Court

Accused is assumed INNOCENT until proven guilty beyond reasonable doubt.

📊 In Hypothesis Testing

Null Hypothesis (H₀) is assumed TRUE until data provides strong enough evidence to reject it.

📖 Example

Amul claims new milk increases customer satisfaction to 8/10. Survey of 100 → avg = 8.3. Is 8.3 truly higher than old avg 7.5 OR just due to chance? Hypothesis testing answers this scientifically.

H₀

Null Hypothesis

No effect, no difference, no relationship. The default assumption — the status quo. Researcher tries to disprove this.

Symbol: H₀ (H-naught)
Example: Average exam score = 60 marks (μ = 60)
Example: No significant difference in sales before/after ad campaign

H₁

Alternative Hypothesis

Effect exists, difference exists, relationship exists. What researcher wants to prove. Supported when H₀ is rejected.

Symbol: H₁ or Hₐ
Two-tailed: μ ≠ 60 (either direction)
One-tailed right: μ > 60 (greater than)
One-tailed left: μ < 60 (less than)

Type I & Type II Errors — The Matrix

Decision	H₀ TRUE in Reality	H₀ FALSE in Reality
We REJECT H₀	TYPE I ERROR (α) · False Positive · "We wrongly rejected a true H₀" · Probability = α	✅ CORRECT · Power of test = 1 − β
We FAIL TO REJECT H₀	✅ CORRECT · Confidence = 1 − α	TYPE II ERROR (β) · False Negative · "Missed a real effect" · Probability = β

🏥 Medical Example

Type I = Telling a healthy person they have COVID (false alarm).
Type II = Telling an infected person they're healthy (missed detection) — MORE DANGEROUS in medicine!

🧠 Memory Trick

Type I = Innocent person punished. Type II = Incorrect release (guilty person goes free).
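One way to see what "Probability = α" means is a small simulation: draw many samples from a population where H₀ really is true, test each one, and count how often H₀ gets rejected anyway. The sketch below is an illustration only; the population values (μ = 5000, σ = 600, n = 36), the two-tailed Z-test, the number of trials and the seed are all assumptions.

```python
import random
from math import sqrt
from statistics import NormalDist

def type_i_error_rate(trials=2000, n=36, mu0=5000, sigma=600, alpha=0.05, seed=1):
    """Share of samples from a TRUE H0 that a two-tailed Z-test wrongly rejects."""
    rng = random.Random(seed)
    critical = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    false_alarms = 0
    for _ in range(trials):
        # H0 is true here: every sample really comes from a population with mean mu0.
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        sample_mean = sum(sample) / n
        z = (sample_mean - mu0) / (sigma / sqrt(n))
        if abs(z) > critical:
            false_alarms += 1  # a true H0 was rejected: Type I error
    return false_alarms / trials

rate = type_i_error_rate()
print(f"Share of true H0 cases wrongly rejected: {rate:.3f}")
```

The printed share should land close to 0.05, because α is exactly the long-run rate of false alarms the researcher agrees to tolerate.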

7 Steps in Hypothesis Testing

1. State H₀ and H₁

Clearly define what you are testing and in which direction.

2. Choose Level of Significance (α)

Usually α = 0.05 (5%) in business research. → 95% confidence.

3. Select Appropriate Test Statistic

Z-test / t-test / F-test / Chi-square, chosen by data type and sample size.

4. Determine Critical Value

Look up statistical table using α and degrees of freedom.

5. Calculate Test Statistic

Use sample data to compute the test statistic value.

6. Compare & Make Decision

If the calculated value falls beyond the critical value (for a two-tailed test: |calculated value| > critical value) → REJECT H₀. Else → Fail to Reject H₀. Equivalent rule: if p-value < α → REJECT H₀.

7. Draw Conclusions

Interpret results in the context of the research question in plain language.

The 4 Test Statistics — When to Use What

🧭 Quick Decision Guide

📋 Data is CATEGORICAL (Gender, Stream, Preference)?

→

χ² Chi-Square Test

🔢 QUANTITATIVE data, LARGE sample (n≥30), σ known?

→

Z - Test

🔢 QUANTITATIVE data, SMALL sample (n<30) or σ unknown?

→

t - Test

🔢 QUANTITATIVE data, comparing 3 or MORE groups?

→

F - Test (ANOVA)

Z-Test — Large Sample, σ Known

Used when n ≥ 30 and population standard deviation (σ) is known. Uses the standard normal distribution.

Z = (x̄ − μ₀) ÷ (σ ÷ √n)

x̄ = Sample mean · μ₀ = Hypothesized mean · σ = Pop SD · n = Sample size

Test Type	α=0.05	α=0.01
Two-tailed	±1.96	±2.576
One-tailed Right	+1.645	+2.326
One-tailed Left	−1.645	−2.326

📖 Solved Example

HUL manager: avg daily sales of Surf Excel = ₹5000 (H₀: μ=5000). Survey of 36 outlets → x̄=₹5200, σ=₹600.
Z = (5200−5000) ÷ (600÷√36) = 200÷100 = 2.0
Critical value (two-tailed, α=0.05) = ±1.96. Since |Z|=2.0 > 1.96 → REJECT H₀
✅ Conclusion: Sales ARE significantly different from ₹5000.
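The solved example can be checked with Python's standard library. The sketch below recomputes Z, looks up the two-tailed critical value, and adds the p-value; the function name is an assumption, and `statistics.NormalDist` stands in for the printed Z-table.

```python
from math import sqrt
from statistics import NormalDist

def z_test(sample_mean, hypothesized_mean, sigma, n, alpha=0.05):
    """Two-tailed one-sample Z-test; returns (z, critical value, p-value, reject?)."""
    z = (sample_mean - hypothesized_mean) / (sigma / sqrt(n))
    std_normal = NormalDist()  # standard normal: mean 0, SD 1
    critical = std_normal.inv_cdf(1 - alpha / 2)
    p_value = 2 * (1 - std_normal.cdf(abs(z)))
    return z, critical, p_value, abs(z) > critical

# HUL example: x̄ = 5200, μ₀ = 5000, σ = 600, n = 36
z, critical, p_value, reject = z_test(5200, 5000, 600, 36)
print(f"Z = {z:.2f}, critical = ±{critical:.2f}, p = {p_value:.4f}, reject H0: {reject}")
```

Both decision rules agree here: |Z| = 2.0 exceeds 1.96, and the p-value of about 0.0455 is below α = 0.05.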

t-Test — Small Sample, σ Unknown

Used when n < 30 OR σ is unknown. Uses t-distribution (thicker tails = more uncertainty). As n↑, t-distribution → normal distribution. (G.C. Beri)

t = (x̄ − μ₀) ÷ (s ÷ √n) · df = n − 1

s = Sample SD (used because σ unknown) · df = degrees of freedom

Three Types:

One-sample t-test: Compare sample mean to known value
Independent samples: Compare two different groups
Paired t-test: Before & after on SAME group

📖 Solved Example

Principal claims MBA students study 6 hrs/day. Survey of 16 students → x̄=5.2, s=1.2.
t = (5.2−6) ÷ (1.2÷√16) = −0.8÷0.3 = −2.67 · df = 15
Critical t (df=15, α=0.05, two-tailed) = ±2.131. Since |t|=2.67 > 2.131 → REJECT H₀
✅ Students study significantly FEWER than 6 hours per day.
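A short sketch that recomputes the t statistic for this example. Python's standard library has no t-distribution, so the critical value 2.131 is simply copied from the t-table row quoted above (df = 15, α = 0.05, two-tailed); the function and variable names are assumptions.

```python
from math import sqrt

def one_sample_t(sample_mean, hypothesized_mean, s, n):
    """Return the one-sample t statistic and its degrees of freedom (n - 1)."""
    t = (sample_mean - hypothesized_mean) / (s / sqrt(n))
    return t, n - 1

# Study-hours example: x̄ = 5.2, μ₀ = 6, s = 1.2, n = 16
t_stat, df = one_sample_t(5.2, 6, 1.2, 16)

T_CRITICAL = 2.131  # table value for df = 15, α = 0.05, two-tailed
reject_h0 = abs(t_stat) > T_CRITICAL

print(f"t = {t_stat:.2f}, df = {df}, reject H0: {reject_h0}")
```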

F-Test / ANOVA — Comparing 3+ Groups

Compare means of 3 or more groups simultaneously. Doing multiple t-tests instead would increase Type I error! (Cooper & Schindler)

F = Mean Square Between Groups ÷ Mean Square Within Groups

F-value is never NEGATIVE (it is a ratio of two variances). Larger F = larger differences between groups relative to the variation within groups.

Types of ANOVA:

One-Way ANOVA: 1 independent variable
Two-Way ANOVA: 2 independent variables
Also used in regression analysis

📖 Example

Tata Motors: Are monthly sales different across North, South, West regions?
H₀: μNorth = μSouth = μWest · H₁: At least one mean is different.
⚠️ Important: A significant F-test only tells you THAT at least one group mean differs, NOT WHICH groups differ. Use post-hoc tests (Tukey's HSD / Bonferroni) to find which ones.
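To make F = MSB ÷ MSW concrete, here is a one-way ANOVA computed by hand in Python. The monthly sales figures are made up for illustration (they are not Tata Motors data), and the critical value of about 5.14 is the F-table entry for df = (2, 6) at α = 0.05.

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of groups of observations."""
    all_values = [x for group in groups for x in group]
    grand_mean = mean(all_values)
    k = len(groups)            # number of groups
    n_total = len(all_values)  # total observations

    # Between-groups variation: how far each group mean sits from the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-groups variation: how far observations sit from their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    msb = ss_between / (k - 1)
    msw = ss_within / (n_total - k)
    return msb / msw, k - 1, n_total - k

# Hypothetical monthly sales (in units) for three regions.
north, south, west = [10, 12, 14], [20, 22, 24], [15, 17, 19]
f_value, df_between, df_within = one_way_anova_f([north, south, west])

F_CRITICAL = 5.14  # F-table value for df = (2, 6), α = 0.05
print(f"F = {f_value:.2f}, df = ({df_between}, {df_within}), reject H0: {f_value > F_CRITICAL}")
```

For this made-up data MSB = 75 and MSW = 4, so F = 18.75, well above the critical value; a post-hoc test would still be needed to say which regions differ.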

χ²

Chi-Square Test — Categorical Data

Most important non-parametric test. Works with CATEGORICAL data (not numbers — categories). "One of the most useful tools in attribute data analysis." — G.C. Beri

χ² = Σ [ (O − E)² ÷ E ]

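O = Observed frequency · E = Expected frequency; for a test of independence, E = (Row Total × Col Total) ÷ Grand Total
df = (rows−1) × (cols−1) for a test of independence · df = (categories−1) for goodness of fit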

Two Main Uses:

Goodness of Fit: Do observed frequencies match expected? (1 variable)
Test of Independence: Are 2 categorical variables associated? (Most common)

Requirement: All expected frequencies ≥ 5

📖 Example — Your AI Research!

Survey 200 students: 1) Use ChatGPT? (Yes/No) 2) Stream? (MBA/Engineering/Commerce)
H₀: ChatGPT usage and stream are independent. H₁: They are associated.
df = (2−1)×(3−1) = 2. Critical χ² at α=0.05 = 5.991
If calculated χ² > 5.991 → REJECT H₀ → ChatGPT usage IS associated with stream of study!
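The sketch below carries the example through to a number. The observed counts are invented for illustration (200 students in a 2 × 3 table), and the function name is an assumption; the critical value 5.991 is the one quoted above.

```python
def chi_square_independence(observed):
    """Return (chi-square, df, expected table) for a table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    # E = (Row Total × Col Total) ÷ Grand Total for every cell
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, df, expected

# Hypothetical counts. Rows: uses ChatGPT (Yes, No). Columns: MBA, Engineering, Commerce.
observed = [
    [50, 60, 30],
    [30, 10, 20],
]
chi2, df, expected = chi_square_independence(observed)

CHI2_CRITICAL = 5.991  # table value for df = 2, α = 0.05
print(f"chi-square = {chi2:.3f}, df = {df}, reject H0: {chi2 > CHI2_CRITICAL}")
```

For these invented counts every expected frequency is at least 5 (the smallest is 15), χ² is about 12.755, and H₀ would be rejected.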

Master Comparison — All 4 Tests

Feature	Z-Test	t-Test	F-Test (ANOVA)	χ² Chi-Square
Data Type	Quantitative	Quantitative	Quantitative	Categorical
Sample Size	Large (n≥30)	Small (n<30)	Any	Large (E≥5)
σ Known?	Yes	No	N/A	N/A
Groups	1 or 2	1 or 2	3 or more	Association
Distribution	Normal (Z)	t-distribution	F-distribution	χ² distribution
Formula	(x̄−μ₀)/(σ/√n)	(x̄−μ₀)/(s/√n)	MSB/MSW	Σ(O−E)²/E
Example	Sales vs target	Pre vs post training	Sales across regions	Gender vs preference

Unit 4 Research Report Writing & Presentation

What is a Research Report & Why Does it Matter?

"The research report is the medium through which the researcher communicates his work to others." — G.C. Beri

"A research report is a written document that describes the research process and findings." — Bryman & Bell

Purpose 1
📢 Communicate Findings
Purpose 2
🗂️ Permanent Documentation
Purpose 3
🎯 Decision Making
Purpose 4
📚 Knowledge Contribution

Types of Research Reports

🔬

Technical Report

Expert audiences — researchers, academics, specialists. Very detailed, full statistical analysis, technical jargon.

PhD Dissertation · Journal Article

Length: 100–500 pages

📰

Popular Report

Non-expert audiences. Simple language, lots of visuals (charts, infographics), avoids technical jargon.

Customer Survey Summary

Length: 5–20 pages

💼

Management Report

Written for decision-makers. Concise, action-oriented, focuses on implications & recommendations. Technical details in appendix.

Consulting firm's report to Wipro

Length: 15–50 pages

⏳

Interim / Progress Report

Written during research to update clients/supervisors. No final conclusions yet.

Monthly progress update

🎓

Academic / Thesis

Most rigorous. Strict institutional guidelines. Includes all components in detail. For degree requirements.

MBA Dissertation · PhD Thesis

Complete Structure of a Research Report

📋

Section A: Preliminary (Front Matter)

📌

Title Page

Title, Author, Institution, Date

✍️

Declaration

Originality & non-plagiarism

🏅

Certificate / Approval

Signed by guide/supervisor

🙏

Acknowledgement

Thanks to supporters, supervisor

📑

Table of Contents

All chapters + page numbers

📊

List of Tables & Figures

Sequential index of visuals

📝

Abstract / Executive Summary

Written LAST, placed FIRST. 150–300 words (abstract) / 1–3 pages (exec summary). Covers: problem, objectives, methodology, key findings, recommendations.

🧠 Memory Trick

"D-C-A-T-L-A" = Declaration → Certificate → Acknowledgement → Table of Contents → List of Tables → Abstract

📖

Section B: Main Body (Chapters)

Chapter 1: Introduction

Background · Statement of Problem · Research Objectives · Research Questions · Significance · Scope & Limitations · Chapter Plan

Chapter 2: Review of Literature

Cover all major previous studies · Critically analyse (not just summarise!) · Identify gaps · Establish theoretical framework · Justify why current research is needed · Cite properly (APA/MLA/Chicago)

Chapter 3: Research Methodology

Research Design · Population & Sample · Data Collection instruments · Variables (Dependent/Independent) · Data Analysis Methods · Reliability & Validity · Limitations

Chapter 4: Data Analysis & Interpretation

Demographic Analysis · Analysis per objective · Tables & Figures (numbered, titled, sourced) · Statistical Tests (with calculated value, critical value, decision) · Interpretation in plain language after every table

📖 Correct Interpretation Example

"Since calculated χ² of 8.23 exceeds critical value 5.991 at 5% significance (df=2), we reject H₀ and conclude there IS a significant association between gender and online shopping preference."

Chapter 5: Findings, Conclusions & Recommendations

Findings

WHAT was found. Factual, numbered. Each must correspond to an objective.

Conclusions

WHAT IT MEANS. Higher-level interpretation. Links findings to research problem.

Recommendations

WHAT TO DO. Specific, actionable, practical suggestions for stakeholders. Must be justified.

🧠 Memory

Findings = WHAT · Conclusions = SO WHAT · Recommendations = NOW WHAT

🔚

Section C: End Matter (Back Matter)

Bibliography / References

Complete list of all sources cited. Consistent format throughout.

APA 7th (Most common in business)

Cooper, D.R., & Schindler, P.S. (2019). Business Research Methods (13th ed.). McGraw-Hill.

Appendices

Research Questionnaire
Statistical tables used
Raw data / Computer outputs
Maps, Photographs
Legal permissions obtained

Report Formatting Standards

Element	Standard
Paper Size	A4 (210mm × 297mm)
Margins	Top/Bottom: 1 inch · Left: 1.5 inches (for binding) · Right: 1 inch
Font (Body)	Times New Roman 12pt or Arial 11pt
Font (Headings)	H1: 16–18pt Bold · H2: 14–16pt Bold · H3: 12–14pt Bold
Line Spacing	1.5 lines for body · Single for tables, footnotes, references
Alignment	Justified (both left and right margins)
Page Numbering	Roman numerals (i, ii) for preliminary · Arabic (1, 2, 3) from Introduction
Tables & Figures	Table 4.1 · Title ABOVE table · Source BELOW · Caption below figure

🧠 Writing Style Rules

Third Person ("The researcher found..." not "I found...") · Passive Voice · Precise numbers (not "many" or "some") · Objective tone · Consistent terminology

Research Paper Structure — IMRaD

Internationally accepted structure for scientific research papers. Used in Harvard Business Review, Journal of Marketing Research, IJMS.

Introduction

What is the problem? Why does it matter? What has been done? What gap does this fill?

Methods

How was it conducted? Enough detail for replication. Past tense, passive voice.

Results

Actual findings without interpretation. Tables, graphs, stat outputs with test statistic + df + p-value.

Discussion

What do results mean? Compare with prior studies. Practical implications. Limitations.

Oral Presentation Tips

🎤

Structure (Timing)

Opening: 2–3 min — Greet, introduce, state title
Problem Statement: 3–5 min — What & why
Methodology: 5–8 min — How conducted
Key Findings: 10–15 min — Most important results + visuals
Conclusions & Recommendations: 5–8 min — What does it mean & what to do
Q&A: 10–15 min — Be prepared!

💡

Key Rules

6×6 Rule: Max 6 bullets per slide · Max 6 words per bullet
Eye Contact: Look at audience, not screen
Voice Modulation: Vary pitch, pace, volume
Handling Questions: Listen → Repeat → Answer clearly
Prep: Practice 3–4 times before delivery

Chart Types — When to Use Which

Chart Type	Best Used For	Example
📊 Bar Chart	Comparing categories (most common)	Sales comparison across 5 regions
🥧 Pie Chart	Proportions & percentages of a whole	Market share: Tata, Hyundai, Maruti
📈 Line Graph	Trends over time	Monthly sales Jan to Dec
📉 Histogram	Frequency distribution of continuous data	Distribution of exam scores
🔘 Scatter Plot	Relationship between two variables	Price vs demand, Ad spend vs sales
📦 Box Plot	Spread and outliers in data	Salary range across departments

Data Analysis Software

📊

SPSS

IBM · Paid

MBA / PhD

GUI-based. Z/t/F/χ², Correlation, Regression, Factor Analysis, Cluster. Most used in social science.

📋

Microsoft Excel

Microsoft · Paid

MBA / UG

AVERAGE, STDEV, CORREL, Pivot Tables, Data Analysis ToolPak. Perfect for MBA minor projects!

🔵

R Software

Free & Open Source

PhD / Advanced

Requires coding. 15,000+ packages. Most advanced statistical capabilities. Popular in academia.

🐍

Python

Free & Open Source

Tech / Data Science

Pandas, NumPy, Matplotlib, Scikit-learn. Big data, ML. Most popular in AI research.

🏦

SAS

Very Expensive · Enterprise

Corporate · Banking

Used by ICICI, HDFC, Pharma companies. Handles millions of records. Risk analytics.

📝

Google Forms + Sheets

Free · Google

Minor Project ✅

Forms auto-creates charts. Sheets has basic stats. Perfect for MBA minor research projects!

⚡

Software Quick Reference

Software	Cost	Ease	Power	Best For
SPSS	Paid	Easy (GUI)	High	Social research · MBA/PhD
Excel	Paid	Very Easy	Medium	Business analysis · MBA
R	Free	Difficult (code)	Very High	Academic research · PhD
Python	Free	Moderate	Very High	Big Data / ML · Tech
SAS	Very Expensive	Moderate	Very High	Banking / Pharma · Corporate
Google Sheets	Free	Very Easy	Low	Small surveys · Minor Project

Research Agencies in India

🏢

Full-Service Agencies

Complete research from problem to final report.

Nielsen · IMRB / Kantar

🔁

Syndicated Research

Conduct research once, sell to multiple clients who share the cost.

NASSCOM · Nielsen Retail

💻

Online Research

Surveys exclusively through digital platforms. Large online panels.

SurveyMonkey · Qualtrics

🏛️

Government Agencies

Large-scale official research for policy-making.

NSSO · NCAER · NITI Aayog · RBI

⚠️ Common Mistakes — Avoid These!

❌ Common Mistake	✅ How to Avoid
Vague research problem	Be specific: WHO, WHAT, WHERE, WHEN. Not "Study of sales" but "Impact of Instagram ads on SME sales in Pune (2024)"
Too many objectives	Limit to 3–5. Each objective must have a corresponding analysis section.
Poor literature review	Don't just summarise — critically analyse. Identify contradictions & gaps.
Methodology not justified	Always explain WHY you chose a method. Why convenience? Why n=50?
Data without interpretation	After every table/test, write what it MEANS in plain language.
Missing citations	Every borrowed idea/stat must be cited. Use Zotero or Mendeley.
Overgeneralization	Don't claim 50-person sample findings apply to the entire world!
Findings = Conclusions	Findings = WHAT · Conclusions = SO WHAT · Recommendations = NOW WHAT
Informal language	No contractions (don't, can't), no slang, formal academic tone throughout.

✅ Pre-Submission Checklist

Title page complete with all required information
Declaration and certificate signed
All objectives have corresponding analysis sections
Every table has a number, title, and interpretation
All statistical tests include calculated value, critical value & decision

All sources cited in bibliography
Questionnaire attached as appendix
Consistent font, spacing, and page numbering throughout
Abstract/executive summary written LAST, is accurate
Word count is within required limits

🔢 All Key Formulas at a Glance

Z-Test

Z = (x̄ − μ₀) ÷ (σ ÷ √n)

Large sample, σ known, n≥30

t-Test

t = (x̄ − μ₀) ÷ (s ÷ √n) · df = n − 1

Small sample / σ unknown, n<30

F-Test / ANOVA

F = MSB ÷ MSW

Mean Square Between ÷ Mean Square Within. 3+ groups.

Chi-Square

χ² = Σ[(O−E)² ÷ E] · df = (r−1)(c−1)

E = (Row Total × Col Total) ÷ Grand Total

Systematic Sampling Interval

k = N ÷ n

N = Population size · n = Sample size

Sample Size (Proportions)

n = Z² × p × (1−p) ÷ E²

Z=1.96 at 95% confidence · p=0.5 (worst case) · E=margin of error
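A quick worked check of the sample size formula. The 5% and 3% margins of error are assumed values chosen for illustration; the result is rounded up because a fraction of a respondent cannot be surveyed.

```python
from math import ceil

def sample_size_for_proportion(z=1.96, p=0.5, margin_of_error=0.05):
    """n = Z² × p × (1 − p) ÷ E², rounded up to the next whole respondent."""
    return ceil(z ** 2 * p * (1 - p) / margin_of_error ** 2)

n_at_5_percent = sample_size_for_proportion(margin_of_error=0.05)
n_at_3_percent = sample_size_for_proportion(margin_of_error=0.03)

print(n_at_5_percent, n_at_3_percent)
```

At 95% confidence with p = 0.5, a 5% margin needs 385 respondents (384.16 rounded up), while tightening the margin to 3% pushes the requirement to 1,068.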

📝 Final Exam Strategy

Reference Book Emphasis:

G.C. Beri: Define BEFORE you explain
S.L. Gupta & H. Gupta: Always give Indian company examples
Cooper & Schindler: Practical application & decision-making
Naval Bajpai: Step-by-step processes
Bryman & Bell: Research design & methodology rigor

Perfect Answer Formula:

Define the concept

Explain it clearly

Give a relevant Indian company example

State advantages & disadvantages

🔑 Most Important Decision to Remember

Categorical → χ² · Quantitative + Large + σ Known → Z · Quantitative + Small/σ Unknown → t · 3+ Groups → F(ANOVA)
