ActuarialExam SRMPrincipal Components and Cluster Analysis
Exam SRM topic · 5–10% of exam

Principal Components and Cluster Analysis

Dimension reduction via principal components analysis and unsupervised clustering via k-means and hierarchical clustering.

Per-objective worked-example outlines

For each learning objective on Principal Components and Cluster Analysis, here is the approach an exam item would test — the setup, the ordering of your reasoning, and the formula or identity you need to bring to the page. Approaches, not full solutions, by design. Verify against the current soa.org syllabus before your sitting.

Compute and interpret principal components for dimension reduction

Setup

A correlation matrix or set of standardized variables is given and you must compute the first few principal components and interpret them.

Approach

PCA finds eigenvectors of the covariance (or correlation) matrix sorted by eigenvalue. Each principal component is a linear combination of the original variables; the first explains the most variance. Use a scree plot or cumulative variance threshold (e.g., 80%) to pick the number of components. Interpret each PC by its dominant loadings.

Key identity

PCs are eigenvectors of Σ; variance explained = eigenvalues / trace(Σ).

Apply k-means clustering and select an appropriate number of clusters

Setup

A dataset is given and you must apply k-means and determine the optimal number of clusters.

Approach

Initialize k centroids randomly, assign each observation to the nearest centroid, recompute centroids as cluster means, repeat until assignments stabilize. To select k, use the elbow method on within-cluster sum of squares or the gap statistic. Standardize variables before clustering so distance is meaningful.

Key identity

WSS = Σ_k Σ_{i in C_k} ||x_i - μ_k||^2; choose k at the elbow.

Apply hierarchical clustering and interpret dendrograms

Setup

A pairwise distance matrix or dataset is given, and you must build an agglomerative hierarchical clustering and read the dendrogram.

Approach

Start with each observation as its own cluster. At each step, merge the two closest clusters using a linkage rule: complete (max distance), single (min), average, or Ward (variance). Visualize as a dendrogram; cut horizontally at a chosen height to define clusters. Different linkages produce qualitatively different cluster shapes.

Key identity

Linkage choices: complete (compact), single (chained), average, Ward (variance-minimizing).

Common exam traps on Principal Components and Cluster Analysis

Recurring patterns where candidates lose points on Principal Components and Cluster Analysis-style items. Each entry pairs the trap with the fix.

Trap

Forgetting to standardize variables before PCA or clustering.

Fix

PCA on the covariance matrix lets large-scale variables dominate; use the correlation matrix or standardized data.

Trap

Treating cluster labels as ordered.

Fix

Cluster IDs are nominal; treat them as categories with no inherent ranking.

Trap

Choosing k by minimizing WSS without bound.

Fix

WSS always decreases with k; use the elbow or a penalized criterion to choose.

Trap

Reading dendrogram heights as distances on the original scale.

Fix

Heights depend on the linkage rule; interpret relatively, not absolutely.

Where to find Principal Components and Cluster Analysis in popular manuals

Pointers to where each major vendor covers this topic, so you can grab the right chapter without combing the full manual. We do not reproduce vendor content — just the location. Chapter and lesson numbers shift between editions; use these as a guide, not as a citation.

ACTEX

PCA and clustering chapter near the back of the SRM manual

Coaching Actuaries

Learn modules on PCA and Cluster Analysis; Adapt category "PCA / Clustering"

The Infinite Actuary

Unsupervised methods video block

6-day Principal Components and Cluster Analysis micro plan

A focused 6-day sub-schedule for Principal Components and Cluster Analysis specifically, at roughly 1.5–2.5 hours per day. Drop it inside your full Exam SRM plan as a single coverage module.

Day 1

Read the PCA chapter; compute PCs by hand on a small 3x3 correlation matrix.

Day 2

Drill 10 PCA interpretation problems including scree plots and loadings.

Day 3

k-means clustering — 8 problems including the elbow method.

Day 4

Hierarchical clustering — 6 problems on dendrogram reading and linkage choice.

Day 5

Mixed 12-question timed drill on PCA and clustering.

Day 6

Re-do flagged problems and write a one-page summary of dimension reduction vs clustering use cases.

How exclam.ai helps you master Principal Components and Cluster Analysis

Flashcards from your manual

Upload your ACTEX Exam SRM digital edition, scanned ASM pages, TIA handouts, or your own notes. exclam.ai extracts the Principal Components and Cluster Analysis sections and generates flashcards automatically, tuned to the exam traps above.

Worked-example drilling

Each per-objective approach above maps to a quiz template. exclam.ai re-surfaces missed items until you can recall both the setup and the key identity from cold.

FSRS spaced repetition

Because Principal Components and Cluster Analysis is 5–10% of your exam, losing it during review costs you. FSRS brings it back at the optimal moment.

Principal Components and Cluster Analysis in the Exam SRM context

SOA Exam SRM has 5 topic areas. Principal Components and Cluster Analysis is weighted at approximately 5–10% of the exam, here is where it sits relative to the other topics.

Topic areaWeight
Basics of Statistical Learning7–13%
Linear Models40–50%
Decision Trees20–25%
→ Principal Components and Cluster Analysis5–10%
Time Series Models5–10%

Start practicing Principal Components and Cluster Analysis today

Upload your ACTEX Exam SRM digital edition, scanned ASM pages, TIA handouts, or your own notes. exclam.ai generates a fully guided study plan with adaptive flashcards and quizzes for this topic.

See pricing