High Cardinality Explained: The Basics Without the Jargon
Cardinality refers to the number of unique values in a dataset column. A column with many distinct values—like a user ID or timestamp—has high cardinality, while a column with limited distinct values—like a boolean flag (true/false) or a category with a few possible options—has low cardinality. For example, consider a database of an e-commerce platform.