Numerical Representations

Numerical representations refer to the mapping of non-numerical data (such as text, images, or categorical labels) into vector spaces where mathematical operations can be performed. This process is fundamental to Machine Learning and Deep Learning, enabling algorithms to process and analyze unstructured data.

Core Types

  • Categorical Encoding: Methods like One-Hot Encoding, Label Encoding, and Target Encoding transform discrete categories into numerical features suitable for statistical models.
  • Continuous Mapping: Techniques such as Normalization and Standardization scale continuous variables to ensure uniformity across features.
  • Dimensionality Reduction: Algorithms like PCA (Principal Component Analysis) and t-SNE compress high-dimensional data while preserving structure.

Semantic & Textual Representations

Text embeddings map linguistic units into dense vector spaces, preserving semantic relationships.

References