jonmatumalpha
conceptsnotesexperimentsessays

© 2026 Jonatan Mata · alpha · v0.1.0

#nlp

4 articles tagged #nlp.

  • Embeddings

    Dense vector representations that capture the semantic meaning of text, images, or other data in a numerical space where proximity reflects conceptual similarity.

    evergreen#embeddings#vectors#nlp#semantic-similarity#representation-learning
  • Large Language Models

    Massive neural networks based on the Transformer architecture, trained on enormous text corpora to understand and generate natural language with emergent capabilities like reasoning, translation, and code generation.

    evergreen#llm#transformer#gpt#claude#foundation-models#deep-learning#nlp
  • Semantic Search

    Information retrieval technique that uses vector embeddings to find results by meaning, not just exact keyword matching.

    growing#search#embeddings#vector-search#nlp#transformers#information-retrieval
  • Tokenization

    Process of splitting text into discrete units (tokens) that language models can process numerically, fundamental to how LLMs understand and generate text.

    seed#tokenization#bpe#tokens#nlp#llm#preprocessing
All tags