Andrejs'🌱

Folder: ML/LLM

29 items under this folder.

  • Jan 28, 2025
    Briefly about transformer’s evolution or why is softmax cool
  • Jan 14, 2025
    Advanced RAG techniques
  • Jan 12, 2025
    prompting
  • Jan 08, 2025
    how to evaluate LLM chatbots
  • Dec 30, 2024
    what can go wrong with LLMs
  • Dec 04, 2024
    query expansion
  • Nov 18, 2024
    Inference Scaling for Long-Context Retrieval Augmented Generation
  • Nov 18, 2024
    Lost in the Middle effect
  • Nov 14, 2024
    Evolution of embeddings
  • Nov 13, 2024
    prefix caching
  • Nov 13, 2024
    speculative decoding
  • Nov 12, 2024
    continuous batching
  • Nov 12, 2024
    inference optimization
  • Nov 09, 2024
    GPU characteristics
  • Nov 03, 2024
    LLM inference
  • Nov 03, 2024
    decoding strategy
  • Oct 25, 2024
    scaling laws
  • Sep 07, 2024
    quantization
  • Sep 04, 2024
    ROUGE
  • Sep 04, 2024
    Reinforcement Learning from Human Feedback
  • Sep 04, 2024
    reward model
  • Aug 31, 2024
    direct preference optimization
  • Aug 22, 2024
    positional encoding
  • Aug 20, 2024
    paper review - Llama 3 Herd of Models
  • Aug 15, 2024
    byte pair encoding
  • Aug 15, 2024
    perplexity
  • Aug 15, 2024
    tokenization
  • Feb 28, 2024
    Retrieval-Augmented Generation
  • Jan 20, 2024
    LLM

