[Figure: A simple example of fact-checking and hallucination by NVIDIA's NeMo-Guardrails]

A vast majority of questions require context. For example, if we ask ChatGPT: "What's the best Vietnamese restaurant?", the context needed would be "where", because the best Vietnamese restaurant in Vietnam would be different from the best Vietnamese restaurant in the US.

According to this cool paper SituatedQA (Zhang & Choi, 2021), a significant proportion of information-seeking questions have context-dependent answers: roughly 16.5% of the Natural Questions NQ-Open dataset. Personally, I suspect that this percentage would be even higher for enterprise use cases. For example, say a company builds a chatbot for customer support; for this chatbot to answer any customer question about any product, the context needed might be that customer's history or that product's information.

Because the model "learns" from the context provided to it, this process is also called context learning.

Context length is especially important for RAG (Retrieval Augmented Generation; Lewis et al., 2020), which has emerged as the predominant pattern for LLM industry use cases. For those not yet swept away in the RAG rage, RAG works in two phases:

Phase 1: chunking (also known as indexing)

Gather all the documents you want your LLM to use. Divide these documents into chunks that can be fed into your LLM to generate embeddings, and store these embeddings in a vector database.
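To make Phase 1 concrete, here is a minimal sketch of the gather, chunk, embed, and store pipeline. It assumes the sentence-transformers and faiss libraries; the embedding model name, chunk size, and sample documents are illustrative placeholders, not anything prescribed by the post.

```python
# A minimal sketch of Phase 1 (chunking/indexing), assuming the
# sentence-transformers and faiss libraries. The model name, chunk size,
# and documents below are illustrative placeholders.
import numpy as np
import faiss
from sentence_transformers import SentenceTransformer

def chunk(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split one document into fixed-size character chunks with overlap."""
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

# 1. Gather all the documents you want your LLM to use.
documents = ["...full text of document 1...", "...full text of document 2..."]

# 2. Divide the documents into chunks.
chunks = [c for doc in documents for c in chunk(doc)]

# 3. Generate one embedding per chunk.
model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative embedding model
embeddings = np.asarray(model.encode(chunks), dtype="float32")

# 4. Store the embeddings in a vector index (an in-memory FAISS index
#    standing in here for a vector database).
index = faiss.IndexFlatL2(embeddings.shape[1])
index.add(embeddings)
```

Fixed-size character chunks are only one of many chunking strategies; splitting by sentence, paragraph, or token count is equally common, and the right granularity depends on the documents and the retrieval quality you need.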