Reference Guide
100 AI and LLM Questions Answered
Fact-based answers to the most important questions about artificial intelligence, large language models, prompting, safety, and business use cases. Every answer reflects consensus across major AI providers and researchers.
100 questions across 10 categories · External sources linked throughout
1. AI Fundamentals
Q1.What is artificial intelligence (AI)?
Artificial intelligence (AI) refers to computer systems that perform tasks typically requiring human intelligence, such as understanding language, recognizing patterns, and making decisions. Modern AI is primarily based on machine learning, where systems learn from large amounts of data rather than following explicitly programmed rules.
IBM: What is AI?
Q2.What is a large language model (LLM)?
A large language model (LLM) is a type of AI model trained on vast amounts of text data to understand and generate human language. LLMs learn statistical patterns across billions of text examples, enabling them to answer questions, write content, summarize documents, translate languages, and assist with coding. Examples include GPT-4, Claude, Gemini, and Llama.
Survey of LLMs (arXiv)
Q3.What is machine learning?
Machine learning is a subset of AI where systems learn to perform tasks by finding patterns in data, rather than being explicitly programmed with rules. A model is trained on examples, adjusts its internal parameters, and then applies what it has learned to new, unseen inputs. Deep learning, a subset of machine learning using multi-layered neural networks, powers most modern AI applications.
Q4.What is the difference between AI and machine learning?
AI is the broader field concerned with building systems that can perform intelligent tasks. Machine learning is one approach to achieving AI, where systems learn from data rather than following hand-written rules. Today, most advanced AI systems are built on machine learning, but the terms are not interchangeable.
Q5.What is generative AI?
Generative AI refers to AI systems that create new content, including text, images, audio, video, and code. Unlike traditional AI that classifies or predicts from existing data, generative AI produces novel outputs. Large language models, image generators like DALL-E and Midjourney, and audio models like ElevenLabs are all examples of generative AI.
Q6.What is a neural network?
A neural network is a computational model loosely inspired by the structure of the human brain, consisting of layers of interconnected nodes that process and transform data. Neural networks learn by adjusting the strength of connections between nodes based on training examples. Deep neural networks with many layers form the foundation of modern AI systems including LLMs.
Q7.What is a transformer model?
A transformer is a neural network architecture introduced in the 2017 paper "Attention Is All You Need" that revolutionized natural language processing. Transformers use a mechanism called self-attention to weigh the relevance of different parts of a sequence relative to each other. Virtually all modern LLMs, including GPT-4, Claude, and Gemini, are built on transformer architectures.
"Attention Is All You Need" (arXiv)
Q8.What is natural language processing (NLP)?
Natural language processing (NLP) is the branch of AI focused on enabling computers to understand, interpret, and generate human language. NLP includes tasks such as text classification, translation, summarization, sentiment analysis, and question answering. Modern NLP is dominated by LLMs that handle many of these tasks within a single model.
Q9.What is training data?
Training data is the dataset used to teach a machine learning model. For LLMs, training data typically consists of hundreds of billions of tokens of text sourced from websites, books, code repositories, and other text sources. The quality, diversity, and scale of training data significantly influence a model's capabilities and any biases it may exhibit.
Q10.What is a foundation model?
A foundation model is a large AI model trained on broad data at scale that can be adapted for a wide range of downstream tasks. Foundation models like GPT-4 and Claude are trained once at great cost and then fine-tuned or prompted for specific applications. The term was introduced by Stanford's Center for Research on Foundation Models.
Stanford CRFM
2. How LLMs Work
Q11.How does an LLM generate text?
LLMs generate text by predicting the most likely next token (word or word fragment) given the preceding context, one token at a time. This process is called autoregressive generation. The model samples from a probability distribution over all possible tokens to produce coherent and contextually appropriate responses.
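This loop can be illustrated with a toy model. The sketch below is illustrative only: real LLMs use a transformer predicting over tens of thousands of subword tokens, whereas the "model" here is just word-pair counts from a tiny invented corpus. The generation loop, however, shows the same autoregressive idea: look at the last token, get candidate continuations, sample one, append, repeat.

```python
import random

# Toy "language model": next-word candidates learned from a tiny corpus.
corpus = "the cat sat on the mat and the cat slept".split()
counts = {}
for prev, nxt in zip(corpus, corpus[1:]):
    counts.setdefault(prev, []).append(nxt)

def generate(start, n_tokens, seed=0):
    rng = random.Random(seed)
    out = [start]
    for _ in range(n_tokens):
        candidates = counts.get(out[-1])
        if not candidates:                      # no known continuation: stop
            break
        out.append(rng.choice(candidates))      # sample the next token
    return " ".join(out)

print(generate("the", 5))
```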
Q12.What are tokens in LLMs?
Tokens are the basic units that LLMs use to process text. A token is typically a word, part of a word, or a punctuation character, depending on the model's tokenizer. For English text, one token is roughly four characters or three-quarters of a word on average. LLM pricing and context window sizes are both measured in tokens.
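The four-characters-per-token rule of thumb can be turned into a quick estimator. This heuristic is approximate; exact counts depend on the model's tokenizer (for example, OpenAI publishes the tiktoken library for its models).

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate using the ~4 characters/token rule of thumb
    for English text. Real counts require the model's own tokenizer."""
    return max(1, round(len(text) / 4))

# 45 characters -> roughly 11 tokens
print(estimate_tokens("Large language models process text as tokens."))
```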
Q13.What is a context window?
A context window is the maximum amount of text an LLM can process in a single interaction, measured in tokens. Everything the model "sees," including the system prompt, conversation history, and the current input, must fit within the context window. Larger context windows allow the model to consider more information at once.
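The budgeting this implies can be sketched in a few lines. The window size and the output reservation below are example values, not any specific model's limits:

```python
CONTEXT_WINDOW = 128_000   # tokens (example figure; varies by model)

def fits(system_tokens, history_tokens, input_tokens, reserved_for_output=4_096):
    """Everything the model sees, plus room for its reply, must fit."""
    used = system_tokens + history_tokens + input_tokens + reserved_for_output
    return used <= CONTEXT_WINDOW

print(fits(500, 90_000, 2_000))    # True: comfortably within the window
print(fits(500, 125_000, 2_000))   # False: history must be trimmed first
```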
Q14.What is prompt engineering?
Prompt engineering is the practice of designing and refining inputs (prompts) to elicit desired outputs from an AI model. Effective prompts provide clear instructions, relevant context, examples, and constraints that guide the model toward accurate and useful responses. Prompt engineering is a key skill for getting reliable results from LLMs.
Q15.What is fine-tuning?
Fine-tuning is the process of further training a pre-trained foundation model on a smaller, task-specific dataset to improve its performance on a particular domain or use case. Fine-tuning adjusts the model's weights to specialize its behavior without training from scratch. It is commonly used to make models follow specific formats, adopt a particular tone, or excel at domain-specific tasks.
Q16.What is Retrieval-Augmented Generation (RAG)?
Retrieval-Augmented Generation (RAG) is a technique that combines an LLM with a retrieval system to provide the model with relevant external information at inference time. Instead of relying solely on knowledge encoded in its parameters, a RAG system fetches relevant documents and includes them in the prompt context. This reduces hallucinations and allows the model to reference up-to-date or proprietary information.
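A minimal sketch of the retrieve-then-prompt pattern, using word overlap in place of real embeddings. Production RAG systems score relevance with embedding vectors and a vector database; the documents and query here are invented examples.

```python
import re

docs = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Shipping is free for orders over $50 in the continental US.",
    "Support hours are 9am to 5pm Eastern, Monday through Friday.",
]

def tokenize(text: str) -> set:
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(query: str, documents: list) -> str:
    # Pick the document sharing the most words with the query.
    q_words = tokenize(query)
    return max(documents, key=lambda d: len(q_words & tokenize(d)))

def build_prompt(query: str) -> str:
    context = retrieve(query, docs)
    return (f"Answer using only the context below.\n"
            f"Context: {context}\n"
            f"Question: {query}")

print(build_prompt("What is your refund policy for returns?"))
```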
Original RAG paper (arXiv)
Q17.What is temperature in LLM outputs?
Temperature is a parameter that controls the randomness of an LLM's output. A temperature of 0 makes the model deterministic, always choosing the most probable next token. Higher temperatures introduce more variety and creativity, but also increase the risk of incoherence. Most LLM APIs allow users to set temperature between 0 and 1 (or sometimes higher).
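Under the hood, temperature divides the model's raw scores (logits) before they are converted to probabilities with a softmax. A small illustration (note that a temperature of exactly 0 is implemented as greedy argmax in practice, since division by zero is undefined):

```python
import math

def softmax_with_temperature(logits, temperature):
    """Convert raw next-token scores into probabilities. Lower temperature
    sharpens the distribution toward the top token; higher flattens it."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)                             # subtract max for stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]                        # three candidate tokens
print(softmax_with_temperature(logits, 0.2))    # top token dominates
print(softmax_with_temperature(logits, 2.0))    # probabilities flatten out
```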
Q18.What are embeddings?
Embeddings are numerical vector representations of text, images, or other data that capture semantic meaning in a high-dimensional space. Items with similar meanings are positioned close together in embedding space. Embeddings are used for semantic search, recommendation, clustering, and as the foundation of RAG systems and vector databases.
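Closeness in embedding space is usually measured with cosine similarity. The sketch below uses tiny hand-made 3-dimensional vectors purely for illustration; real text embeddings produced by an embedding model have hundreds to thousands of dimensions.

```python
import math

def cosine_similarity(a, b):
    """1.0 = same direction, 0.0 = unrelated, -1.0 = opposite."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

dog     = [0.90, 0.80, 0.10]   # invented toy vectors
puppy   = [0.85, 0.75, 0.20]
invoice = [0.10, 0.20, 0.90]

print(cosine_similarity(dog, puppy))    # high: related meanings
print(cosine_similarity(dog, invoice))  # low: unrelated meanings
```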
Q19.What is RLHF (Reinforcement Learning from Human Feedback)?
Reinforcement Learning from Human Feedback (RLHF) is a training technique used to align LLMs with human preferences. Human raters evaluate model outputs, and those ratings train a reward model that guides further fine-tuning of the LLM. RLHF is a key technique behind the helpful and safe behavior of models like ChatGPT, Claude, and Gemini.
RLHF paper (arXiv)
Q20.What is the difference between model parameters and tokens?
Parameters are the internal numerical weights of a neural network learned during training that encode the model's knowledge. A model described as having 70 billion parameters contains 70 billion individual numerical values. Tokens are units of text processed during use. Parameters are a property of the trained model; tokens are units of input and output during inference.
3. Capabilities
Q21.What can LLMs do well?
LLMs excel at tasks involving language: writing, summarizing, translating, explaining, brainstorming, classifying, extracting structured data from unstructured text, answering questions, and generating code. They perform particularly well on tasks where the format of a correct answer is learnable from patterns in training data, and they improve substantially when given clear, detailed prompts.
Q22.What is AI code generation?
AI code generation is the ability of LLMs to write functional code in programming languages like Python, JavaScript, SQL, and many others. Models can generate code from natural language descriptions, complete partially written functions, explain existing code, fix bugs, and write tests. Tools like GitHub Copilot, Cursor, and Claude Code are built on this capability.
Q23.Can LLMs reason?
LLMs can perform many tasks that require reasoning, such as solving math problems, answering multi-step logic questions, and planning sequences of actions. However, LLM reasoning emerges from pattern matching rather than formal logic, and models can make systematic errors on problems requiring precise step-by-step deduction. Techniques like chain-of-thought prompting significantly improve reasoning performance.
Q24.Do LLMs have access to the internet by default?
Standard LLMs do not have real-time internet access. Their knowledge comes from training data with a fixed cutoff date. Some products built on LLMs (such as Perplexity, ChatGPT with browsing, and Gemini with Search) integrate web search as an external tool, but this is an add-on feature, not a property of the base model.
Q25.What is multimodal AI?
Multimodal AI refers to models that can process and generate multiple types of data, such as text, images, audio, and video. A multimodal LLM like GPT-4o or Gemini can accept image inputs alongside text, analyze visual content, and respond with text. Multimodal capabilities allow AI to understand richer real-world inputs beyond text alone.
Q26.What is AI text summarization?
AI text summarization uses LLMs to condense long documents into shorter versions capturing the key points. Summarization can be extractive (selecting key sentences from the original) or abstractive (generating new text that captures the meaning). Modern LLMs perform abstractive summarization well, handling documents up to the length of their context window.
Q27.Can LLMs translate languages?
Yes. Modern LLMs are highly capable at translation between major world languages, often matching or exceeding specialized translation systems for common language pairs. They handle tone, nuance, and context better than earlier statistical translation models. Accuracy decreases for lower-resource languages with less training data.
Q28.What is sentiment analysis with AI?
Sentiment analysis is the use of AI to classify the emotional tone of text, typically as positive, negative, or neutral. LLMs can perform nuanced sentiment analysis beyond simple polarity, identifying specific emotions, assessing intensity, and handling sarcasm more reliably than earlier NLP methods. It is widely used for analyzing customer reviews, social media, and support tickets.
Q29.Can LLMs generate images?
Standard text-based LLMs do not generate images. Image generation requires separate generative models such as DALL-E 3 (OpenAI), Imagen (Google), or Stable Diffusion (Stability AI), which use different architectures such as diffusion models. Some products integrate both LLMs and image models to create multimodal experiences.
Q30.How accurate are LLMs at answering factual questions?
LLM accuracy on factual questions varies significantly by domain and specificity. Models are generally reliable on well-documented topics with extensive training data, but can hallucinate on niche, recent, or highly specific questions. Grounding responses in retrieved source documents (RAG) substantially improves factual accuracy.
4. Limitations and Risks
Q31.What are AI hallucinations?
AI hallucinations occur when an LLM generates text that sounds confident and plausible but is factually incorrect or entirely fabricated. Hallucinations happen because LLMs are trained to produce statistically likely sequences of text, not to verify factual accuracy against ground truth. They are a fundamental limitation of current LLM architectures and a key reason why AI outputs should be verified for high-stakes uses.
Q32.Why do LLMs make factual errors?
LLMs make factual errors because they are trained to predict plausible text, not to retrieve verified facts. They compress knowledge into parameters imperfectly, and some information is learned unreliably. Training data also contains errors and contradictions that models may reproduce. Techniques like RAG and grounding in external sources reduce but do not eliminate factual errors.
Q33.What is a knowledge cutoff date?
A knowledge cutoff date is the point in time beyond which an LLM has no knowledge of world events, because no training data was collected after that date. A model with a cutoff of early 2024 will not know about events that occurred afterward unless that information is provided in the prompt. Providers publish cutoff dates for their models.
Q34.What is AI bias?
AI bias refers to systematic and unfair skews in AI outputs related to characteristics such as race, gender, age, or nationality. Bias in AI often originates from imbalances or stereotypes present in training data, and can manifest as different quality of service, harmful stereotypes, or unequal outcomes for different groups. Addressing AI bias is an active area of research and policy.
Q35.Should AI replace human judgment in important decisions?
Current AI systems should not replace human judgment in high-stakes decisions. LLMs can assist, inform, and accelerate human decision-making, but they lack lived experience, accountability, and the ability to verify their own reasoning reliably. Domains such as medicine, law, finance, and safety-critical systems require human oversight of AI-generated recommendations.
Q36.What is prompt injection?
Prompt injection is an attack where malicious instructions are embedded in content that an LLM processes, such as a document, email, or webpage, causing the model to follow those instructions instead of the user's intended task. It is a significant security concern for AI systems that process external content, analogous to SQL injection in traditional software. Defenses are an active area of research, but no complete solution currently exists.
Q37.What are the privacy risks of using AI?
When users submit text to commercial AI services, that text may be stored, reviewed, or used for training depending on the provider's data practices. Sensitive information including personal details, passwords, or proprietary business data should not be entered into AI systems unless the service has strong, verified privacy guarantees. Enterprise AI agreements and on-premises deployments can mitigate these risks.
Q38.Can LLMs be confidently wrong?
Yes. LLMs frequently express incorrect information with the same confident, fluent tone they use for accurate information. The model has no reliable internal mechanism for knowing what it does not know. This is why AI outputs used in high-stakes settings must be verified against authoritative sources, especially in medical, legal, and financial contexts.
Q39.What are the environmental costs of AI?
Training large AI models requires significant computational resources and energy. The training of GPT-3, for example, was estimated to emit hundreds of tonnes of CO2. Inference (running the model) at scale across millions of users also has a meaningful energy footprint. Major AI providers are investing in renewable energy and more efficient model architectures to address these costs.
Q40.What is AI toxicity?
AI toxicity refers to the generation of harmful, offensive, or abusive content by AI systems. LLMs trained on internet data can reproduce or generate harmful material without safeguards. AI providers use techniques including RLHF, content filtering, and safety fine-tuning to reduce toxicity, but no system eliminates it entirely.
5. Prompting Best Practices
Q41.What makes a good AI prompt?
A good prompt is clear, specific, and provides the context necessary for the model to understand the task. Effective prompts include a clear description of the desired output, any relevant constraints or format requirements, background context, and examples where helpful. The more precisely you describe what you want, the more reliably you will get it.
Q42.What is chain-of-thought prompting?
Chain-of-thought (CoT) prompting is a technique where the user asks the model to reason through a problem step by step before giving a final answer. This approach improves performance on complex reasoning tasks like math, logic, and multi-step problems. Adding "think step by step" or "reason through this carefully" to a prompt can invoke chain-of-thought reasoning.
Chain-of-Thought Prompting paper (arXiv)
Q43.What is few-shot prompting?
Few-shot prompting involves providing the model with a small number of examples (typically 2 to 5) of the desired input-output format before asking it to complete a new task. The examples help the model understand the pattern or format required without explicit instruction. Few-shot prompting is particularly effective for structured output tasks like classification, extraction, and formatting.
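A few-shot prompt is simply the examples laid out in a consistent format ahead of the new input. A sketch for sentiment classification (the reviews and labels are invented examples):

```python
# Two labeled examples show the model the expected input/output format;
# the final lines present the new input for the model to complete.
examples = [
    ("The delivery was fast and the product works great.", "positive"),
    ("It broke after two days and support never replied.", "negative"),
]

def few_shot_prompt(new_text: str) -> str:
    lines = ["Classify the sentiment of each review as positive or negative.", ""]
    for text, label in examples:
        lines.append(f"Review: {text}")
        lines.append(f"Sentiment: {label}")
        lines.append("")
    lines.append(f"Review: {new_text}")
    lines.append("Sentiment:")
    return "\n".join(lines)

print(few_shot_prompt("Decent quality, but shipping took three weeks."))
```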
Q44.What is zero-shot prompting?
Zero-shot prompting means asking an LLM to perform a task without providing any examples. The model relies entirely on its training to understand and complete the request. Modern LLMs like GPT-4 and Claude are capable zero-shot learners for many tasks, requiring only clear natural-language instructions.
Q45.What is role prompting?
Role prompting assigns a persona or role to the LLM at the start of a conversation, for example "You are an expert tax accountant." This primes the model to draw on relevant knowledge and adopt an appropriate tone for the assigned role. The model's actual capabilities do not change, but role prompting shapes framing, vocabulary, and emphasis.
Q46.How do you reduce hallucinations through prompting?
To reduce hallucinations, provide source documents in the prompt context (RAG), ask the model to cite specific sources, instruct it to say "I don't know" when uncertain, and break complex questions into smaller verifiable steps. Setting temperature to 0 for factual tasks and requesting that the model explain its reasoning also help reduce confident errors.
Q47.What is prompt chaining?
Prompt chaining is the practice of breaking a complex task into a sequence of simpler prompts where the output of one becomes the input to the next. This approach is more reliable than asking a single prompt to accomplish an overly complex task. Prompt chaining is the foundation of many AI agent and workflow automation systems.
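The pattern can be sketched as ordinary function composition. In the sketch below, call_model is a stub standing in for a real LLM API call via a provider SDK, so only the control flow is shown; the task decomposition (summarize, then extract, then draft) is an invented example.

```python
def call_model(prompt: str) -> str:
    # Stub for a real LLM API call; returns a tagged echo of the prompt.
    return f"<model output for: {prompt[:40]}...>"

def summarize(document: str) -> str:
    return call_model(f"Summarize this document in three bullets:\n{document}")

def extract_actions(summary: str) -> str:
    return call_model(f"List the action items implied by this summary:\n{summary}")

def draft_email(actions: str) -> str:
    return call_model(f"Draft a short email assigning these action items:\n{actions}")

# Chain: document -> summary -> action items -> email draft
email = draft_email(extract_actions(summarize("Q3 planning meeting notes ...")))
print(email)
```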
Q48.What is a system prompt?
A system prompt is an initial instruction provided to an LLM before the user's input that sets the model's behavior, persona, and constraints for the entire conversation. API users can define system prompts to customize how the model responds to all subsequent messages. Many enterprise AI applications use system prompts to scope the model's behavior to a specific use case.
Q49.How do you get consistent outputs from an LLM?
Consistency is improved by setting temperature to 0, using precise and detailed prompts, providing examples of desired output format, and requesting structured outputs like JSON where possible. For production systems, maintaining version-controlled prompts, testing across representative inputs, and monitoring output quality over time help sustain consistency as models evolve.
Q50.What is structured output in LLMs?
Structured output refers to the ability to instruct an LLM to respond in a specific machine-readable format, such as JSON or XML, rather than free-form prose. Most major LLM APIs support structured output or JSON mode, which constrains the model to produce valid structured responses. Structured outputs are essential for building reliable AI pipelines and integrations.
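Even with JSON mode enabled, production code should parse and validate the response before using it. A minimal sketch, where the response string and field names are hypothetical:

```python
import json

# Example of what a structured model response might look like.
model_response = '{"name": "Acme Corp", "sentiment": "positive", "score": 0.92}'

def parse_response(raw: str) -> dict:
    data = json.loads(raw)  # raises ValueError on malformed JSON
    for field in ("name", "sentiment", "score"):
        if field not in data:
            raise ValueError(f"missing field: {field}")
    return data

result = parse_response(model_response)
print(result["sentiment"], result["score"])
```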
6. Business Use Cases
Q51.How can AI help with customer service?
AI can automate responses to common customer questions, route support tickets, summarize customer history for agents, and draft responses for human review. LLMs enable 24/7 availability and can handle high volumes of routine inquiries at low cost. The best implementations combine AI automation for routine tasks with human handoff for complex or sensitive issues.
Q52.How can businesses use AI for content creation?
Businesses use AI to draft blog posts, social media content, email campaigns, product descriptions, and internal documents. AI dramatically reduces the time from brief to first draft and allows rapid iteration. Human review and editing remain important for accuracy, brand voice, and quality control.
Q53.What is AI workflow automation?
AI workflow automation uses AI models to execute multi-step business processes that previously required human intervention. Examples include automated data extraction from documents, email triage and response drafting, lead qualification, and report generation. When combined with integration platforms, AI can act as an intelligent layer across a company's existing software stack.
Q54.How can AI improve sales processes?
AI can help sales teams by researching prospects, drafting personalized outreach, summarizing call transcripts, scoring leads, updating CRM records, and identifying upsell opportunities from customer data. AI-assisted sales tools let reps focus more time on conversations while reducing administrative overhead.
Q55.Can AI help with financial analysis?
AI can assist with financial analysis by extracting data from documents, summarizing financial reports, building models from natural language instructions, and identifying anomalies in transaction data. AI outputs in financial contexts require careful human review, as errors can have significant consequences. AI is most valuable as an accelerant for analyst workflows, not a replacement for professional judgment.
Q56.How can small businesses use AI?
Small businesses can use AI to write marketing copy, respond to customer inquiries, manage social media, draft contracts and standard documents, and automate repetitive back-office tasks. Consumer tools like ChatGPT and Claude require no technical expertise and offer significant time savings at low cost. The highest-value applications are usually those that eliminate repetitive, predictable work.
Q57.What is AI-powered search?
AI-powered search uses LLMs to understand the intent behind a query and return synthesized answers rather than just a list of links. Systems like Perplexity AI, Google AI Overviews, and Bing Copilot combine web search with LLM synthesis to provide direct answers with citations. AI search is changing how users find information and has significant implications for content and SEO strategy.
Q58.How can AI help with HR and recruiting?
AI can help HR teams by screening and ranking resumes, drafting job descriptions, generating interview questions, summarizing candidate profiles, and answering employee policy questions. AI tools must be used carefully in hiring to avoid amplifying bias; human review of AI-assisted screening decisions is essential and in many jurisdictions legally required.
Q59.What is AI-assisted decision making?
AI-assisted decision making uses AI to analyze data, surface patterns, generate options, and provide recommendations that inform human decisions. The human remains responsible for the final decision. Examples include credit risk scoring, medical diagnosis support, supply chain optimization, and fraud detection. The core principle is human oversight of consequential decisions.
Q60.How is AI used in marketing?
AI is used in marketing for content generation, audience segmentation, personalization, A/B test analysis, ad copy optimization, and customer journey mapping. AI tools allow marketing teams to produce more personalized content at scale and make faster, data-driven decisions. The highest-value applications connect AI to proprietary customer and performance data.
7. Safety and Ethics
Q61.What is AI safety?
AI safety is the research field focused on ensuring that AI systems behave as intended and do not cause unintended harm. It encompasses near-term concerns (reducing harmful outputs, preventing misuse) and longer-term concerns (ensuring advanced AI remains aligned with human values as capabilities increase). Major AI labs including Anthropic, OpenAI, Google DeepMind, and Meta have dedicated AI safety research teams.
Q62.What is responsible AI?
Responsible AI refers to the principles and practices for developing and deploying AI in ways that are fair, transparent, accountable, safe, and beneficial to society. Responsible AI frameworks typically address bias mitigation, explainability, human oversight, privacy, and security. Organizations including Microsoft, Google, IBM, and the OECD have published responsible AI principles.
OECD AI Principles
Q63.What are AI guardrails?
AI guardrails are technical and policy mechanisms that limit what an AI system will say or do, preventing harmful, unethical, or inappropriate outputs. Guardrails include safety fine-tuning (RLHF), content filtering, output classifiers, and system prompt restrictions. All major AI providers implement guardrails, though their scope and strength vary by provider and use case.
Q64.What is the EU AI Act?
The EU AI Act is a comprehensive regulatory framework for artificial intelligence in the European Union that became law in 2024. It classifies AI systems by risk level (unacceptable, high, limited, and minimal risk) and imposes requirements proportional to risk, including transparency, human oversight, and documentation requirements. It is the first comprehensive AI law enacted by a major jurisdiction.
EU AI Act overview
Q65.What is AI alignment?
AI alignment is the challenge of ensuring AI systems pursue goals and exhibit behavior consistent with human values and intentions. A misaligned AI might optimize for a measurable proxy goal in ways that produce harmful side effects. Alignment is considered a central research challenge by major AI safety organizations, particularly as models become more capable and autonomous.
Q66.What is a deepfake?
A deepfake is synthetic media, typically video or audio, generated by AI to realistically depict a person saying or doing something they did not actually say or do. Deepfakes are created using generative AI models trained on real media of the target person. They raise significant concerns about disinformation, fraud, and non-consensual use, and have prompted legislation in many jurisdictions.
Q67.Who owns content generated by AI?
The legal status of AI-generated content ownership varies by jurisdiction and is still evolving. In the United States, the Copyright Office has ruled that purely AI-generated works without meaningful human authorship are not eligible for copyright protection. Content involving substantial human creative input in the prompting or editing process may qualify for protection. Users should review the terms of service of their AI provider.
Q68.What are the main ethical concerns with AI?
Key ethical concerns include: bias and discrimination in AI outputs, erosion of privacy through data collection, workforce displacement, spread of misinformation via AI-generated content, misuse for surveillance and manipulation, lack of transparency in AI decision-making, and environmental impact from training large models. These concerns are actively addressed by researchers, policymakers, and AI developers.
Q69.What is AI transparency?
AI transparency refers to the degree to which the behavior, capabilities, limitations, and decision-making of an AI system can be understood and audited by users and affected parties. Transparency measures include model cards (documentation of capabilities and limitations), explainability tools, and disclosure requirements. Transparency is a cornerstone of responsible AI deployment.
Q70.What is Constitutional AI?
Constitutional AI is a training approach developed by Anthropic in which an AI model is trained to follow a set of explicitly stated principles (a "constitution") rather than relying solely on human feedback for every behavior. The model critiques and revises its own outputs against the constitution during training. This approach is designed to make AI alignment more transparent and scalable.
Constitutional AI paper (Anthropic)
8. Leading AI Models and Providers
Q71.What is GPT-4?
GPT-4 is a large multimodal language model developed by OpenAI, released in March 2023. It can process both text and images and is capable of advanced reasoning, complex instruction following, and high-quality text generation across many domains. GPT-4 and its successors power ChatGPT Plus and the OpenAI API.
OpenAI GPT-4 research
Q72.What is Claude?
Claude is a large language model family developed by Anthropic, designed with a strong emphasis on safety, helpfulness, and honesty. Claude models are trained using Constitutional AI, a technique developed by Anthropic to align AI behavior with a set of stated principles. Claude is available via the Anthropic API and the Claude.ai web and mobile interfaces.
Anthropic Claude
Q73.What is Gemini?
Gemini is Google's family of multimodal AI models designed to process text, images, audio, video, and code. Gemini models power Google's AI products including Search AI Overviews, Google Workspace features, and the Gemini chatbot. The family ranges from Gemini Nano (for on-device use) to Gemini Ultra (for complex reasoning tasks).
Google Gemini
Q74.What is Llama?
Llama is a family of open-weight large language models developed by Meta AI. Unlike proprietary models, Llama model weights are publicly released, allowing researchers and developers to download, run, and fine-tune the models. Llama models have become the leading open-weight foundation models, widely used for research and commercial applications.
Meta Llama
Q75.What is Mistral AI?
Mistral AI is a French AI company founded in 2023 that has released several highly efficient open-weight language models. Mistral's models are notable for delivering strong performance relative to their size, making them popular for developers who want capable, cost-effective models that can be run locally or via API.
Mistral AI
Q76.What is the difference between proprietary and open-weight AI models?
Proprietary AI models (such as GPT-4, Claude, and Gemini) are owned by companies that do not release model weights. Access is only available via API or consumer products, and the provider controls safety measures and pricing. Open-weight models (such as Llama and Mistral) release the model weights publicly, enabling local deployment, customization, and research without the managed safety infrastructure of proprietary APIs.
Q77.What is OpenAI?
OpenAI is an AI research company founded in 2015, creator of the GPT series of language models, the DALL-E image generation models, and the Whisper speech recognition system. OpenAI operates ChatGPT and the OpenAI API and has a capped-profit structure with Microsoft as its primary investor.
OpenAI
Q78.What is Anthropic?
Anthropic is an AI safety company founded in 2021 by former OpenAI researchers including Dario and Daniela Amodei. Anthropic develops the Claude family of models, with a research focus on AI safety and alignment. Anthropic is backed by major investors including Amazon and Google.
AnthropicQ79.What is Google DeepMind?
Google DeepMind is Google's primary AI research organization, formed in 2023 by merging Google Brain and DeepMind. It is responsible for developing the Gemini model family, as well as landmark AI research including AlphaFold (protein structure prediction) and AlphaGo (the first AI to defeat a world champion Go player).
Google DeepMindQ80.What is an AI API?
An AI API (Application Programming Interface) is a service that allows developers to access AI model capabilities programmatically by sending requests with inputs and receiving AI-generated outputs. APIs from OpenAI, Anthropic, and Google allow developers to build AI-powered applications without managing the underlying model infrastructure. Pricing is typically based on tokens processed.
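As a minimal sketch, the request shape used by many chat-style AI APIs looks like the following. Field names vary by provider, and the model name here is purely illustrative; in practice this JSON body would be sent as an HTTPS POST with an API key in a header.

```python
import json

# Sketch of a chat-style API request body (field names vary by provider;
# the model name is an illustrative placeholder, not a recommendation).
def build_chat_request(user_message: str, model: str = "example-model") -> str:
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_message},
        ],
        "max_tokens": 256,  # cap on output tokens, which are billed separately
    }
    return json.dumps(payload)

request_body = build_chat_request("Summarize this paragraph: ...")
print(request_body)
```

The provider's response typically echoes the model name and returns the generated message plus a token-usage count used for billing.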
9Implementation and Costs
Q81.How much does it cost to use ChatGPT?
ChatGPT is available in a free tier providing access to GPT-4o mini and limited access to GPT-4o. ChatGPT Plus costs $20 per month and provides priority access to the latest GPT-4 models. For API access, pricing is charged per million tokens and varies by model. Pricing changes frequently as models improve and compute costs decrease.
OpenAI pricingQ82.How is LLM API usage priced?
LLM API pricing is typically charged separately for input tokens (what you send to the model) and output tokens (what the model generates). Costs vary significantly by model capability and provider. Smaller, faster models cost a fraction of a cent per thousand tokens, while the most capable models cost several cents per thousand tokens. Most providers offer a free tier for experimentation.
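The arithmetic behind per-token billing can be sketched as follows. The two rates are made-up placeholders, not any provider's actual prices; the point is that input and output tokens are metered separately.

```python
# Illustrative per-token cost estimate. Rates are hypothetical placeholders,
# not real provider prices; output tokens usually cost more than input tokens.
INPUT_PRICE_PER_MILLION = 0.50   # USD per 1M input tokens (example rate)
OUTPUT_PRICE_PER_MILLION = 1.50  # USD per 1M output tokens (example rate)

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    return (input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION
            + output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MILLION)

# A request with a 2,000-token prompt and a 500-token reply:
print(f"${estimate_cost(2_000, 500):.6f}")  # → $0.001750
```

At rates like these a single request costs a fraction of a cent, which is why cost modeling usually happens at the scale of millions of requests.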
Q83.What is AI inference?
AI inference is the process of running a trained AI model to generate predictions or outputs in response to new inputs. Inference is distinct from training (which creates the model) and is what happens every time you send a message to an AI chatbot or call an AI API. Inference cost, latency, and energy consumption are key factors in the economics of AI products.
Q84.What is a vector database?
A vector database is a specialized database designed to store and efficiently search high-dimensional vector embeddings. Vector databases are a core component of RAG systems, enabling fast semantic search across large document collections. Popular vector databases include Pinecone, Weaviate, Chroma, and pgvector (a PostgreSQL extension).
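The core operation a vector database performs, nearest-neighbor search by similarity, can be sketched with a toy in-memory store. Real systems index millions of high-dimensional embeddings with approximate algorithms; these 3-dimensional vectors and document names are purely illustrative.

```python
import math

# Toy "vector store": map each document to an embedding, then rank documents
# by cosine similarity to a query vector. Vectors here are illustrative.
def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

store = {
    "refund policy":  [0.9, 0.1, 0.0],
    "shipping times": [0.1, 0.8, 0.2],
    "warranty terms": [0.7, 0.2, 0.3],
}

def search(query_vector, top_k=2):
    ranked = sorted(store, key=lambda doc: cosine_similarity(query_vector, store[doc]),
                    reverse=True)
    return ranked[:top_k]

print(search([0.85, 0.15, 0.1]))  # → ['refund policy', 'warranty terms']
```

A production vector database adds approximate indexing (so search stays fast at scale), metadata filtering, and persistence on top of this basic similarity ranking.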
Q85.What is an AI agent?
An AI agent is an AI system that can take actions in the world, not just generate text. Agents are given tools such as web search, code execution, file access, or API calls, and can autonomously plan and execute sequences of actions to accomplish a goal. Agents operate in a loop: they observe, reason, act, and observe again until a task is complete.
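The observe-reason-act loop described above can be sketched with a stubbed "reasoning" step. In a real agent, an LLM decides which tool to call; here a hard-coded rule stands in for the model, and the single calculator tool is a toy.

```python
# Minimal agent loop sketch: observe -> reason -> act, repeated until done.
# `stub_reason` stands in for an LLM choosing the next action.
def stub_reason(observation):
    if "result" in observation:
        return ("finish", observation["result"])  # goal reached: stop
    return ("calculator", "19 * 21")              # otherwise: use a tool

tools = {"calculator": lambda expr: {"result": eval(expr)}}  # toy tool only

def run_agent(goal, max_steps=5):
    observation = {"goal": goal}
    for _ in range(max_steps):
        action, arg = stub_reason(observation)    # reason about what to do
        if action == "finish":
            return arg
        observation = tools[action](arg)          # act, then observe result
    return None  # step budget exhausted

print(run_agent("What is 19 * 21?"))  # → 399
```

The `max_steps` budget is a common safeguard: because agents act autonomously, production systems bound the loop to limit runaway behavior and cost.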
Q86.What is agentic AI?
Agentic AI refers to AI systems that operate with greater autonomy, taking multi-step actions, making decisions, and using tools over extended periods with limited human oversight. Agentic systems differ from single-turn chatbots in that they can pursue goals across many steps. Agentic AI introduces new challenges around reliability, safety, and oversight.
Q87.What is multi-agent AI?
Multi-agent AI systems use multiple AI agents that collaborate or specialize to solve complex problems. One agent might plan tasks, another executes them, and another reviews outputs. Multi-agent frameworks such as AutoGen and CrewAI enable more sophisticated automation by decomposing complex workflows across specialized agents.
Q88.What is function calling in LLMs?
Function calling (also called tool use) is a capability that allows LLMs to trigger external functions or APIs in a structured way. Instead of generating plain text, the model outputs a structured request to call a specified function with given parameters. This allows LLMs to perform actions such as retrieving data, running calculations, sending messages, or updating records in external systems.
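The pattern can be sketched as follows: the model emits a structured call naming a function and its arguments, and the application dispatches it. The model output here is hard-coded JSON for illustration, and `get_weather` is a hypothetical stub, not a real API.

```python
import json

# Function-calling sketch: the model's output (hard-coded here) names a tool
# and arguments; the application looks it up in a registry and executes it.
model_output = json.dumps({
    "tool": "get_weather",
    "arguments": {"city": "Paris", "unit": "celsius"},
})

def get_weather(city: str, unit: str) -> str:
    # Stub: a real implementation would call a weather API.
    return f"18 degrees {unit} in {city}"

registry = {"get_weather": get_weather}

call = json.loads(model_output)
result = registry[call["tool"]](**call["arguments"])
print(result)  # → 18 degrees celsius in Paris
```

In a full exchange, the application then passes `result` back to the model so it can compose a natural-language answer that incorporates the tool's output.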
Q89.How do you evaluate LLM output quality?
LLM output quality can be evaluated using automated metrics, human evaluation, and task-specific benchmarks. For production systems, best practice is to build a test set of representative inputs with expected outputs and run evaluations against it regularly. Human evaluation remains the gold standard for open-ended generation tasks, since automated metrics often miss nuance.
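A regression-style evaluation over a test set can be sketched like this. The `fake_llm` function stands in for a real model call, and the three test cases are illustrative; real eval sets are larger and often use fuzzy or model-graded scoring rather than exact string match.

```python
# Eval-harness sketch: run each case through the system and compute accuracy.
# `fake_llm` is a stand-in for a real model call; exact-match scoring is the
# simplest scheme and is often replaced by fuzzy or model-graded scoring.
def fake_llm(prompt: str) -> str:
    canned = {"Capital of France?": "Paris", "2 + 2?": "4"}
    return canned.get(prompt, "unknown")

test_set = [
    {"input": "Capital of France?", "expected": "Paris"},
    {"input": "2 + 2?", "expected": "4"},
    {"input": "Largest ocean?", "expected": "Pacific"},
]

def run_eval(system, cases):
    passed = sum(1 for c in cases if system(c["input"]) == c["expected"])
    return passed / len(cases)

print(f"accuracy: {run_eval(fake_llm, test_set):.0%}")  # → accuracy: 67%
```

Tracking this score over time catches regressions when prompts, models, or retrieval pipelines change.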
Q90.What is AI orchestration?
AI orchestration refers to the coordination of multiple AI models, tools, and data sources to complete complex tasks. Orchestration frameworks like LangChain and LlamaIndex provide tooling for building pipelines that sequence prompts, retrieve documents, call external APIs, and route between different models. Orchestration is the foundation of sophisticated AI agent systems.
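A minimal orchestration pipeline, with every stage stubbed, can be sketched as a chain of functions: retrieve context, assemble a prompt, call a model. The three functions here are hypothetical placeholders; frameworks like LangChain provide production-grade versions of each stage.

```python
# Orchestration sketch: each stage consumes the previous stage's output.
# All three stages are stubs standing in for retrieval, prompt templating,
# and a real model call.
def retrieve(query: str) -> str:
    return f"[doc snippet about {query}]"          # stub document retrieval

def build_prompt(query: str, context: str) -> str:
    return f"Answer using context: {context}\nQuestion: {query}"

def stub_model(prompt: str) -> str:
    n_docs = prompt.count("[")                     # crude stand-in for an LLM
    return f"answer based on {n_docs} retrieved document(s)"

def pipeline(query: str) -> str:
    context = retrieve(query)
    prompt = build_prompt(query, context)
    return stub_model(prompt)

print(pipeline("refund policy"))  # → answer based on 1 retrieved document(s)
```

Real orchestration adds branching (routing between models), retries, and logging around this same chain-of-stages shape.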
10The Future of AI
Q91.What is AGI (Artificial General Intelligence)?
Artificial General Intelligence (AGI) refers to a hypothetical AI system that can perform any intellectual task that a human can, with comparable or superior capability. AGI would represent a qualitative shift from today's narrow AI systems, which excel at specific tasks but lack general-purpose intelligence. There is significant debate among researchers about the definition, timeline, and feasibility of AGI.
Q92.What is the difference between narrow AI and AGI?
Narrow AI (also called weak AI) refers to AI systems designed for specific tasks such as image recognition, language translation, or playing chess. All commercially deployed AI systems today are narrow AI. AGI would be an AI system capable of performing any cognitive task a human can, without needing to be specifically trained for each new task.
Q93.What are AI scaling laws?
AI scaling laws describe predictable relationships between the amount of compute, data, and model size used in training and the resulting performance of AI models. Research from OpenAI, DeepMind, and others has shown that increasing these factors leads to consistent, measurable improvements. Scaling laws have driven much of the rapid capability growth in AI over recent years.
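The typical mathematical form of a scaling law is a power law: loss falls predictably as compute grows. The constant and exponent below are made-up for illustration, not values from any paper; the point is the smooth, predictable improvement per doubling.

```python
# Illustrative power-law scaling: loss(C) = a * C^(-alpha).
# The constant and exponent are made-up placeholders, not published values.
def predicted_loss(compute: float, constant: float = 10.0,
                   exponent: float = 0.05) -> float:
    return constant * compute ** -exponent

# Doubling compute reduces loss by a fixed fraction under a power law:
improvement = 1 - predicted_loss(2.0) / predicted_loss(1.0)
print(f"{improvement:.1%} lower loss per doubling of compute")
```

This smoothness is what makes scaling laws useful for planning: labs can estimate the performance of a large training run from much smaller, cheaper runs.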
Scaling Laws paper (arXiv)Q94.What are AI reasoning models?
AI reasoning models are LLMs specifically optimized for complex, multi-step reasoning tasks such as mathematics, coding, and logical problem solving. Examples include OpenAI's o1 and o3 series, which are trained to "think" through problems with extended internal reasoning before producing a final answer. Reasoning models trade speed and cost for significantly improved accuracy on hard problems.
Q95.What are AI coding assistants?
AI coding assistants are tools that integrate LLMs into the software development workflow to help developers write, review, debug, and document code. Examples include GitHub Copilot, Cursor, and Claude Code. These tools can autocomplete code, generate functions from natural language descriptions, explain existing code, fix bugs, and write tests, significantly accelerating development productivity.
Q96.What is the future of AI in the workplace?
AI is expected to augment most knowledge-work roles by automating routine and repetitive tasks, freeing workers to focus on higher-judgment activities. Research from McKinsey, Goldman Sachs, and the World Economic Forum suggests that AI will transform rather than simply eliminate most job categories. New roles related to AI oversight, prompt engineering, and AI integration are emerging rapidly.
McKinsey: Generative AI and the future of workQ97.Will AI replace jobs?
AI will automate specific tasks within jobs rather than replacing entire roles wholesale in most cases. Jobs involving routine, predictable, and text-based work face the highest exposure to automation. Jobs requiring physical dexterity, social and emotional intelligence, creative judgment, and novel problem-solving are less susceptible. Most economic forecasts expect AI to create new categories of work even as it displaces others.
Q98.What is AI regulation?
AI regulation refers to legal and policy frameworks that govern the development and deployment of AI systems. Regulatory approaches vary widely: the EU AI Act is a comprehensive risk-based framework; the United States has issued executive orders and sector-specific guidance; China has enacted regulations on generative AI content. Regulation is evolving rapidly alongside AI capabilities.
Q99.What is open-source AI?
Open-source AI refers to AI models, datasets, and tools whose weights and code are publicly released, allowing anyone to inspect, use, and modify them. Organizations like Meta (Llama), Mistral AI, and Stability AI have released open models. The Open Source Initiative has published a formal definition of open-source AI to clarify what "open" means in the AI context.
Open Source AI DefinitionQ100.What is the current state of AI development?
As of 2025, AI development is advancing rapidly, with frontier models demonstrating strong performance across language, reasoning, coding, and multimodal tasks. The field is characterized by intensive competition among major AI labs, growing regulatory attention, and rapid commercial deployment across industries. Key open challenges include reducing hallucinations, improving reliability in agentic systems, and managing the societal implications of accelerating AI capabilities.
Stanford AI Index Report