Chapter 2: Foundations of Responsible AI#

Why it matters#

Evaluation (Chapter 1) tells you how a model behaves. Responsible AI tells you how it should behave, and how to build an organization that delivers that. This chapter defines responsible AI, introduces its dimensions, shows that responsibility spans the whole design-build-operate lifecycle, and gives a concrete method for assessing the risk of an AI application.

What is responsible AI?#

Definition

Responsible AI is the practice of designing, developing, and using AI technology with the goal of maximizing benefits and minimizing risks and unintended harms.

In practice, responsible AI is defined through a core set of dimensions that an organization assesses and updates over time as the technology evolves. Three points are easy to miss:

  • Responsible AI is an organizational structure, a set of principles, and a practice, not a subdomain of AI you can delegate to one team.

  • The dimensions depend on the organization and its responsible-AI maturity.

  • New dimensions can emerge as new scientific evidence appears.

Why so much focus on it? Because building and maintaining customer trust is a top priority, and generative AI introduces new failure modes: hallucinations and inaccuracies, instructions that leak private information, biased or hateful text, and unlicensed or unlawful content. A regulatory landscape is actively developing to understand and mitigate these risks.

The responsible AI dimensions#

AWS frames responsible AI around eight dimensions. Chapter 3 explores each in depth; here is the map:

Dimension

In one line

Controllability

Mechanisms to monitor and steer AI system behavior.

Privacy & Security

Appropriately obtaining, using, and protecting data and models.

Safety

Preventing harmful system output and misuse.

Fairness

Considering impacts on different groups of stakeholders.

Veracity & Robustness

Achieving correct outputs, even with unexpected or adversarial inputs.

Explainability

Understanding and evaluating outputs generated by an AI system.

Transparency

Enabling stakeholders to make informed choices about engaging with the system.

Governance

Incorporating best practices across the AI supply chain, providers and deployers alike.

Responsibility spans the whole lifecycle#

No matter which part of the lifecycle you work on, design, build, or operate, you should always consider responsible AI.

Design. Discuss the use case with diverse stakeholders, evaluate whether AI actually adds value, and conduct a thorough risk assessment of the proposed use case.

Build. Verify training data is safe, relevant, and representative; consider legal requirements (licensing, privacy, consent); use metrics with confidence intervals to evaluate outcomes; and apply safeguards, value alignment, and (where appropriate) model disgorgement to mitigate risk.

Operate. Give end users a way to inquire about outputs for high-risk use cases and be transparent about limitations; check for model drift as the world changes; and ensure the model is used as intended (a model trained on US data should be used for the US context).

Assessing the risk of an AI application#

Risk assessment is a structured, three-step process:

  1. Define the use case and the relevant stakeholders.

  2. Identify harmful events and evaluate both inherent and residual risk.

  3. Summarize risk levels across all dimensions and conclude findings.

Quantifying risk: likelihood and severity#

Following the NIST AI Risk Management Framework, risk is quantified along two axes:

  • Likelihood: how probable an event is.

  • Severity: the magnitude of its consequences.

Each is scored on a scale. Likelihood runs from Highly unlikely (less than once per decade) through Possible to Frequent (more than 100 times a year). Severity, for a given dimension, runs from Very low to Extreme. For the veracity dimension, for instance, Very low severity is negligible hallucination, while Extreme is persuasive, dangerous output causing irreversible real-world harm.

Combining the two in a matrix yields an overall rating:

Table 6 Likelihood x Severity (illustrative)#

Likelihood \ Severity

Very low

Low

Moderate

Major

Extreme

Frequent

Low

Medium

High

Critical

Critical

Possible

Very Low

Low

Medium

High

Critical

Highly unlikely

Very Low

Very Low

Very Low

Low

High

Worked example: a medical triage assistant

For a symptom-checking assistant, a veracity failure (a confident but wrong suggestion) is Possible in likelihood and Major-to-Extreme in severity, landing it at High or Critical risk. That rating tells you to add human oversight, strong disclaimers, and tight guardrails before launch, the techniques of Chapter 4.

The NIST AI Risk Management Framework 1.0#

The likelihood-and-severity approach above comes from the NIST AI Risk Management Framework (AI RMF 1.0), a voluntary framework published by the U.S. National Institute of Standards and Technology in January 2023 to help organizations manage the risks of AI systems. It is worth knowing in its own right because it has become a common reference point for responsible-AI governance.

Characteristics of trustworthy AI#

The framework defines AI risk as a function of the likelihood of an event and the magnitude (severity) of its impact, and it organizes “trustworthiness” into seven characteristics. They map closely onto this module’s dimensions:

NIST trustworthiness characteristic

Related dimension in this book

Valid and reliable

Veracity and robustness (Chapter 3: Dimensions of Responsible AI)

Safe

Safety

Secure and resilient

Privacy and security; robustness

Accountable and transparent

Transparency; governance

Explainable and interpretable

Explainability

Privacy-enhanced

Privacy and security

Fair, with harmful bias managed

Fairness

The four core functions#

The AI RMF organizes practice into four functions, which align with the design-build-operate lifecycle above:

Function

What it covers

Govern

A culture of risk management: policies, accountability, roles, and oversight that cut across the other three functions.

Map

Establish context and identify risks: the use case, stakeholders, and where harms could arise (the “define and identify” steps above).

Measure

Analyze, assess, and track risks using quantitative and qualitative methods (the evaluation of Chapter 1: Evaluating LLMs and the likelihood x severity rating).

Manage

Prioritize and act on risks: allocate resources, apply mitigations (guardrails, oversight), and monitor over time.

Why it matters here

Using the AI RMF as scaffolding means the responsible-AI work in this module, evaluation, the dimensions, and the security-and-safety techniques, lines up with a recognized national framework, which is exactly what auditors, funders, and regulators increasingly expect. Consult the official framework (https://www.nist.gov/itl/ai-risk-management-framework) for the authoritative text; details and companion profiles are periodically updated.

In the news#

Responsible AI has moved from voluntary principle to emerging regulation, with frameworks such as the EU AI Act and the NIST AI Risk Management Framework shaping how organizations classify and govern AI by risk level. The dimensions in this chapter map closely onto these regimes, which is why treating responsible AI as a governance practice, rather than a feature, increasingly aligns with legal obligation, not just good intentions.

Key takeaways#

  • Responsible AI maximizes benefit and minimizes harm, defined through evolving dimensions and practiced organization-wide.

  • It spans the design-build-operate lifecycle; everyone is responsible.

  • Risk assessment is define -> identify and evaluate -> summarize, with risk quantified as likelihood x severity per the NIST framework.

  • Governance ties the practice together across teams.

Next, we examine each responsible-AI dimension in detail.