Chapter 2: Foundations of Responsible AI

Chapter 2: Foundations of Responsible AI#

Why it matters#

Evaluation (Chapter 1) tells you how a model behaves. Responsible AI tells you how it should behave, and how to build an organization that delivers that. This chapter defines responsible AI, introduces its dimensions, shows that responsibility spans the whole design-build-operate lifecycle, and gives a concrete method for assessing the risk of an AI application.

What is responsible AI?#

Definition

Responsible AI is the practice of designing, developing, and using AI technology with the goal of maximizing benefits and minimizing risks and unintended harms.

In practice, responsible AI is defined through a core set of dimensions that an organization assesses and updates over time as the technology evolves. Three points are easy to miss:

Responsible AI is an organizational structure, a set of principles, and a practice, not a subdomain of AI you can delegate to one team.
The dimensions depend on the organization and its responsible-AI maturity.
New dimensions can emerge as new scientific evidence appears.

Why so much focus on it? Because building and maintaining customer trust is a top priority, and generative AI introduces new failure modes: hallucinations and inaccuracies, instructions that leak private information, biased or hateful text, and unlicensed or unlawful content. A regulatory landscape is actively developing to understand and mitigate these risks.

The responsible AI dimensions#

AWS frames responsible AI around eight dimensions. Chapter 3 explores each in depth; here is the map:

Dimension	In one line
Controllability	Mechanisms to monitor and steer AI system behavior.
Privacy & Security	Appropriately obtaining, using, and protecting data and models.
Safety	Preventing harmful system output and misuse.
Fairness	Considering impacts on different groups of stakeholders.
Veracity & Robustness	Achieving correct outputs, even with unexpected or adversarial inputs.
Explainability	Understanding and evaluating outputs generated by an AI system.
Transparency	Enabling stakeholders to make informed choices about engaging with the system.
Governance	Incorporating best practices across the AI supply chain, providers and deployers alike.

Responsibility spans the whole lifecycle#

No matter which part of the lifecycle you work on, design, build, or operate, you should always consider responsible AI.

Design. Discuss the use case with diverse stakeholders, evaluate whether AI actually adds value, and conduct a thorough risk assessment of the proposed use case.

Build. Verify training data is safe, relevant, and representative; consider legal requirements (licensing, privacy, consent); use metrics with confidence intervals to evaluate outcomes; and apply safeguards, value alignment, and (where appropriate) model disgorgement to mitigate risk.

Operate. Give end users a way to inquire about outputs for high-risk use cases and be transparent about limitations; check for model drift as the world changes; and ensure the model is used as intended (a model trained on US data should be used for the US context).

Assessing the risk of an AI application#

Risk assessment is a structured, three-step process:

Define the use case and the relevant stakeholders.
Identify harmful events and evaluate both inherent and residual risk.
Summarize risk levels across all dimensions and conclude findings.

Quantifying risk: likelihood and severity#

Following the NIST AI Risk Management Framework, risk is quantified along two axes:

Likelihood: how probable an event is.
Severity: the magnitude of its consequences.

Each is scored on a scale. Likelihood runs from Highly unlikely (less than once per decade) through Possible to Frequent (more than 100 times a year). Severity, for a given dimension, runs from Very low to Extreme. For the veracity dimension, for instance, Very low severity is negligible hallucination, while Extreme is persuasive, dangerous output causing irreversible real-world harm.

Combining the two in a matrix yields an overall rating:

Table 6 Likelihood x Severity (illustrative)#
Likelihood \ Severity	Very low	Low	Moderate	Major	Extreme
Frequent	Low	Medium	High	Critical	Critical
Possible	Very Low	Low	Medium	High	Critical
Highly unlikely	Very Low	Very Low	Very Low	Low	High

Worked example: a medical triage assistant

For a symptom-checking assistant, a veracity failure (a confident but wrong suggestion) is Possible in likelihood and Major-to-Extreme in severity, landing it at High or Critical risk. That rating tells you to add human oversight, strong disclaimers, and tight guardrails before launch, the techniques of Chapter 4.

The NIST AI Risk Management Framework 1.0#

The likelihood-and-severity approach above comes from the NIST AI Risk Management Framework (AI RMF 1.0), a voluntary framework published by the U.S. National Institute of Standards and Technology in January 2023 to help organizations manage the risks of AI systems. It is worth knowing in its own right because it has become a common reference point for responsible-AI governance.

Characteristics of trustworthy AI#

The framework defines AI risk as a function of the likelihood of an event and the magnitude (severity) of its impact, and it organizes “trustworthiness” into seven characteristics. They map closely onto this module’s dimensions:

NIST trustworthiness characteristic	Related dimension in this book
Valid and reliable	Veracity and robustness (Chapter 3: Dimensions of Responsible AI)
Safe	Safety
Secure and resilient	Privacy and security; robustness
Accountable and transparent	Transparency; governance
Explainable and interpretable	Explainability
Privacy-enhanced	Privacy and security
Fair, with harmful bias managed	Fairness

The four core functions#

The AI RMF organizes practice into four functions, which align with the design-build-operate lifecycle above:

Function	What it covers
Govern	A culture of risk management: policies, accountability, roles, and oversight that cut across the other three functions.
Map	Establish context and identify risks: the use case, stakeholders, and where harms could arise (the “define and identify” steps above).
Measure	Analyze, assess, and track risks using quantitative and qualitative methods (the evaluation of Chapter 1: Evaluating LLMs and the likelihood x severity rating).
Manage	Prioritize and act on risks: allocate resources, apply mitigations (guardrails, oversight), and monitor over time.

Why it matters here

Using the AI RMF as scaffolding means the responsible-AI work in this module, evaluation, the dimensions, and the security-and-safety techniques, lines up with a recognized national framework, which is exactly what auditors, funders, and regulators increasingly expect. Consult the official framework (https://www.nist.gov/itl/ai-risk-management-framework) for the authoritative text; details and companion profiles are periodically updated.

In the news#

Responsible AI has moved from voluntary principle to emerging regulation, with frameworks such as the EU AI Act and the NIST AI Risk Management Framework shaping how organizations classify and govern AI by risk level. The dimensions in this chapter map closely onto these regimes, which is why treating responsible AI as a governance practice, rather than a feature, increasingly aligns with legal obligation, not just good intentions.

Key takeaways#

Responsible AI maximizes benefit and minimizes harm, defined through evolving dimensions and practiced organization-wide.
It spans the design-build-operate lifecycle; everyone is responsible.
Risk assessment is define -> identify and evaluate -> summarize, with risk quantified as likelihood x severity per the NIST framework.
Governance ties the practice together across teams.

Next, we examine each responsible-AI dimension in detail.