Agentic AI from principles to practice
A C-suite guide to capturing value without losing control.
In this third article, we look in more detail at agentic AI's ability to plan, act and adapt in live environments and the implications on its governance, particularly the need for this to move beyond static policy into operational control.
The shift from governing AI models in isolation to governing agentic system behaviour in an enterprise context has created a critical gap; governance centred on principles, policy and review now needs to be translated into controls across the full lifecycle of development and production, including important safeguards such as runtime controls.
Trust and confidence in agentic AI are created by the presence of visible, testable and enforceable controls in production. It means being clear about:
The twelve steps detailed below are intended to provide the operating disciplines needed to scale agentic AI with control.
If responsible AI sets the intent, Systems Development Lifecycle Controls (SDLC) embed that intent into design, build, testing, release and change.
AgentOps - the emerging cross-functional operational discipline of running agents safely once they are live – sustains it in production through visible behaviour, enforceable boundaries, traceable actions, controlled change and rehearsed intervention.
Taken together, the twelve disciplines below describe the wider control environment needed for that capability to work in practice.
The twelve steps are the practical operating disciplines that translate Responsible AI intent into day-to-day control.
They should be applied proportionately depending on the agent’s level of autonomy, authority, business criticality and potential impact if something goes wrong.
Organisations should also consider how humans interact with agents in practice, ensuring users understand their limitations to prevent over-reliance.
These steps are what turn governance from policy into control in live operation.
The example below shows how some of the twelve disciplines work in practice.
It illustrates the difference between giving an agent freedom to act and putting it to work within clear boundaries, ownership and control.
Imagine an AI procurement agent working inside an ERP system. Its role is to raise purchase orders for approved suppliers within agreed limits.
Without governance | With governance | |
|---|---|---|
1 | The agent’s role is not controlled well, so over time it starts doing more than intended and begins to influence supplier and approval decisions that were meant to stay with people. | The agent has a clearly defined role and operates within agreed boundaries, with supplier choices and approval decisions remaining where human judgement is needed. |
2 | The method by which the agent is allowed to act is not properly secured, so delegation across systems or tools may occur without strong authentication, authorisation or traceability, increasing the risk of misuse or unsafe action. | Agent delegation is security-protected, with strong authentication, explicit authorisation and clear traceability across tools, systems and other agents, so the organisation stays in control of how it is allowed to act. |
3 | It relies on hallucinated content for context, creating the risk of poor judgement. | Governance should assume that unsupported conclusions, fabricated intermediate steps or incorrect inferences can arise through hallucination and should require proportionate validation and human review before those outputs are relied on or acted upon. |
4 | There is no clear point at which a person must step in, so the agent can break a larger purchase into smaller ones and move ahead without appropriate review. | Clear thresholds determine when the agent can proceed, when activity must pause and when a manager needs to review or approve. |
5 | A key governance challenge is that organisations may not have the monitoring, alerting or observability needed to detect problems early, so they may not know anything is going wrong until harm has already occurred. | Monitoring, alerting and observability are built into live operation, so unusual behaviour, policy breaches or emerging signs of drift are detected early and the organisation can intervene before harm spreads. |
This simplified example shows how several of the twelve disciplines work together in practice:
We have developed an agentic AI governance maturity assessment to help senior leaders judge whether their organisation is ready to put higher-autonomy and impact agents to work with confidence. It translates the twelve operating disciplines into five practical areas for which to assess maturity.
The aim is not to be at the highest maturity level overall but whether your current level is strong enough for the agents you are already deploying or planning next. A low-risk assistant may be workable at a lower level of maturity. An agent acting across live systems or handling sensitive data will require a much higher one. The gap between the maturity you have and the maturity your use cases demand becomes the leadership agenda.
| Dimension | Level 1 Ad hoc | Level 2 Basic | Level 3 Managed | Level 4 Embedded | Level 5 Leading |
|---|---|---|---|---|---|
| Scope and guardrails | No one has clearly defined what the agent is there to do, what it must not do or where human approval is required. | Some boundaries exist for certain use cases, but they are incomplete, inconsistently applied and not tied clearly to risk. | A standard approach defines the agent’s role, permitted actions, boundaries, escalation points and where human approval is required. | Scope and autonomy limits are risk-tiered, formally owned and approved before go-live and when material changes are made. | Scope, autonomy boundaries and risk classification are actively maintained, monitored and updated as the agent, its environment or its role changes. |
| Identity, data and trust | The agent can access far more than it needs and there are no reliable controls over identity, permissions, source quality, memory or data leakage.
| Some access controls exist and key sources have been identified, but identity, delegation, grounding and retention are only partly understood. | The agent has a distinct identity, access is limited to what is needed, trusted sources are defined and rules exist for retrieval, memory, retention and cross-context data handling. | Identity, access, grounding, retention and information flows are reviewed regularly, with controls to reduce excessive privilege, stale data and cross-context leakage. | Identity, access and information quality are continuously monitored, with automated detection and review of misuse, leakage, drift, anomaly or grounding failure. |
| Testing, live safeguards and assurance | Testing is absent or informal and, once live, there is little to stop the agent acting outside expectations.
| Some testing and safeguards exist, but they are patchy, inconsistent and not clearly linked to risk. | Structured testing covers safety, reliability, misuse and business performance before launch, and key safeguards and approval thresholds are in place before go-live. | Controls operate alongside the agent in production, with ongoing testing, monitoring and triggers for changing risk, behaviour or operating conditions. | Continuous testing, internal challenge, drift detection and live assurance operate in production, and safeguards can be adjusted rapidly as risk or context changes. |
| Accountability and traceability | No one clearly owns the agent and there is no agreed process for escalation, intervention or remediation if something goes wrong. | A project or technical owner is named, but responsibilities are unclear and oversight largely falls away after deployment. | Clear business and operational owners are in place, with defined responsibilities for oversight, approvals, intervention, traceability and change. | Escalation routes, review forums, traceability, governance reporting and incident handling are established, used and understood in practice. | Leadership has a live view of agent ownership, risk and control status across the organisation, supported by governance reporting, a maintained risk register and strong traceability across the estate. |
| Change and lifecycle | Governance stops at launch and the agent is treated largely as set-and-forget. | Some reviews take place, but changes to prompts, models, tools or workflows often go unmanaged. | Structured review, change approval and periodic recertification are part of how the agent is run. | Reviews are triggered by time, incidents and material change, with clear re-approval points and defined retirement steps. | Change, recertification, retirement and evidence retention are tightly managed across the full lifecycle, including dependencies on third parties and connected systems. |
Agentic AI offers organisations a significant opportunity to improve speed, productivity and decision-making, but it also changes the nature of the governance challenge. Once systems can plan, act and adapt in live environments, trust can no longer rest on policy alone. It depends on whether an organisation can define clear boundaries, maintain visibility, intervene quickly and sustain control as these systems evolve.
The organisations that succeed with agentic AI will not be those that move fastest without constraint, but those that build confidence alongside capability. Responsible AI principles remain essential, but they must now be embedded through disciplined design, testing, runtime monitoring, change control and clear accountability in live operation. That is what turns ambition into something scalable, defensible and trusted.
We bring together multidisciplinary teams spanning AI, data governance, cyber security, technology risk and legal to help organisations move from AI ambition to controlled, real-world deployment through pragmatic, right-sized governance that builds confidence and trust.
This website uses cookies.
Some of these cookies are necessary, while others help us analyse our traffic, serve advertising and deliver customised experiences for you.
For more information on the cookies we use, please refer to our Privacy Policy.
This website cannot function properly without these cookies.
Analytical cookies help us enhance our website by collecting information on its usage.
We use marketing cookies to increase the relevancy of our advertising campaigns.