Liability and chargeback mechanics in agent-mediated transactions

Chargeback and liability regimes built on human intentionality at transaction origination have no clean answer when the originator is an AI agent operating under delegated authority.

Core Tensions

Regulatory attribution of intent. Reg E, Reg Z, and PSD3's conditional refund right each predicate liability allocation on a human who either authorized or failed to safeguard credentials; an AI agent acting within its delegated scope but making a commercially erroneous purchase satisfies neither the unauthorized-use definition nor the authorized-use safe harbor cleanly.
Chain-of-liability ambiguity. No scheme rule or statute currently designates which entity in the agent stack — consumer-principal, agent operator, LLM model provider, or orchestration platform — holds the statutory account-holder role. Burge [3] documents this pattern in emerging payment technologies: the absence of a public-law designation leaves loss routing to bespoke contractual indemnities that vary by deal and jurisdiction, a condition that persists until regulators or scheme rule-makers intervene with a mandatory floor.
Fraud controls misaligned with execution errors. Velocity checks, SCA exemption criteria, and ACP attestation standards were designed to detect bad-actor intrusion, not to evaluate whether an agent's in-scope purchase decision was commercially reasonable — a category those controls have no mechanism to assess.

How Liability Routes Fracture

Under Regulation E, a consumer's liability for an unauthorized electronic fund transfer turns on whether the consumer authorized the transfer and, if not, how quickly they reported the breach. The statute defines "unauthorized transfer" at 12 CFR § 1005.2(m) as a transfer initiated by a person other than the consumer without actual authority. An AI agent, however, acts with actual authority — the consumer delegated the buying task — yet the agent's commercially erroneous execution (wrong item, wrong vendor, wrong amount) is not what the consumer intended. That gap sits between "authorized" and "unauthorized" without a statutory home. The issuer cannot classify the dispute as fraud; the consumer cannot invoke the unauthorized-transfer liability ceiling; and the merchant, who delivered exactly what the agent ordered, has a facially valid transaction.

Regulation Z's billing-error procedures present a structurally similar fracture. Under 12 CFR § 1026.13, a cardholder may dispute a charge for goods or services not delivered as agreed, but the provision assumes the cardholder herself placed the order and can articulate the discrepancy against her own intent. When an agent places the order, the cardholder's intent exists only at the level of a delegated mandate — "buy the cheapest compatible ink cartridge" — not at the level of the specific transaction. Matching the mandate to the executed transaction requires interpreting agent behavior, not reading a receipt. Neither issuers nor card scheme dispute teams are currently equipped or contractually required to perform that analysis.

Scheme operating rules layer on a further problem. Chargeback reason codes map to defined triggering conditions: item not received, item significantly not as described, fraudulent transaction. A mistaken agent purchase may not satisfy any of these codes precisely, forcing an issuer to shoehorn the claim into the closest available code — a choice that distorts the evidentiary record and may itself constitute a misuse of the chargeback mechanism, exposing the issuer to acquirer rebuttal.

PSD3's proposed refund right — drawn from Article 59 of the draft Payment Services Regulation — applies specifically to payment transactions reported by the user as unauthorized; for those transactions, liability defaults to the PSP unless it can demonstrate user fraud or gross negligence. That track is distinct from consented-but-disputed transactions, where a shared-liability model applies. When the "user" is an AI agent, the unauthorized-versus-consented boundary dissolves: the agent's erroneous purchase was technically consented to at the mandate level but not at the transaction level. The PSP therefore cannot readily invoke the unauthorized-transaction refund right, yet the shared-liability track provides no clear mechanism for allocating loss across the agent stack either. Gross negligence, on whichever track it is invoked, becomes a question about operator configuration and model behavior — categories the regulation does not address and for which no defined behavioral benchmark exists [5].

SCA exemptions compound the fracture rather than resolve it. PSD2 and PSD3 built exemption criteria around risk-scoring the human session in which a payment is initiated: device fingerprint, behavioral biometrics, transaction history anchored to a human's pattern of life. An agent-initiated payment has none of those attributes. It may pass authentication at the credential layer — the agent holds a valid token — but the session-level risk signals that justify low-friction exemptions are absent or meaningless in a machine execution context. A PSP that applies a standard transaction-risk analysis exemption to an agent-initiated payment is extending a human-calibrated risk tolerance to a non-human originator, without a regulatory framework that either authorizes or constrains that extension. If the exempted transaction is later disputed, the PSP's SCA determination becomes part of the liability record without any settled standard against which to evaluate it.

UCC Articles 3 and 4, which govern negotiable instruments and bank deposits, assign liability through the properly-payable standard and forgery rules — both of which presuppose a human drawer. They offer no ready mechanism for a payment instruction that was technically authentic but commercially mistaken at the agent layer. UCC Article 4A, which governs electronic funds transfers, provides its own authenticity and error-correction framework: a payment order issued in compliance with a security procedure agreed upon by the parties is treated as authorized even if the customer did not actually transmit it, provided the bank acted in good faith. That security-procedure safe harbor addresses intrusion-based fraud adequately, but it does not address the agent-specific problem — an instruction that was authenticated within scope but commercially erroneous. Article 4A's error provisions (§ 4A-205) focus on duplicate orders, wrong beneficiaries, and transmission errors, not on the question of whether the originating system's commercial judgment was within its delegated mandate. The statute has no mechanism to reach that question.

Foundational Reading

Regulation E (Electronic Fund Transfer Act), 12 CFR Part 1005 — The primary U.S. consumer liability and dispute framework for electronic payments; read sections 1005.2(m) (unauthorized transfer definition), 1005.6 (liability), and 1005.11 (error resolution) with attention to the scope of "actual authority."
Regulation Z (Truth in Lending Act), 12 CFR Part 1026 — Sections 1026.12–1026.13 govern credit card unauthorized use and billing-error resolution procedures.
PSD3 / EU Payment Services Regulation (draft text, 2023–2024) — Article 59 on the PSP refund right for unauthorized transactions and the gross-negligence carve-out; compare the unauthorized-transaction track against the consented-but-disputed track, and review the SCA exemption framework for session-level authentication assumptions.
Visa and Mastercard Core Operating Regulations and Dispute Resolution Rules — Specifically the chargeback reason code taxonomies and the evidentiary standards for representment.
UCC Articles 3, 4, and 4A — Articles 3 and 4 for the foundational commercial-paper liability structure; Article 4A for the electronic funds transfer security-procedure safe harbor and error-correction provisions; read alongside Bepko's exposition for historical grounding [1].

The Operational Cascade

Issuer exposure is the most immediate operational consequence. When a dispute arrives for an agent-initiated transaction that does not satisfy a scheme chargeback reason code cleanly, the issuer must choose between absorbing the loss provisionally under error-resolution timelines, forcing the dispute into a mismatched reason code with litigation risk, or denying the claim and exposing itself to regulatory action if the regulator later classifies the event as an unauthorized transfer. All three paths carry balance-sheet and compliance costs that issuers cannot currently model because no jurisdiction has published a classification standard for agent-execution errors.

Processor and acquirer liability uncertainty propagates downstream from that issuer exposure. Acquirers who have contracted with merchants on the assumption that scheme dispute rules define the outer boundary of chargeback exposure will find that assumption untested when agent-initiated disputes arrive in volume. Merchants, for their part, face representment difficulty: to rebut a chargeback, the merchant must demonstrate that the transaction was authorized by the account holder, but the account holder's authorization was filtered through an agent whose decision logic is not in the merchant's possession and may not be auditable under current scheme evidence rules. The merchant delivered what was ordered; proving that the order was within the agent's valid scope requires access to the operator's audit trail — data the merchant has no right to compel.

At scale, these friction points stress chargeback monitoring thresholds. Card schemes enforce dispute-ratio limits on merchants and, in some programs, on acquirers. As AI agent transaction volumes grow, even a modest error rate at the agent layer — errors that do not map cleanly to fraud codes — could elevate dispute ratios into program-violation territory without any underlying merchant misconduct, triggering remediation requirements that were designed for fraud-prone merchants, not for a structural gap in dispute taxonomy [3].

Agent operators face their own exposure in this environment. Those with sophisticated legal counsel embed indemnity clauses that transfer risk to consumers or merchants; those without equivalent resources bear it directly on their own balance sheets. The asymmetry does not resolve toward a stable equilibrium: as agent transaction volumes increase, the parties carrying uncontracted residual risk will seek either regulatory intervention or exit from the agent-commerce stack, neither of which produces the operational predictability that issuer credit models require.

References

Uniform Commercial Code — Gerald L. Bepko (1984) — https://doi.org/https://doi.org/10.18060/2578
Apple Pay, Bitcoin, and Consumers: The ABCs of Future Public Payments Law — Mark Edwin Burge (2015) — https://openalex.org/W2193410071
Credit or Debit? Unauthorized Use and Consumer Liability Under Federal Consumer Protection Legislation — Daniel M. Mroz (1999) — https://openalex.org/W3014777452
Luottamuksesta hyväksikäyttöön: Maksupetosten oikeudelliset ja yhteiskunnalliset vaikutukset ja tie kohti vahvempaa suojaa — Taavitsainen, Nanne (2025) — https://openalex.org/W7115309322

The problem is not a missing chargeback reason code. It is the absence of foundational determinations on which any dispute mechanism depends: which party in the agent stack holds statutory account-holder status and therefore bears default liability; what evidentiary artifact — an audit log, a cryptographic authorization token, a scheme-recognized attestation — constitutes proof that an agent acted within its delegated scope; how regulators pre-classify the agent commercial error as a legal category distinct from both unauthorized use and ordinary contract breach; what operator configuration failures or model behaviors meet the gross-negligence threshold that PSD3 reserves as a PSP carve-out; and at what transaction volume structural agent-execution error rates begin pushing merchant and acquirer dispute ratios into scheme program-violation territory.

Without those determinations, loss currently distributes by contract negotiation strength rather than by regulatory design. Agent operators with sophisticated counsel embed indemnity clauses that push risk to consumers or merchants; those without equivalent resources carry it directly, a condition that does not stabilize as volumes grow. Card schemes face pressure to create reason codes that do not fit their fraud-architecture logic. Issuers hold provisional credit exposure against standards they cannot interpret. Regulators in the EU are moving first, with PSD3's PSP liability shift on unauthorized transactions creating a de facto pressure point, but the gross-negligence carve-out they rely on has no defined behavioral benchmark for machine conduct [4]. The jurisdictions that resolve these determinations — through regulatory guidance, scheme rule amendment, or new legislation — will define the operational conditions under which agent commerce can scale without generating cascading dispute-ratio violations and uncontained issuer credit exposure.

Context

The Assumption Layer

Every layer of the current dispute stack was constructed with a human decision-maker at origination. Regulation E's liability caps and error-resolution timelines were calibrated against the behavior of a consumer who either recognized an unauthorized charge on a statement or failed to safeguard a PIN. Regulation Z's billing-error mechanism assumes a cardholder who can compare what she ordered against what she received and articulate the difference to her issuer. PSD2 and its successor PSD3 built SCA exemptions around risk-scoring the human session in which a payment was initiated — device fingerprint, behavioral biometrics, transaction history anchored to a human's pattern of life. An agent-initiated transaction produces none of those session signals, which means the exemption criteria either fail to apply or apply on the basis of credential-layer authentication alone, stripped of the behavioral risk context that makes low-friction exemptions defensible.

Scheme operating rules followed the same architecture. Chargeback reason codes reference the "cardholder," the "account holder," and the "authorized user" — always a natural person whose intent at the moment of transaction is either verifiable through authentication data or presumed from behavioral context. The UCC's properly-payable standard and its forgery doctrine similarly center on whether a human drawer's signature or instruction was genuine [1].

This is not an oversight in the original design; it was a reasonable constraint given the transactional technology of each era. The constraint only becomes a structural liability gap when the originating decision-maker is an autonomous agent whose authorization derives from a delegated mandate rather than a moment of human intent at the point of transaction.

Counterpoint

The Case for Existing Coverage

The strongest argument against treating agent-mediated transaction disputes as a genuine regulatory gap is that the existing frameworks are more elastic than they appear. Authorization, under both Regulation E and card scheme rules, is ultimately a contractual question: the consumer agreed to terms of service with the agent operator, and those terms likely confer authority on the agent to transact within defined parameters. On that reading, a mistaken agent purchase is not a regulatory dispute at all — it is a contract claim against the operator, exactly as a mistaken instruction by a human personal shopper would be resolved under agency law rather than chargeback mechanics [2].

Similarly, equitable estoppel and ratification doctrines already exist to handle situations where a principal benefits from an agent's unauthorized act or fails to repudiate it promptly. If the consumer receives and retains the wrongly purchased goods, ratification may bar the chargeback entirely. Issuers can enforce this through their existing cardholder agreements without waiting for regulatory reclassification.

Proponents of this view also point to the historical pattern documented by Burge [2]: new payment technologies have repeatedly generated predictions of regulatory inadequacy that the market and existing contract law resolved without new mandatory public-law floors. The Electronic Funds Transfer Act of 1978 predated widespread debit card adoption; debit card unauthorized-use uncertainty was absorbed over the following decades primarily through scheme rule evolution and contractual indemnity structures, with regulatory amendments addressing other dimensions of the market rather than resolving the original liability ambiguity through direct statutory revision. Agent commerce may follow the same trajectory — at higher friction cost and more slowly, given the complexity of the multi-party stack, but without requiring the clean-break regulatory intervention the gap analysis implies.

Unresolved Vectors

Account-holder designation. Which party in the agent stack — consumer-principal, operator, LLM provider, or orchestration platform — holds the Reg E / Reg Z account-holder role, and must that designation be fixed contractually before the first transaction is initiated?
Evidentiary standard for agent authorization. Can a certified operator audit log, anchored to a cryptographic scope declaration, satisfy scheme evidentiary requirements as the functional analog of a signed receipt?
Error taxonomy. Does a commercially mistaken agent purchase constitute an authorization failure, or a contract breach against the operator — and does the answer change across Reg E, Reg Z, and PSD3's unauthorized-transaction track versus its consented-but-disputed track?
Gross-negligence benchmark. What operator configuration failures or model behaviors would meet PSD3's gross-negligence carve-out threshold, given that no behavioral standard for machine conduct currently exists [4]?
Chargeback threshold recalibration. At what agent transaction volume do structural agent-execution error rates begin elevating merchant or acquirer dispute ratios into scheme program-violation territory, and how should thresholds be adjusted in advance?