Insights
/
Cold Acquisition That Works in 2026
Cold calling in 2026 is no longer about volume but real connection. Learn how to combine email and LinkedIn to personalize outreach, build trust, and start meaningful B2B conversations that lead to results.
/
AUTHOR

Ralf Klein

It is January, a tenant has no heat, and the first thing that reads the report is an AI agent. The message arrives on WhatsApp at 9pm: "radiator dead, freezing in here, baby in the house." The agent classifies it as a routine maintenance request, drops it in the standard queue, and books a technician for the next open slot three days out. No heat in winter is not a three day problem. It is a habitability emergency on a 24 hour clock, and the agent just missed it. When the tenant's lawyer later asks who decided that ticket could wait, the answer the model vendor offers is the one printed in its terms of service. Not us.
Property operators are handing these tickets to AI faster than they are working out who carries the risk when one goes wrong. Adoption among property managers has roughly tripled in a year, from about one in five companies in 2024 to three in five in 2025, according to Haven's 2026 property management AI statistics. Yet the same data shows only 8 percent have fully automated even a single workflow. The gap between "we use AI" and "the agent closes the loop without a human" is exactly where the dangerous tickets live, and most teams have not decided who is accountable inside that gap.
Why AI Agent Liability Shifted to the Deployer in 2026
The assumption baked into most buying decisions is that if the agent makes the call, the company that built the agent owns the consequences. Three regulatory moves in 2026 say the opposite. The organization that deploys the agent is the one on the hook.
California's Assembly Bill 316, in effect since January 1 2026, removes the defense everyone assumed they could fall back on. A defendant can no longer argue that the AI acted autonomously to escape liability for the harm it caused. The law reaches the full supply chain, the builder, the integrator, and most importantly the business that deployed and used the system. "The AI did it" is no longer a sentence a court will accept.
In Europe, the binding date for high risk obligations under the EU AI Act is August 2 2026. Article 26 puts specific duties on deployers, not just providers: assign human oversight to people with the competence and authority to override the system, monitor operation, and keep logs for at least six months. Non compliance on high risk systems carries penalties up to 15 million euros or 3 percent of global turnover. The override has to be real and accessible, not a setting buried where no operator can reach it in time.
Singapore went further and named the principle directly. Its Model AI Governance Framework for Agentic AI, launched in January 2026 and updated in May, is built around deployer accountability: organizations and the humans who oversee them remain accountable for the decisions and actions of their agents. The common thread across all three is a standard the legal community now calls reasonable oversight. You are liable unless you can prove you had monitoring, auditing, and a human decision path in place.
For a Dutch or European operator, the EU date is the one that matters most, but the direction of travel is unmistakable across jurisdictions. Regulators are converging on the same answer: the entity that put the agent into production owns the outcome. Waiting for perfect legal clarity is itself a choice, and it is the one that leaves you least prepared when the first contested ticket arrives.
Habitability Tickets Are the Highest Exposure Flow in Your Queue
Not every ticket carries the same legal weight, and that is the point most automation roadmaps miss. A jammed dishwasher and a gas smell are not the same risk class. Habitability and safety tickets sit at the top of the exposure ladder because the law already attaches a clock and a duty to them.
The implied warranty of habitability requires a rental to stay livable for the whole tenancy, with working heat, water, and sanitation, as the Legal Aid Society sets out for tenants. Emergencies like no heat, a gas leak, or a major water leak are widely held to a 24 hour response expectation, with less urgent repairs running from a few days to 30 days, per common landlord repair timelines. When an agent misreads an emergency as routine, it does not just create a slow ticket. It puts the operator in breach of a legal duty with a documented timestamp showing exactly when the clock started and who, or what, let it run out.
This is why an agent that answers is not the same as an agent that is safe to deploy unsupervised. Answering the tenant is the easy 10 percent. The other 90 percent, classifying urgency correctly, creating the work order, dispatching against a real 24 hour window, and escalating when the stakes are high, is where both the operational value and the legal exposure actually live. We have argued before that the ROI in property management lives in the workflow, not the feature. The liability lives there too.
Reasonable Oversight Is the Standard You Will Be Judged Against
Reasonable oversight sounds like a brake on automation. Read closely, it is a design spec. The emerging agentic liability standard rewards the deployer who can produce a defensible record: red team results, audit logs, escalation history, and evidence that a human held authority over the decisions that mattered. The organizations that lose are the ones that cannot show any of it.
The mistake is treating this as all or nothing, either a human checks everything or the agent runs wild. The frameworks point to a graduated model instead. Calibrate the level of human involvement to the risk level of the action. Routine, low exposure tickets can run autonomously with monitoring and exception handling. High exposure tickets, the habitability and safety calls, route through mandatory human authorization before the agent acts. This is not a compliance tax bolted on after launch. It is the architecture that lets you automate aggressively everywhere it is safe to do so, because you have drawn a clear line around where it is not.
Concretely, reasonable oversight on a maintenance flow means four things are always on. Every classification and action is logged with a timestamp and a reason. Urgency scoring is auditable, so you can show why a ticket was ranked the way it was. A human in the loop sits on the high exposure branch, not on every ticket. And the audit trail is complete enough that, if a regulator or a tenant's lawyer asks, you can reconstruct the decision in full. Operational AI agents that take real actions in your domain system, rather than just replying, are exactly the place this discipline has to be designed in from the start.
Classify Tickets by Legal Exposure Before You Automate
The practical move is a triage map drawn by legal exposure, not by ticket volume. Before automating anything, sort your ticket types into exposure tiers. Tier one is safety and habitability: no heat, gas, water ingress, electrical hazard, anything with a statutory clock or a path to harm. These get mandatory human authorization and the fullest audit trail. Tier two is essential services with a softer deadline, hot water, a failed appliance, a plumbing fault, where the agent can act but a human reviews exceptions. Tier three is cosmetic and low stakes, where the agent runs end to end.
Done well, this is not slower automation, it is defensible automation. In our own property management work, the highest volume of routine intake and dispatch runs through the agent, while the genuinely risky calls surface to a person with the full context attached. That is the pattern behind our property management ticket automation: deep integration into the domain system so the agent can create the work order and push status, with human in the loop reserved for the tickets where being wrong is expensive. The exposure map is what makes the difference between an agent you can defend and one you cannot.
If you are about to point a maintenance agent at your tenant inbox, the first artifact is not a prompt. It is that exposure map, plus the audit trail and escalation path that prove reasonable oversight. Build those before the agent touches a live ticket, because in 2026 they are the difference between a contained incident and an indefensible one.
The Practical Takeaway
Read your model vendor's terms of service once, carefully. It disclaims liability for the output. That clause does not move the risk to the vendor. It leaves the risk with you, and now California, Brussels, and Singapore agree. The agent can read the ticket, score it, and resolve most of the queue without you. What it cannot do is carry the accountability for the one call it gets wrong on a freezing night. That stays with the deployer. Design the oversight before the agent, not after the incident, and the same autonomy that creates the exposure becomes the thing you can actually stand behind.
/
BLOG
Other insights

Insights
/
Jun 25, 2026
Bloxs and AI: Why Tenant Communication Is the Biggest Untapped Win in Property Management

Insights
/
Apr 13, 2026
Anthropic Built an AI That Found 3,000 Zero-Day Vulnerabilities. Then They Refused to Release It.

Insights
/
Apr 10, 2026