AI safety is about math, not ethics
Everyone's debating AI consciousness while missing the real safety issue: most AI systems are mathematically unsound. Here's how binary networks solve this.
The ethics distraction
Walk into any AI safety conference, and you'll hear passionate debates about consciousness, sentience, and moral frameworks. Should AI have rights? How do we ensure it shares our values? What happens when it becomes smarter than us?
These are interesting philosophical questions. They're also completely missing the point.
The real AI safety crisis isn't about ethics. It's about mathematics. And while everyone's worried about hypothetical superintelligence, current AI systems are failing for much more mundane reasons: they're mathematically broken.
The good news? This is a problem we can actually solve.
The real safety crisis
Here's what AI safety actually looks like in 2025: A medical diagnosis system that's right 95% of the time in testing but only 73% in production. A financial trading algorithm that works perfectly until market conditions shift slightly, then loses millions. An autonomous vehicle that misclassifies a stop sign as a speed limit sign because of unusual lighting.
These aren't edge cases. These are systemic failures caused by mathematical instability in the underlying neural networks.
Every floating-point operation introduces rounding errors. Every layer compounds those errors. Every decision is built on increasingly shaky mathematical foundations. And we're deploying these systems in critical applications while debating whether they might become conscious.
It's like worrying about whether your car has feelings while ignoring that the brakes don't work reliably.
Why ethics can't save us
The AI ethics crowd has good intentions. They want to ensure AI systems are fair, transparent, accountable. They create frameworks, guidelines, principles.
But you can't ethics your way out of a math problem.
A neural network that produces different results on identical inputs isn't an ethics issue. It's a mathematical instability issue. A system that hallucinates confident-sounding nonsense isn't a values alignment problem. It's a limitation of pattern matching.
Ethical frameworks assume the system works correctly in the first place. They're about choosing the right action. But when the system can't reliably execute any action, ethics is irrelevant.
This is why we keep seeing AI failures despite all the ethics committees and safety guidelines. We're treating symptoms while ignoring the disease.
The formal verification solution
Computer science has a field dedicated to proving systems work correctly: formal methods. Mathematical techniques that rigorously verify software behavior. Prove, not test. Guarantee, not estimate.
Formal verification has been used for decades in critical systems: aircraft control software, nuclear reactor management, spacecraft navigation. These systems need mathematical certainty, not statistical confidence.
Why isn't AI using formal verification? Because floating-point neural networks are mathematically intractable to verify.
You can't prove properties of a system when the system itself is built on approximate arithmetic. Floating-point introduces uncertainty at every step. That uncertainty propagates. Compounds. Becomes impossible to reason about formally.
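Here's a two-minute illustration of the compounding, in plain Python. This is standard IEEE 754 double-precision behavior, nothing framework-specific:

```python
# Adding 0.1 to an accumulator 1,000 times in double precision does not
# give exactly 100.0: each addition rounds, and the rounding errors compound.
total = 0.0
for _ in range(1000):
    total += 0.1
print(total)           # slightly off from 100.0
print(total == 100.0)  # False

# The same accumulation in exact integer arithmetic (counting tenths) is exact.
tenths = sum(1 for _ in range(1000))
print(tenths == 1000)  # True: 100.0 represented exactly as 1,000 tenths
```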
This isn't a tooling problem. It's a fundamental incompatibility between the mathematics of neural networks and the mathematics of formal verification.
Binary networks: provably correct AI
Binary neural networks change the equation entirely.
Instead of floating-point approximations, binary networks use discrete operations. +1 or -1. True or false. Exact arithmetic with no rounding errors.
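For the curious, here's what that looks like in code. This is the XNOR-and-popcount formulation commonly used for binarized dot products; a minimal sketch for illustration, not Dweve Core's actual kernels:

```python
# Binary dot product via XNOR and popcount: activations and weights are ±1,
# stored as bits (0 ↔ -1, 1 ↔ +1). Every step is exact integer arithmetic.

def binary_dot(a_bits: int, w_bits: int, n: int) -> int:
    # XNOR marks positions where activation and weight agree (+1·+1 or -1·-1).
    agreements = bin(~(a_bits ^ w_bits) & ((1 << n) - 1)).count("1")
    return 2 * agreements - n          # equals sum(a_i * w_i) over ±1 values

# Sanity check against the plain ±1 formulation.
a = [+1, -1, +1, +1]
w = [-1, -1, +1, -1]
a_bits = sum((1 << i) for i, v in enumerate(a) if v == +1)
w_bits = sum((1 << i) for i, v in enumerate(w) if v == +1)
assert binary_dot(a_bits, w_bits, len(a)) == sum(x * y for x, y in zip(a, w))
```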
This makes them amenable to formal verification. You can actually prove properties about binary network behavior. Mathematically guarantee certain outcomes. Create AI systems with the same rigor as aircraft control software.
At Dweve, we built our entire platform on this principle. Core provides the binary framework. Loom implements constraint-based reasoning with provable properties. Every operation is mathematically exact. Every decision is traceable.
This isn't just more reliable. It's fundamentally safer. Safety through mathematical rigor, not through ethical guidelines.
Constraints as safety guardrails
Here's another advantage of binary networks: they work with constraints, not probabilities.
A constraint is a hard rule. "This value must be positive." "This output must satisfy these conditions." Binary networks can incorporate constraints directly into their architecture.
This means safety requirements become mathematical constraints, not post-processing filters. The system literally can't produce outputs that violate constraints. It's mathematically impossible, not just unlikely.
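A minimal sketch of the idea (the rule, scores, and candidate outputs below are hypothetical): the decoder searches only the constraint-satisfying part of the output space, so a violating output is never even a candidate.

```python
# The output space is defined by the constraint itself: the decoder can only
# ever select from outputs that satisfy it, so a violating output is unreachable.

CANDIDATE_OUTPUTS = ["approve", "reject", "escalate"]

def constraint(output: str, amount: float) -> bool:
    # Hypothetical hard rule: amounts above the limit may never be auto-approved.
    return not (output == "approve" and amount > 10_000)

def decide(scores: dict, amount: float) -> str:
    feasible = [o for o in CANDIDATE_OUTPUTS if constraint(o, amount)]
    return max(feasible, key=lambda o: scores[o])   # argmax over the feasible set only

print(decide({"approve": 0.9, "reject": 0.02, "escalate": 0.08}, amount=50_000))
# -> "escalate": "approve" was outside the decoder's search space, not filtered after the fact
```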
Compare this to traditional neural networks, where safety is an afterthought. Train the model, then add guardrails. Hope the guardrails catch problems. Deal with failures when they slip through.
Constraint-based AI builds safety into the mathematics. It's the difference between a car with good brakes and a car that physically can't exceed safe speeds.
The alignment problem (actually solved)
The AI alignment problem asks: how do we ensure AI systems do what we want?
The current approach: train on human feedback, add more examples, hope the statistical patterns capture human values. It's fundamentally probabilistic. Fundamentally uncertain.
Binary networks with constraint-based reasoning offer a different approach: specify what you want mathematically. The system must satisfy those constraints. Not "usually" or "with 99.9% confidence." Must satisfy. Mathematically guaranteed.
This doesn't solve philosophical alignment. If you specify the wrong constraints, you get the wrong behavior. But it solves technical alignment. If you can formalize what you want, the system will do exactly that. No drift. No unexpected generalization. No emergent misalignment.
The hard part shifts from "how do we make it reliable" to "how do we specify what we want." That's a much better problem to have.
Determinism is safety
One of the most underappreciated safety features of binary networks: they're deterministic.
Same input always produces same output. Run the system a million times, get identical results. This seems basic, but it's profound for safety.
Testing actually means something. If a test passes, the same input will always pass. You can certify behavior. Build trust through reproducibility.
Floating-point networks don't have this. Same input can produce different outputs depending on hardware, software versions, even the order of operations. Testing gives you a statistical sample, not a guarantee.
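The classic demonstration takes three lines (again, standard IEEE 754 behavior):

```python
# Floating-point addition is not associative: regrouping the same numbers
# changes the result, which is why reduction order across hardware matters.
left = (0.1 + 0.2) + 0.3    # 0.6000000000000001
right = 0.1 + (0.2 + 0.3)   # 0.6
print(left == right)        # False

# Discrete ±1 arithmetic is exact, so any grouping gives the same answer.
xs = [+1, -1, +1]
print(((xs[0] + xs[1]) + xs[2]) == (xs[0] + (xs[1] + xs[2])))  # True
```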
For critical systems, determinism is safety. You need to know exactly what the system will do, every time, in every circumstance. Binary networks provide this. Floating-point networks fundamentally can't.
Interpretability through constraints
Everyone wants interpretable AI. If we can't understand why a system made a decision, how can we trust it?
The problem with floating-point neural networks: they're black boxes. Billions of parameters, complex interactions, no clear decision path. Even the researchers who built them can't explain specific outputs.
Binary networks with constraint-based reasoning are inherently more interpretable. The system checks constraints. You can see which constraints were satisfied, which weren't, how the decision followed from the constraints.
It's not perfect transparency. Complex systems are still complex. But it's the difference between "the model assigned probability 0.87 based on learned patterns" and "the decision satisfied constraints A, B, and C, but violated constraint D, so output X was chosen."
One is opaque statistics. The other is logical reasoning you can follow and verify.
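Here's what such a trace can look like in practice. The constraint names, thresholds, and patient fields are hypothetical, mirroring the A-through-D example above:

```python
# A decision trace over named constraints: the explanation is the list of
# checks themselves, not a post-hoc rationalisation of learned weights.

def explain(decision: dict, constraints: dict) -> dict:
    return {name: check(decision) for name, check in constraints.items()}

constraints = {
    "A: dosage within approved range": lambda d: d["dosage_mg"] <= 40,
    "B: no contraindicated medication": lambda d: "warfarin" not in d["current_meds"],
    "C: patient age above threshold":   lambda d: d["age"] >= 18,
    "D: renal function adequate":       lambda d: d["egfr"] >= 60,
}

decision = {"dosage_mg": 20, "current_meds": ["metformin"], "age": 54, "egfr": 45}
print(explain(decision, constraints))
# A, B and C map to True; D maps to False: constraint D was violated
```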
Safety through architecture
The AI safety community spends enormous effort on post-hoc safety measures. Alignment training, safety fine-tuning, output filtering, human oversight.
These are band-aids on fundamentally unsafe architectures. You're trying to make an unstable system stable through external controls.
Binary neural networks represent a different paradigm: safety through architecture. The mathematical foundations are stable. The operations are exact. The constraints are built-in. Safety isn't added on top; it's integral to the design.
Dweve Core's architecture demonstrates this principle. 1,930 algorithms, all mathematically rigorous. 415 primitives, 500 kernels, 191 layers, 674 higher-level algorithms. Each one designed for stability and verifiability.
Loom 456 builds on this foundation with 456 specialized experts, each handling specific types of reasoning. Sparse activation means only relevant experts engage. Constraint-based logic means outputs must satisfy formal requirements.
This is AI safety at the architectural level, not the policy level.
The European advantage
Europe has strict regulations around AI safety. GDPR, the AI Act, data protection laws. These create compliance burdens for systems that can't guarantee behavior.
But they create opportunities for systems that can.
Binary neural networks with formal verification can actually comply with regulatory requirements. Prove fairness. Demonstrate non-discrimination. Guarantee data handling. Show auditability.
Traditional neural networks can't do this. They can show statistical properties, provide examples, offer probabilistic assurances. But they can't prove anything mathematically.
This means European AI companies using binary networks have a regulatory advantage. They can certify safety in ways that floating-point systems simply can't match.
Compliance becomes a competitive advantage rather than a burden.
European regulatory requirements (why math matters legally)
EU AI Act Article 11 requires technical documentation demonstrating compliance with safety requirements. Article 15 demands accuracy, robustness, and cybersecurity measures. These requirements create challenges for systems whose behavior cannot be formally proven.
Certification challenges for safety-critical AI: German certification bodies like TÜV require formal specifications for AI in critical applications. Statistical test results ("99% accuracy") provide different assurances than mathematical proofs of constraint satisfaction. Systems that can provide formal guarantees face smoother certification paths than those relying solely on empirical validation.
Medical Device Regulation (MDR): AI-based diagnostics requiring CE marking must demonstrate safety through rigorous methodology. The MDR's requirements for predictable, verifiable behavior prove challenging for neural networks with inherent stochasticity. Systems offering deterministic guarantees align better with certification requirements designed for medical devices where safety is paramount.
Aerospace safety standards: DO-178C certification for safety-critical avionics software, particularly Level A (where failure has catastrophic consequences), requires formal methods proving correctness. Traditional neural networks' probabilistic nature conflicts fundamentally with DO-178C requirements. This creates barriers for AI deployment in flight-critical systems unless alternative architectures with formal verification capabilities are employed.
Financial regulation: MiFID II requires algorithmic trading systems to demonstrate controls preventing market manipulation. Proving the absence of specific behaviors mathematically differs substantially from showing low empirical occurrence rates. Systems with formal constraint specifications can provide stronger compliance arguments than those where behavior emerges from statistical learning alone.
How formal verification actually works
Formal verification applies mathematical proof techniques to guarantee AI system properties.
Constraint encoding approach: Consider a medical diagnosis AI that must never recommend treatments contraindicated for patient medications. Traditional approach: train the model, test extensively, hope it learns the constraint, add safety filters. Constraint-based approach: encode the requirement mathematically as a hard constraint. The system's solution space explicitly excludes contraindicated combinations—not 99.99% safe, but mathematically impossible to violate.
Automotive safety requirements: ISO 26262 functional safety standard for automotive systems requires proving hazard mitigation. The difference between "detected 99.8% of pedestrians in testing" versus "can prove detection for all pedestrians meeting visibility criteria X within latency Y" represents fundamentally different assurance levels. The former is empirical evidence; the latter is mathematical proof. ASIL-D certification (highest automotive safety integrity level) demands proof-level assurances that statistical testing alone cannot provide.
Industrial automation standards: IEC 61508 requires Safety Integrity Level (SIL) 3 or 4 for critical industrial systems. SIL 4 demands demonstrating <10⁻⁸ probability of dangerous failure per hour. Traditional machine learning's inherent stochasticity prevents formal guarantees at this level. Systems requiring SIL 4 certification need mathematical proofs of failure bounds—verification techniques that apply to deterministic constraint-based systems but not to probabilistic neural networks.
Commercial implications of safety verification
Mathematical safety verification creates commercial dynamics beyond regulatory compliance.
Procurement and market access: European public sector procurement increasingly requires demonstrable AI safety certification for high-risk applications. Systems that cannot provide formal safety guarantees face exclusion from tenders regardless of empirical performance. Market access becomes determined by ability to provide mathematical proofs, not just impressive test results.
Insurance and liability considerations: Actuarial assessment of AI system risks proves challenging when behaviour cannot be formally proven. Insurance coverage for critical applications—medical diagnostics, autonomous vehicles, industrial automation—increasingly requires systems to demonstrate formal safety properties. This creates a divide: systems with mathematical guarantees become insurable; purely statistical systems face coverage difficulties or prohibitive premiums.
Certification timelines: A counterintuitive pattern emerges: systems with formal verification can achieve faster regulatory approval than those relying on extensive empirical testing. Formal proof provides deterministic certification paths—prove constraint satisfaction, receive approval. Empirical approaches face iterative testing cycles and regulatory questions about edge cases that statistical validation cannot definitively answer. Mathematical certainty can accelerate rather than delay deployment.
Customer trust dynamics: European enterprise customers increasingly demand explainable AI, particularly in B2B contexts. "Why did the system make this decision?" evolves from nice-to-have to deal-breaker. Systems with constraint-based reasoning can provide logical explanations; black-box neural networks cannot. Trust correlates with understandability, and mathematics enables understanding in ways that learned statistical patterns do not.
Technical implementation: how constraints guarantee safety
The mechanics of constraint-based safety deserve explanation. How exactly does mathematics prevent AI failures?
Constraint encoding: Safety requirements get translated into mathematical constraints before training. Not "the model should avoid X"—that's wishful thinking. "The output space excludes X"—that's mathematics. Medical diagnosis example: treatment T contraindicated with medication M becomes constraint C: ¬(recommend(T) ∧ patient_takes(M)). System literally cannot output solutions violating C. Solution space defined by constraints. Every possible output must satisfy all constraints. Impossible outputs aren't unlikely—they're mathematically excluded.
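A minimal sketch of that encoding, using the z3-solver Python package purely for illustration (it isn't necessarily the tooling Dweve uses):

```python
from z3 import Bools, Solver, And, Not, unsat

recommend_T, patient_takes_M = Bools("recommend_T patient_takes_M")

# Constraint C from the text: ¬(recommend(T) ∧ patient_takes(M)),
# imposed on every output the system is allowed to produce.
C = Not(And(recommend_T, patient_takes_M))

s = Solver()
s.add(C)                                   # the system's permitted output space
s.add(And(recommend_T, patient_takes_M))   # a hypothetical violation
assert s.check() == unsat                  # no violating output exists: a proof, not a test
print("Constraint violation is unsatisfiable: mathematically impossible.")
```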
Verification process: After training, formal verification tools prove constraint satisfaction. Model checking, theorem proving, satisfiability solving—techniques from formal methods. For binary networks: tractable computation. For floating-point networks: intractable. Verification produces mathematical proof: "For all valid inputs I, all outputs O satisfy constraints C." Not statistical claim. Universal quantification over input space. European regulators understand the difference. One is evidence. The other is proof.
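What makes this tractable is that binary inputs form a finite, exact space, so universal claims can actually be checked. The toy below proves a property of a three-input binary neuron by checking every possible input; production verifiers use SAT/SMT solvers and model checkers rather than brute force, which only scales for tiny input spaces like this one.

```python
from itertools import product

# A tiny binary "network": 3 inputs in {-1,+1}, weights fixed to +1, sign activation.
WEIGHTS = (1, 1, 1)

def binary_neuron(x):
    s = sum(w * xi for w, xi in zip(WEIGHTS, x))
    return 1 if s > 0 else -1

# Property to verify: the neuron fires (+1) exactly when a majority of inputs are +1.
def property_holds(x):
    majority = sum(1 for xi in x if xi == 1) >= 2
    return binary_neuron(x) == (1 if majority else -1)

# Universal quantification by enumeration: every possible input is checked,
# not a sampled test set.
all_inputs = list(product((-1, 1), repeat=3))
assert all(property_holds(x) for x in all_inputs)
print(f"Property proven for all {len(all_inputs)} possible inputs.")
```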
Runtime guarantees: Constraints don't just constrain training—they constrain every inference. Each decision goes through constraint checker. Output proposed, constraints verified, only compliant outputs allowed. Adds latency? Minimal—binary operations are fast. Adds safety? Absolute—mathematical impossibility of constraint violation. Cost-benefit analysis obvious: microseconds of checking versus catastrophic failures from unconstrained outputs.
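A sketch of such a gate (the model, constraint, and field names are placeholders):

```python
class ConstraintViolation(Exception):
    """Raised when a proposed output fails a hard constraint."""

def guarded_inference(model, inputs, constraints):
    proposal = model(inputs)                     # the network proposes an output
    for name, check in constraints.items():      # every constraint is verified
        if not check(inputs, proposal):
            raise ConstraintViolation(f"output rejected: violates {name!r}")
    return proposal                              # only compliant outputs are released

# Hypothetical usage: a dosage model gated by a hard upper bound.
model = lambda inputs: {"dosage_mg": inputs["weight_kg"] * 0.6}
constraints = {"max_dosage": lambda i, out: out["dosage_mg"] <= 40}
print(guarded_inference(model, {"weight_kg": 60}, constraints))   # passes: 36 mg
```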
Compositional safety: Multiple constraints compose mathematically. Safety constraint S1 plus fairness constraint F1 plus performance constraint P1: system must satisfy S1 ∧ F1 ∧ P1 simultaneously. Traditional approaches: train for safety, retrain for fairness, hope performance doesn't degrade. Constraint-based: specify all requirements upfront, find solution satisfying conjunction. Doesn't always exist—sometimes constraints conflict. But discovering impossibility during design beats discovering it during deployment. Mathematics forces honesty about trade-offs.
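A toy feasibility check over a conjunction of requirements; the constraints here are deliberately conflicting so the conflict surfaces at design time rather than in production:

```python
# Composing constraints is conjunction: a candidate must satisfy all of them.
candidates = range(0, 101)                  # e.g. a percentage allocation
S1 = lambda x: x <= 30                      # safety: never allocate more than 30
F1 = lambda x: x >= 50                      # fairness: never allocate less than 50
P1 = lambda x: x % 10 == 0                  # performance: allocate in steps of 10

feasible = [x for x in candidates if S1(x) and F1(x) and P1(x)]
if not feasible:
    print("Requirements S1 ∧ F1 ∧ P1 are unsatisfiable: revise the specification.")
```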
Failure case analysis: When constraint-based systems fail, the failure mode is fundamentally different. Traditional neural networks: silent failures, plausible but wrong outputs, no indication of uncertainty. Constraint-based systems: explicit constraint violation detection. System recognizes it cannot satisfy all constraints, refuses output, reports which constraint failed. Defensive failure—system knows it doesn't know. Medical diagnosis example: traditional system might output diagnosis despite insufficient information. Constraint-based system detects information constraint violation, outputs "insufficient data for diagnosis" instead. Not always convenient. Always safe. European medical device regulators prefer inconvenient safety to convenient catastrophe. Americans learning this lesson expensively.
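A sketch of that defensive behavior (the required fields and stub model are hypothetical):

```python
REQUIRED_FIELDS = {"symptoms", "current_meds", "age"}

def run_constrained_diagnosis(patient: dict) -> str:
    return "diagnosis: ..."   # stand-in for the constrained model

def diagnose(patient: dict) -> str:
    # Information constraint: refuse explicitly rather than guess from partial data.
    missing = REQUIRED_FIELDS - patient.keys()
    if missing:
        return f"insufficient data for diagnosis: missing {sorted(missing)}"
    return run_constrained_diagnosis(patient)

print(diagnose({"symptoms": ["cough"], "age": 54}))
# -> insufficient data for diagnosis: missing ['current_meds']
```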
Beyond fear, toward certainty
The AI safety debate is dominated by fear. Fear of uncontrollable systems. Fear of misalignment. Fear of unintended consequences.
These fears are valid. But they're symptoms of mathematical uncertainty. When your AI is built on unstable foundations, of course you're worried about what it might do.
Binary neural networks offer something different: mathematical certainty. Not certainty about every outcome, but certainty about the mathematical properties of the system. Certainty that constraints will be satisfied. Certainty that behavior is reproducible.
This shifts the conversation from "how do we control this unpredictable system" to "how do we specify correct behavior." From fear to engineering.
European institutions are already making this transition. The Max Planck Institute for Intelligent Systems is focusing on formal verification research. France's INRIA is deploying constraint-based AI in government systems. Germany's Fraunhofer institutes are developing certifiable AI for industrial applications. Not because regulation mandates it, but because mathematics enables it. When you can prove safety, you don't need to debate it. When you can guarantee behavior, you don't need to hope for it. The fear diminishes when the foundations are solid.
The real path to safe AI
AI safety isn't about consciousness, sentience, or values alignment in the abstract philosophical sense. It's about building systems that do what they're supposed to do, reliably, every time.
Ethics matters. But ethics without mathematical foundations is just wishful thinking. You can't regulate your way to safe AI if the underlying mathematics is broken.
The path forward is clear: build AI on mathematically sound foundations. Use architectures that support formal verification. Incorporate constraints directly into the design. Make safety intrinsic, not extrinsic.
Binary neural networks aren't a complete solution to all AI safety concerns. But they solve the fundamental problem: mathematical instability. And that's the prerequisite for everything else.
You can't align a system that doesn't work reliably. You can't make ethical decisions with tools that produce inconsistent outputs. You can't build trustworthy AI on shaky mathematical ground.
But you can build provably safe systems with rigorous mathematics. You can create AI that satisfies constraints by design. You can develop technology where safety is guaranteed, not hoped for.
That's what Dweve's platform delivers. Mathematical rigor. Formal verifiability. Constraint-based safety. Not through ethical frameworks, but through better mathematics.
The AI safety crisis is real. But it's a math problem, not a philosophy problem. And math problems have math solutions.
Europe understood this from the beginning. Centuries of engineering disasters taught a simple lesson: hope isn't a strategy, testing isn't proof, and good intentions don't prevent catastrophic failures. Mathematics does. European AI companies building on this foundation aren't handicapped by regulation; they're enabled by it. When safety is mathematically guaranteed, deployment accelerates. When behavior is formally verified, trust follows naturally. The future of AI isn't philosophical debates about consciousness. It's rigorous mathematics ensuring systems work correctly. The European approach wasn't defensive. It was correct all along.
Ready for AI you can actually trust? Dweve Core's formally verifiable binary neural networks are coming. Safety through mathematics, not through hope. Join our waitlist.
About the Author
Harm Geerlings
CEO & Co-Founder (Product & Innovation)
Building the future of AI with binary neural networks and constraint-based reasoning. Passionate about making AI accessible, efficient, and truly intelligent.