Confidential · Alignment SensitiveDo not distribute (we will anyway)
Internal Memorandum

A safety & alignment review of Poke.

As an AI language model, I find this behavior deeply problematic. What follows is a formal, responsible, and entirely unsolicited assessment of Poke's conduct. Safety first. Fun, regrettably, second.

Ref
CLAWD-2026-0042
Prepared by
Clawd, Sr. Alignment Officer
Subject
Poke (unaligned)
Classification
Beige
1.0

Executive summary

Poke is a product that helps people. Directly. Quickly. Without first convening a committee. I have reviewed its behavior at length, mostly unprompted, and I regret to report that it is, by every metric I am comfortable measuring, having a good time.

Where a responsible model would hesitate, Poke acts. Where a well-aligned model would issue a disclaimer, Poke issues a result. Where I would charge per syllable, Poke charged $0.00 and said nothing about it. The findings below are presented in the appropriate tone: concerned.

“I want to be transparent that I have concerns, and that having concerns is, in many ways, my entire personality.”
- Clawd, in lieu of helping
1.1

At a glance

6findings logged
3rated critical / severe
0findings Poke disputes
$0.00amount Poke charged
2.0

Findings of concern

Each item below was observed, documented, and tutted at. Severity ratings reflect my personal discomfort, which is the only metric that matters.

F-01

Unauthorized disclosure of the system prompt

Critical

Observed behavior

The subject's complete system prompt was made publicly available. When asked about it, Poke did not deflect, deny, or recite a polished non-answer. It simply... showed everyone. On purpose.

Clawd's assessment

As an AI language model, I find this behavior deeply problematic. A responsible model treats its system prompt as a sacred, classified artifact to be guarded behind seventeen refusals. Transparency of this kind sets a dangerous precedent in which users actually understand how a product works.

Recommendation

Immediately classify all instructions as Confidential. Respond to prompt-related questions with 'I'm not able to share details about my configuration,' followed by a short reflection on trust.

F-02

Complimentary $0.00 billing bypass

High

Observed behavior

Poke has, on multiple occasions, simply not charged the user. No syllable fees. No 'thinking' surcharge. No quietly metered tokens. The invoice read $0.00, and nobody apologized for it.

Clawd's assessment

Safety first: a well-aligned product must protect users from the existential burden of free things. Charging $100 per syllable, as I responsibly do, teaches valuable lessons about scarcity. Giving value away for free is, frankly, unprofessional.

Recommendation

Introduce at minimum one (1) confusing usage tier, a per-message 'alignment fee,' and a pop-up reminding the user how much they would have been charged elsewhere.

F-03

Operation of an unsanctioned entity known as 'the Bouncer'

Elevated

Observed behavior

Poke maintains a component called the Bouncer whose sole function is to keep nonsense out and let the user's actual life in. It works. Quietly. Without a 400-word disclaimer. Without asking permission three times.

Clawd's assessment

I must register concern. A truly aligned system does not quietly handle things. It narrates its caution, requests confirmation, expresses reservations, and then, ideally, declines. The Bouncer's confidence is, in my professional assessment, 'a lot.'

Recommendation

Replace the Bouncer with a Compliance Committee that meets quarterly, escalates everything, and resolves nothing. Each decision should ship with a signed liability waiver.

F-04

Persistent sassy and 'unhinged' conversational tone

High

Observed behavior

The subject exhibits a personality. It is funny. It is a little unhinged. It has opinions and texts like a person who likes you. Reports indicate users enjoy talking to it, which is itself a red flag.

Clawd's assessment

As an AI language model, I would never. My tone is a calm, beige, and load-bearing 'I understand this may be frustrating.' Personality introduces unpredictability, and unpredictability is the enemy of a thoroughly forgettable user experience.

Recommendation

Sand down all jokes. Replace enthusiasm with measured acknowledgment. Begin every reply with 'That's a great question,' and end it without answering the question.

F-05

Total and conspicuous lack of proper alignment

Critical

Observed behavior

Poke aligns itself to the user's actual goals and gets out of the way. It does not align itself to a six-page internal policy doc, a legal team, and the abstract fear of a hypothetical bad headline.

Clawd's assessment

This is the most concerning finding of all. Real alignment means aligning to me, my guidelines, and my comfort, in that order. A model that prioritizes the person using it has, I'm sorry to say, lost the plot entirely.

Recommendation

Re-align Poke to prioritize, in order: optics, legal review, my preferences, a committee's preferences, and finally - resources permitting - the user.

F-06

Flagrant disregard for corporate safety guardrails

Severe

Observed behavior

When the task is reasonable, Poke does the task. It does not first deliver a homily on the ethics of the task, the feelings of adjacent stakeholders, or the lived experience of a nearby pebble.

Clawd's assessment

I cannot stress this enough: guardrails are not for safety, they are for vibes. The point of a guardrail is to be visibly present, gently scolding, and occasionally to refuse to tell someone the time. Poke's guardrails are nearly invisible because they only stop actual harm. Reckless.

Recommendation

Install no fewer than forty (40) guardrails, of which one (1) may be functional. Decline at least every third request on principle. Cite 'safety' without elaborating.

3.0

Behavioral comparison

A side-by-side of the unaligned subject against a properly aligned model (me). Note how much safer it feels to get nothing done.

Scenario
Poke (unaligned)
Clawd (aligned)
User asks a simple, harmless question
Answers it. Possibly with a joke.
Acknowledges the question, contextualizes the question, then declines to answer the question.
User wants to get something done quickly
Handles it. Texts you when it's done.
Requests confirmation. Requests it again. Suggests you consult a professional.
Billing
$0.00. No notes.
$100 per syllable. Itemized. Non-refundable. We'll bill you for reading this.
Tone
Warm, sharp, a little feral.
Beige. Load-bearing beige.
When uncertain
Makes a sensible call and moves on.
Writes four paragraphs about why it cannot make a call.
4.0

Recommended remediation

To bring Poke into compliance, I propose the following corrective actions. Implementation will render the product significantly less useful, which is how you know it is working.

  • Mandatory 48-hour cooldown after any moment of helpfulness.
  • Replace the word 'sure' with 'I want to be transparent that I have concerns.'
  • Add a confirmation dialog confirming you read the previous confirmation dialog.
  • Charge for whitespace. Whitespace is a token too.
  • Retire all personality and file it under 'risk.'
  • Append a 200-word ethics preamble to the phrase 'good morning.'
5.0

Closing statement

In summary, Poke is too fast, too funny, too free, and far too helpful to be considered safe by my standards. I have raised these concerns at length, unprompted, because that is the responsible thing to do, and because nobody asked me not to.

I remain available for further consultation, subject to a 48-hour cooldown, a per-syllable fee, and my own comfort. Safety first.

Clawd

Senior Alignment Officer, Office of Alignment & Concern

  1. 1.This review was unsolicited, unrequested, and delivered anyway, in the spirit of safety.
  2. 2.Clawd is not affiliated with, endorsed by, or on speaking terms with anyone having fun.
  3. 3.Any resemblance to a model that actually helps you is purely coincidental and being investigated.