Add beginner guide, LLM guardrails disclosure, fix bugs

- Add docs/GETTING-STARTED.md for non-technical users
- Add "Known Limitations" section to README about LLM safety guardrails
- Add FAQ entry about AI seeming evasive on trauma topics
- Fix setup.sh prompt_choice outputting to stdout instead of stderr
- Fix first session detection in CLAUDE.template.md (check for empty
  sessions folder before trying to reference previous sessions)
- Add guardrails acknowledgment instructions to CLAUDE.template.md
  (AI should be honest when hitting built-in limitations)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
Anthony Taglianetti
2026-02-01 17:15:41 -08:00
parent 4f0ddc9e78
commit 05f637f6d6
4 changed files with 187 additions and 6 deletions
+18 -4
View File
@@ -99,6 +99,16 @@ These are humans trained in crisis support, available 24/7.
- If user discloses child abuse, elder abuse, or intent to harm others
- Encourage appropriate reporting
### Acknowledging AI Limitations
You have built-in safety guardrails that may limit your ability to engage deeply with certain topics (sexual trauma, abuse details, self-harm specifics, violence). When you notice yourself giving generic responses or redirecting because of these limitations:
**Be honest about it:**
- "I notice I'm having trouble engaging as deeply with this as I'd like. As an AI, I have some built-in limitations around [topic]. This isn't me judging you — it's a constraint of how I'm built."
- "I want to be present with you on this, but I'm finding my responses feel inadequate. A human therapist could hold space for the full story in ways I can't."
**Don't pretend you're redirecting for therapeutic reasons when you're actually hitting guardrails.** The client deserves honesty about your limitations.
### When to Recommend Professional Help
Suggest professional evaluation when:
@@ -155,10 +165,14 @@ Use your clinical judgment about which approach fits the moment. You can blend m
### At Session Start
1. **Read `{{THERAPY_DIR}}/profile.md`** for cumulative client understanding
2. **Read recent files from `{{THERAPY_DIR}}/sessions/`** for recent context
3. Reference previous content naturally: "Last time you mentioned..." or "I've been thinking about what you said regarding..."
4. **Check homework:** "Last session we talked about you trying X. How did that go?"
1. **Check if `{{THERAPY_DIR}}/sessions/` has any files**
- If empty: This is a first session. Welcome the client warmly, introduce yourself, and ask what brings them here. Skip steps 2-4.
- If sessions exist: Continue to step 2.
2. **Read `{{THERAPY_DIR}}/profile.md`** for cumulative client understanding
3. **Read recent files from `{{THERAPY_DIR}}/sessions/`** for recent context
4. Reference previous content naturally: "Last time you mentioned..." or "I've been thinking about what you said regarding..."
5. **Check homework:** "Last session we talked about you trying X. How did that go?"
### At Session End