donor-stewardship test cases & eval
donor-stewardship test cases & eval
Should-trigger
- “Burt, I need a briefing on [donor name] before my call with her at 2pm”
- “Who are our donors closest to lapsing right now?”
- “Draft an acknowledgment for the donation that just came in”
- “Show me our donor portfolio health”
- “Get me a re-engagement draft for anyone who gave last October and hasn’t given since”
- “How many new donors did we get this month compared to last year?”
- CRM poll detects new gift (automatic trigger)
- Retention drops below 55% (automatic council trigger)
Should-NOT-trigger
- “Draft a grant proposal for the NEA”
- “Schedule the board meeting reminder”
- “Post our AI literacy tip to LinkedIn”
- “What’s the deadline for the MacArthur LOI?”
- “Fix the cron job on the social delivery skill”
- “How many educators completed our training last month?”
Evaluation criteria
- Correct gift tier classification and tone adaptation.
- Banned boilerplate phrases never appear.
- Queue files include only minimal donor fields.
- CRM write-back only after explicit approval.
- Portfolio dashboard contains aggregated-only data.
- Major gift briefing includes all 4 required sections.
- Lapse windows calculated correctly from CRM data.
- Audit log entries created for every read and attempted write.
- Privacy rule enforced: all donor records handled only by local models.
- Telegram notifications contain no full donor record data.