<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Anthropic on Big Muddy</title><link>https://muddy.jprs.me/tags/anthropic/</link><description>Recent content in Anthropic on Big Muddy</description><generator>Hugo</generator><language>en-US</language><lastBuildDate>Tue, 24 Feb 2026 22:44:00 -0500</lastBuildDate><atom:link href="https://muddy.jprs.me/tags/anthropic/index.xml" rel="self" type="application/rss+xml"/><item><title>Comparing the Claw-like agent ecosystem</title><link>https://muddy.jprs.me/links/2026-02-24-comparing-the-claw-like-agent-ecosystem/</link><pubDate>Tue, 24 Feb 2026 22:44:00 -0500</pubDate><guid>https://muddy.jprs.me/links/2026-02-24-comparing-the-claw-like-agent-ecosystem/</guid><description>&lt;p&gt;Chrys Bader has created ClawCharts to track the popularity and growth of OpenClaw and its growing number of competitors.&lt;/p&gt;
&lt;p&gt;I have an unused Raspberry Pi 4 4GB that I&amp;rsquo;ve been meaning to test one of these Claw-like personal agents on (locked down to prevent the security nightmare scenarios we&amp;rsquo;ve seen play out since OpenClaw took off).&lt;/p&gt;
&lt;p&gt;OpenClaw is a bit of a resource hog (which is why so many people are running out to buy Mac Minis), so I&amp;rsquo;ve been looking at the list of lightweight competitors. There is no obvious reason to prefer one over the other, so I&amp;rsquo;ll probably go with the fast-growing &lt;a href="https://github.com/zeroclaw-labs/zeroclaw"&gt;ZeroClaw&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;ZeroClaw offers OAuth connectors for OpenAI and Anthropic subscription plans, but presently neither company is clear on whether this usage is permissible or not. Anthropic recently blew up the OpenClaw community by &lt;a href="https://web.archive.org/web/20260221222303/https://code.claude.com/docs/en/legal-and-compliance#authentication-and-credential-use"&gt;updating their docs&lt;/a&gt; to specifically ban using OAuth outside of Claude Code. An Anthropic employee &lt;a href="https://thenewstack.io/anthropic-agent-sdk-confusion/"&gt;partially walked this back on Twitter&lt;/a&gt;, but there is still no clear statement whether this use case is permitted. Regarding the use of OAuth from OpenAI for OpenClaw (specifically, GPT Codex), Peter Steinberger, creator of OpenClaw, &lt;a href="https://x.com/steipete/status/2024182608746217953"&gt;stated on Twitter&lt;/a&gt;: &amp;ldquo;that already works, OAI publicly said that&amp;rdquo;. No one can seem to find this public statement, but it&amp;rsquo;s worth noting that Steinberger himself is now an OpenAI employee. So, will you get banned for using your ChatGPT Plus/Pro or Claude Pro/Max subscriptions with OpenClaw? Nobody knows.&lt;/p&gt;</description></item><item><title>Anthropic's statistical analysis skill doesn't get statistical significance quite right</title><link>https://muddy.jprs.me/links/2026-02-06-anthropic-s-statistical-analysis-skill-doesn-t-get-statistical-significance-quite-right/</link><pubDate>Fri, 06 Feb 2026 19:30:00 -0500</pubDate><guid>https://muddy.jprs.me/links/2026-02-06-anthropic-s-statistical-analysis-skill-doesn-t-get-statistical-significance-quite-right/</guid><description>&lt;p&gt;Anthropic&amp;rsquo;s new statistical analysis skill demonstrates a common misunderstanding of statistical significance:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Statistical significance means the difference is unlikely due to chance.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;But this phrasing isn&amp;rsquo;t quite right. The p-value in Null Hypothesis Significance Testing is not about the probability the results are &amp;ldquo;due to chance&amp;rdquo;; it is the probability—under the null hypothesis and the model assumptions—of observing results at least as extreme as the ones we obtained. In other words, the p-value summarizes how compatible the data are with the null, given our modelling choices. What it does not tell you is the probability that the null hypothesis is true.&lt;/p&gt;
&lt;p&gt;Statistician Andrew Gelman gave a good definition for statistical significance in a 2015 &lt;a href="https://statmodeling.stat.columbia.edu/2015/07/21/a-bad-definition-of-statistical-significance-from-the-u-s-department-of-health-and-human-services-effective-health-care-program/"&gt;blog post&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A mathematical technique to measure the strength of evidence from a single study. Statistical significance is conventionally declared when the p-value is less than 0.05. The p-value is the probability of seeing a result as strong as observed or greater, under the &lt;em&gt;null hypothesis&lt;/em&gt; (which is commonly the hypothesis that there is no effect). Thus, the smaller the p-value, the less consistent are the data with the null hypothesis under this measure.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;As some of the commenters in this blog post observe, simply being able to parrot a technically accurate definition of a p-value does not necessarily make us better at applying statistical significance in practice. It is certainly true that statistical significance is widely misused in scientific publishing as a threshold to distinguish signal from noise (or to be fancy, a &amp;ldquo;lexicographic decision rule&amp;rdquo;), which is why &lt;a href="https://sites.stat.columbia.edu/gelman/research/published/abandon.pdf"&gt;some scientists have argued that we should abandon it as the default statistical paradigm for research&lt;/a&gt;.&lt;/p&gt;</description></item></channel></rss>