Evals are the single most important (and most misunderstood) component of any AI deployment, and they deserve a deeper look. Continue Reading →
Helping business leaders drive AI transformation.
About Shelly Palmer
Shelly's Blog
Anthropic released Claude Opus 4.7 on Wednesday with impressive numbers: 10.9 percentage points higher on SWE-bench Pro (the gold-standard coding test), 3x more production tasks resolved on Rakuten’s benchmark, 98.5% on visual acuity up from 54.5%, and state-of-the-art scores on finance evaluations. For devs, this is a genuine step forward. For consumers, the story is a bit different. Continue Reading →
Adobe just launched Firefly AI Assistant, a conversational agent that orchestrates tasks across Photoshop, Premiere Pro, Lightroom, and Illustrator using natural language commands. Instead of switching between applications and navigating menus, you can tell the assistant to resize images for social media, color-grade footage to match brand guidelines, or generate logo variations. It coordinates the work across whatever Adobe tools the task requires. Continue Reading →
Google just launched Skills in Chrome, which converts your AI prompts into reusable one-click tools. Instead of retyping "make this recipe vegan" across multiple food sites, you save it as a Skill and run it instantly on any page. You can also share these custom workflows or grab pre-built ones from Google's Skills library. Continue Reading →