• Mico. This is one of the best names I have heard in a long time coming from Microsoft. After a series of mediocre names like Zune, Bing and Cortana, Mico, the name of the new Copilot avatar, is just brilliant. It’s cute, it has a connection with both “Mi”crosoft and “Co”pilot and the visual avatar looks friendly, playful and I think people will love it. Mico is just one of 12 AI updates from Microsoft the past week – they are really going all-in on AI and they are integrating AI everywhere in their ecosystem. And as the foundation models…

    Read more…

  • Are we in an AI bubble? In some ways I think we are. Every company that sells an AI-powered service needs to pay for API access to Anthropic, Google or OpenAI to deliver value. This includes companies like Cursor, Lovable, Microsoft, and thousands more. The challenge with this business model is that the regular models like GPT5 and Claude Sonnet 4.5 has become quite costly to run, while not performing that much better than the previous generation. And the best models like Claude Opus 4.1 or GPT5-Pro cost so much to run that it’s not financially viable to use them…

    Read more…

  • Things were so much easier just six months ago! In April 2025 the comparison between Microsoft 365 Copilot and ChatGPT was easy. Two fairly simple chatbots that both used the GPT-4o model as their foundation. Many companies picked Copilot, reading through the specifications the differences were few, and you also got additional benefits such as Teams meeting transcriptions. Fast forward six months and OpenAI ChatGPT has now evolved from a simple chat service into a complete productivity platform. In May OpenAI launched Codex, which has grown into today’s best software development tool for agentic development even surpassing Claude Code. In…

    Read more…

  • The biggest news last week was undoubtedly the launch of OpenAI Sora 2. You need an invite code to try it, but if you have it you’re in for a world of fun. If you haven’t already seen Sora 2 in action, do yourself a favor and spend 2 minutes watching The Quack short movie by OpenAI. This is how far we have come with AI generated video today, and while it’s still not perfect it’s definitely good enough for hobby productions. Last week Anthropic further tightened the usage limits of Claude Opus 4.1, their top model. So now, even…

    Read more…

  • Just how good are today’s state-of-the-art AI models at doing normal office work? To test this, OpenAI just introduced GDPval, a new benchmark focusing on AI model performance on real work tasks, tested against professionals with 14 years of experience. Last year’s best model, GPT-4o, was only better than humans in 13.7% of all tests. Today’s best model, Claude Opus 4.1, is now better than humans in 47.6% of all tests. According to OpenAI the performance increase of AI models for everyday tasks have improved almost linearly the past 18 months, and is expected to continue that way in the…

    Read more…

  • How many days would it take a full team to rebuild the entire application Apple Notes? And then add the best features from other note taking applications like Bear and Evernote to it, maybe also sprinkle it with dozens of new innovative features, translate it to eight languages and make sure the app runs on Mac, Windows, Linux, Android, iPhone and iPadOS? And let’s add a full API while we’re at it. I think most of you experienced with software development would have guessed at least a year, with a 5-man team of three developers, one product manager, a part…

    Read more…

  • On Friday last week, Anthropic posted a status message saying: “We’ve identified the root causes of the reported quality issues and deployed mitigations for each. A technical post-mortem will be published on our engineering blog next week”. If you like me have been using Claude Code extensively the past months you know that it has behaved extremely inconsistent over the past three weeks. Sometimes it performed ok, but most of the times it performed really bad. For me this was not a showstopper since I could quickly switch to OpenAI Codex. Because when Anthropic went bad, OpenAI Codex with GPT5-High…

    Read more…

  • Hello and welcome to a new episode of Tech Insights! The past week was fairly low on news, the TLDR version of the week is that NotebookLM got four new audio formats, including the interesting new “debate” mode where each host picks a different side in a discussion, Google launched a small and super efficient embedding model called EmbeddingGamma for mobile and edge devices, and China now mandates all AI generated content to be clearly marked as of September 1. Of course there were a few more news than that, but if you are in a hurry that was the…

    Read more…

  • Aaron Levie, CEO at Box wrote this on X two days ago: “We’re going to look back on the world that was pre-AI and be absolutely astonished by how slow everything was. Every week at Box as we got AI-first, we highlight a workflow where someone internally built an AI agent for automating some process. It can be things like an HR process, sales outreach, responding to RFPs, handling a compliance workflow, writing documentation, and so on. Usually my immediate reaction is “I can’t believe we used to have to do all this work manually”. The amount of time that…

    Read more…

  • You have probably seen the headline by now: 95% of all Generative AI pilots are failing according to a new MIT NANDA report. Within minutes thousands of LinkedIn AI experts rushed to declare that the AI bubble is bursting and that we all should be focusing on something else. But those of us who took the time to actually read the 26-page report discovered something different. The problem isn’t that Generative AI is bad for business, it’s that companies are doing it wrong. The report reveals a fundamental “GenAI Divide” between organizations stuck with failed pilots and those achieving real…

    Read more…