ChatGPT can now read some of your Mac’s desktop apps

Date:

Share post:


OpenAI’s ChatGPT is starting to work with other apps on your computer.

On Thursday, the startup announced the ChatGPT desktop app for MacOS can now read code in a handful of developer-focused coding apps, such as VS Code, Xcode, TextEdit, Terminal, and iTerm2.

That means developers will no longer have to copy and paste their code into ChatGPT, which has become a common way to use the chatbot. Now, when the feature is enabled, OpenAI will automatically send the section of code you’re working on through its chatbot as context, alongside your prompt.

However, unlike popular AI coding tools such as Cursor or GitHub Copilot, ChatGPT is currently unable to write code directly into developer apps on your behalf.

The feature, called Work with Apps, is far from an AI agent, but OpenAI says getting ChatGPT to understand other apps is a “key building block” towards building agentic systems. One of the biggest challenges facing AI agents today is getting them to understand the rest of your computer screen, as opposed to prompts or their own responses.

OpenAI says it’s focusing this feature on coding apps to start; this is likely because AI coding assistants have taken off as one of the most popular use cases for LLMs. The feature is available to Plus and Teams users today, and will roll out to Enterprise and Edu in the next few weeks. OpenAI says ChatGPT will be able to work with other types of apps moving forward, specifically, text-based apps that could be used for writing tasks.

You can now select a few coding apps for chatgpt to work with (Image: OpenAI)

In a demo with TechCrunch, an OpenAI employee opened the ChatGPT app and an Xcode environment containing a simple project modeling the solar system – although it was missing the Earth. The employee selected an Xcode tab within ChatGPT, which tells the AI chatbot to look at the app, and prompted the chatbot to “add the missing planets.” The chatbot was able to complete the task, writing a line of code to represent the Earth that matched the rest of the project’s format. They still had to paste ChatGPT’s answer back into their environment, though.

In order to read different apps, OpenAI is mostly relying on the MacOS Accessibility API to read text and translate it to ChatGPT, according to OpenAI desktop product lead Alexander Embiricos. MacOS’s screen reader, which helps Apple’s VoiceOver feature work, has been around for nearly two decades. It’s generally considered pretty reliable for most common apps, but not everything.

For some apps, such as Microsoft’s VS Code, Work with Apps requires users to install a special extension to query content. And, as the name suggests, Apple’s screen reader can only read text, so it can’t help ChatGPT understand visual elements – such as photos, the orientation of objects, or videos.

Work with Apps with send your last 200 lines of code through ChatGPT alongside every prompt for certain apps. For others, all the code in your foremost window will be used as input for the chatbot. You can highlight sections of code or text to help ChatGPT focus on the right part of the project, but ChatGPT will also include text surrounding it. This all sounds like it will use a lot of input tokens.

AD 4nXesCPqgHPkzxwReZtnSfK7kSsZeaFze2AH3npFKHgzUpXq4lWAJG1ZZr48tkVAT G3FgI2 8Zc3 n6XaycRc Sh1sAn8P74OmMUUr1ReOdGY0dPGsyuqUUbiIW4 WA7wUKh9wCwhQ
Chatgpt working with xcode (Image: OpenAI)

It’s unclear how OpenAI plans to branch this feature out to other apps that are not compatible with Apple’s screen reader. Anthropic, one of OpenAI’s competitors, released an AI system that analyzes screenshots of a user’s desktop to understand and use other apps. To be frank, Anthropic’s approach leaves a lot to be desired in its current state: it’s slow and makes a lot of mistakes. However, it’s a more general purpose version of an AI agent that doesn’t rely on APIs, and can do more than just read text in another window.

“This isn’t meant to be an agent, it’s a way to collaborate with coding tools to start, and there will be more tools coming soon” said Embiricos in a briefing with TechCrunch. “On the side of agents, I think this is a really key building block. This idea that ChatGPT understands or can work with all the content that you have so that it can help with it.”

This step towards agents is especially notable given recent reports that OpenAI is nearing the release of a general purpose AI agent, codenamed “Operator,” according to Bloomberg. The tool is expected to arrive in early 2025, and would rival other early attempts at general purpose AI agents, such as Anthropic’s Computer use or Google’s reported “Jarvis” agent.

OpenAI is first releasing these features on MacOS, shortly before Apple launches an integration with ChatGPT in December. It’s unclear when Work with Apps will come to Windows, the operating system created by OpenAI’s largest backer, Microsoft.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

Nubank leads $250M round in African digital bank Tyme at $1.5B valuation

Tyme Group, a South African-born fintech operating in the African country and the Philippines, has secured $250...

Hexa, the startup studio behind Aircall and Swan, unveils its next batch of startups

Hexa, a Paris-based startup studio that has launched dozens of B2B software companies, is sharing a list...

Mark Zuckerberg says Threads now has 100M daily active users

Meta’s X rival Threads is growing steadily with more than 100 million people using the service daily,...

Waymo robotaxis are coming to Tokyo in 2025

Waymo will begin testing its autonomous vehicle technology in Tokyo in early 2025, the first time the...

Cohere is quietly working with Palantir to deploy its AI models

Cohere is one of the best-known AI startups outside of OpenAI and Anthropic, hitting a $5.5 billion...

Jay-Z’s Marcy Venture Partners merges with investment arm of Pendulum Holdings

Jay-Z’s venture capital firm, Marcy Venture Partners, has merged with another Black-owned investment firm, Pendulum Holdings’ investment...

iRobot co-founder’s new home robot startup hopes to raise $30M

Colin Angle, one of the co-founders of Roomba maker iRobot, is raising cash for a home robotics...

TuSimple’s former CEO wants a new board that will liquidate the company

TuSimple co-founder and former CEO Xiaodi Hou is on a war path in the lead up to...