ChatGPT can now read some of your Mac’s desktop apps


OpenAI’s ChatGPT is starting to work with other apps on your computer.

On Thursday, the startup announced that the ChatGPT desktop app for macOS can now read code in a handful of developer-focused apps, such as VS Code, Xcode, TextEdit, Terminal, and iTerm2.

That means developers will no longer have to copy and paste their code into ChatGPT, which has become a common way to use the chatbot. Now, when the feature is enabled, the app will automatically send the section of code you’re working on to the chatbot as context, alongside your prompt.

However, unlike popular AI coding tools such as Cursor or GitHub Copilot, ChatGPT is currently unable to write code directly into developer apps on your behalf.

The feature, called Work with Apps, is far from an AI agent, but OpenAI says getting ChatGPT to understand other apps is a “key building block” towards building agentic systems. One of the biggest challenges facing AI agents today is getting them to understand the rest of your computer screen, as opposed to prompts or their own responses.

OpenAI says it’s focusing this feature on coding apps to start, likely because AI coding assistants have taken off as one of the most popular use cases for LLMs. The feature is available to Plus and Teams users today, and will roll out to Enterprise and Edu in the next few weeks. OpenAI says ChatGPT will be able to work with other types of apps moving forward, specifically text-based apps that could be used for writing tasks.

You can now select a few coding apps for ChatGPT to work with (Image: OpenAI)

In a demo with TechCrunch, an OpenAI employee opened the ChatGPT app and an Xcode environment containing a simple project modeling the solar system – although it was missing the Earth. The employee selected an Xcode tab within ChatGPT, which tells the AI chatbot to look at the app, and prompted the chatbot to “add the missing planets.” The chatbot was able to complete the task, writing a line of code to represent the Earth that matched the rest of the project’s format. They still had to paste ChatGPT’s answer back into their environment, though.

To read different apps, OpenAI is mostly relying on the macOS Accessibility API to read text and pass it to ChatGPT, according to OpenAI desktop product lead Alexander Embiricos. The API, which underpins Apple’s VoiceOver screen reader, has been around for nearly two decades. It’s generally considered pretty reliable for most common apps, but not all of them.

For some apps, such as Microsoft’s VS Code, Work with Apps requires users to install a special extension to query content. And, as the name suggests, Apple’s screen reader can only read text, so it can’t help ChatGPT understand visual elements – such as photos, the orientation of objects, or videos.

Work with Apps will send your last 200 lines of code to ChatGPT alongside every prompt for certain apps. For others, all the code in your foremost window will be used as input for the chatbot. You can highlight sections of code or text to help ChatGPT focus on the right part of the project, but ChatGPT will also include the text surrounding the selection. This all sounds like it will use a lot of input tokens.
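OpenAI has not published how it assembles this context, so the following is only a rough Python sketch of the behavior described above, not OpenAI’s implementation. The function names, the `padding` value for "surrounding" text, and the prompt layout are all hypothetical; only the 200-line figure comes from the reporting.

```python
def tail_context(source: str, max_lines: int = 200) -> str:
    """Return the last `max_lines` lines of `source` (200 is the figure reported)."""
    if max_lines <= 0:
        return ""
    lines = source.splitlines()
    return "\n".join(lines[-max_lines:])


def selection_context(source: str, start: int, end: int, padding: int = 5) -> str:
    """Return the selected lines (0-based, inclusive) plus nearby lines.

    `padding` is an assumption: the article only says surrounding text
    is included, not how much.
    """
    lines = source.splitlines()
    lo = max(0, start - padding)
    hi = min(len(lines), end + 1 + padding)
    return "\n".join(lines[lo:hi])


def build_prompt(user_prompt: str, context: str) -> str:
    """Combine editor context with the user's request (hypothetical layout)."""
    return f"Context from the editor:\n{context}\n\nRequest: {user_prompt}"
```

With a 500-line file, `tail_context` keeps only lines 301 through 500, while `selection_context` keeps the highlighted range plus a few lines on either side. Either way, the context is re-sent with every prompt, which is why the token cost adds up.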

ChatGPT working with Xcode (Image: OpenAI)

It’s unclear how OpenAI plans to extend this feature to apps that are not compatible with Apple’s screen reader. Anthropic, one of OpenAI’s competitors, released an AI system that analyzes screenshots of a user’s desktop to understand and use other apps. To be frank, Anthropic’s approach leaves a lot to be desired in its current state: it’s slow and makes a lot of mistakes. However, it’s a more general-purpose version of an AI agent that doesn’t rely on APIs, and can do more than just read text in another window.

“This isn’t meant to be an agent, it’s a way to collaborate with coding tools to start, and there will be more tools coming soon,” said Embiricos in a briefing with TechCrunch. “On the side of agents, I think this is a really key building block. This idea that ChatGPT understands or can work with all the content that you have so that it can help with it.”

This step towards agents is especially notable given a recent Bloomberg report that OpenAI is nearing the release of a general-purpose AI agent, codenamed “Operator.” The tool is expected to arrive in early 2025, and would rival other early attempts at general-purpose AI agents, such as Anthropic’s Computer use or Google’s reported “Jarvis” agent.

OpenAI is first releasing these features on macOS, shortly before Apple launches an integration with ChatGPT in December. It’s unclear when Work with Apps will come to Windows, the operating system created by OpenAI’s largest backer, Microsoft.




Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.
