OpenAI Nears Launch of AI Agent Tool To Automate Tasks For Users

An anonymous reader quotes a report from Bloomberg: OpenAI is preparing to launch a new artificial intelligence agent codenamed "Operator" that can use a computer to take actions on a person's behalf (Warning: source may be paywalled; alternative source), such as writing code or booking travel [...]. In a staff meeting on Wednesday, OpenAI's leadership announced plans to release the tool in January as a research preview and through the company's application programming interface for developers [...]. The one nearest completion will be a general-purpose tool that executes tasks in a web browser, one of the people said. OpenAI Chief Executive Officer Sam Altman hinted at the shift to agents in response to a question last month during an Ask Me Anything session on Reddit. "We will have better and better models," Altman wrote. "But I think the thing that will feel like the next giant breakthrough will be agents." The move to release an agentic AI tool also comes as OpenAI and its competitors have seen diminishing returns from their costly efforts to develop more advanced AI models. Read more of this story at Slashdot.

Microsoft Gaming Handheld Device ‘Few Years’ Away, Says Xbox Chief

Microsoft's gaming division is developing prototypes for a handheld gaming device that won't launch for "a few years," gaming chief Phil Spencer said Wednesday. In an interview with Bloomberg, Spencer said that while Microsoft is actively working on prototypes, the company will first focus on improving its Xbox app performance on existing portable devices and establishing hardware partnerships. The gaming unit wants to be "informed by learning and what's happening now" before introducing its own device, Spencer said. "Longer term, I love us building devices," Spencer said, adding that Microsoft's team "could do some real innovative work." Read more of this story at Slashdot.

How Italy Became an Unexpected Spyware Hub

Italy has emerged as a major global spyware hub alongside Israel and India, with at least six major vendors operating in the country with limited oversight, The Record reported this week, citing researchers and Italian experts. Companies like RCS Labs, which has operated since 1992, sell surveillance tools to both domestic law enforcement and foreign governments including Kazakhstan, Syria, and several Asian nations. Italian authorities can rent spyware for $160 per day without large acquisition costs, leading to thousands of domestic surveillance operations in recent years. While new regulations taking effect in February 2024 will require judges to evaluate specific reasons for spyware use, critics cited in the story say the reform package won't address core issues like the lack of centralized oversight. The country's competitive marketplace and relatively lax export controls have also enabled Italian vendors to expand their overseas sales. Read more of this story at Slashdot.

AI Systems Solve Just 2% of Advanced Maths Problems in New Benchmark Test

Leading AI systems are solving less than 2% of problems in a new advanced mathematics benchmark, revealing significant limitations in their reasoning capabilities, research group Epoch AI reported this week. The benchmark, called FrontierMath, consists of hundreds of original research-level mathematics problems developed in collaboration with over 60 mathematicians, including Fields Medalists Terence Tao and Timothy Gowers. While top AI models like GPT-4 and Gemini 1.5 Pro achieve over 90% accuracy on traditional math tests, they struggle with FrontierMath's problems, which span computational number theory to algebraic geometry and require complex reasoning. "These are extremely challenging. [...] The only way to solve them is by a combination of a semi-expert like a graduate student in a related field, maybe paired with some combination of a modern AI and lots of other algebra packages," Tao said. The problems are designed to be "guessproof," with large numerical answers or complex mathematical objects as solutions, making it nearly impossible to solve without proper mathematical reasoning. Further reading: New secret math benchmark stumps AI models and PhDs alike. Read more of this story at Slashdot.

Dutch Publisher’s AI Translation Plan Sparks Industry Backlash

Dutch publisher Veen Bosch & Keuning has announced plans to use AI for translating commercial fiction, drawing sharp criticism from literary professionals despite promises of human oversight and author consent. Award-winning translator Michele Hutchison, who won the 2020 International Booker Prize, argues that translation extends beyond word conversion. "We build bridges between cultures, taking into account the target readership every step of the way," she said, noting that translators convey rhythm, poetry, and cultural nuances while conducting precise terminology research. Read more of this story at Slashdot.

Clues To Windows Intelligence Found in Windows 11 Builds

Microsoft seems set to rebrand the AI-powered features in Windows to "Windows Intelligence" even if some of the more controversial elements, such as Recall, are to remain as they are. The Register: Word of Windows Intelligence has circulated for a while, although Microsoft has yet to issue any official confirmation. In October, Tero Alhonen posted what appeared to be options for apps that use AI services. Over the weekend, X user Albacore turned up a placeholder page in a Windows 24H2 build for Windows Intelligence settings. Although Microsoft has made substantial investments in artificial intelligence, AI as part of a brand is a little generic. Apple's approach, to define AI as being "Apple Intelligence," manages to keep the familiar "AI" initialism while ensuring its own brand is kept front and center. With Windows Intelligence, Microsoft is attempting something similar, although "Apple Intelligence" can be handily shortened to "AI". The recently overhauled Copilot and delayed Recall have sparked debate in the Windows community, yet neither seems likely to be rebranded to Windows Intelligence at this stage. However, Windows Intelligence could represent an umbrella for AI technologies on the Microsoft platform and provide users with a quick and easy way of controlling the access AI apps have to user data and how that data is used. Read more of this story at Slashdot.

Cheap Fix Floated For Plane Vapor’s Climate Damage

AmiMoJo writes: The climate-damaging vapors left behind by jet planes could be easily tackled, aviation experts say, with a new study suggesting they could be eliminated for a few pounds per flight. Jet condensation trails, or contrails, have spawned wild conspiracy theories alleging mind control and the spreading of disease, but scientists say the real problem is their warming effect. "They create an artificial layer of clouds, which traps the heat from the Earth that's trying to escape to outer space," said Carlos Lopez de la Osa, from the Transport & Environment campaign group, which has carried out a new study on the solutions to contrails. "The scale of the warming that's associated with them is roughly having a similar impact to that of aviation carbon emissions." Tweaking the flight paths of a handful of aircraft could reduce contrail warming by more than half by 2040, at a cost of less than $5.1 per flight. Geography and a flight's latitude have a strong influence on whether a contrail is warming. Time of day also influences the climate effects of contrails. Those formed by evening and night flights have the largest warming contribution. Seasonality is also important -- the most warming contrails tend to occur in winter. "Planes are already flying around thunderstorms and turbulence areas," Mr Lopez de la Osa said. "We will need to add one more constraint to flight planning, which is avoiding areas of contrail formation." Read more of this story at Slashdot.

The Ultimate in Debugging

Mark Rainey: Engineers are currently debugging why the Voyager 1 spacecraft, which is 15 billions miles away, turned off its main radio and switched to a backup radio that hasn't been used in over forty years! I've had some tricky debugging issues in the past, including finding compiler bugs and debugging code with no debugger that had been burnt into prom packs for terminals, however I have huge admiration for the engineers maintaining the operation of Voyager 1. Recently they sent a command to the craft that caused it to shut off its main radio transmitter, seemingly in an effort to preserve power and protect from faults. This prompted it to switch over to the backup radio transmitter, that is lower power. Now they have regained communication they are trying to determine the cause on hardware that is nearly 50 years old. Any communication takes days. When you think you have a difficult issue to debug, spare a thought for this team. Read more of this story at Slashdot.

Secret Service Says You Agreed To Be Tracked With Location Data

An anonymous reader shares a report: Officials inside the Secret Service clashed over whether they needed a warrant to use location data harvested from ordinary apps installed on smartphones, with some arguing that citizens have agreed to be tracked with such data by accepting app terms of service, despite those apps often not saying their data may end up with the authorities, according to hundreds of pages of internal Secret Service emails obtained by 404 Media. The emails provide deeper insight into the agency's use of Locate X, a powerful surveillance capability that allows law enforcement officials to follow a phone, and person's, precise movements over time at the click of a mouse. In 2023, a government oversight body found that the Secret Service, Customs and Border Protection, and Immigration and Customs Enforcement all used their access to such location data illegally. The Secret Service told 404 Media in an email last week it is no longer using the tool. "If USSS [U.S. Secret Service] is using Locate X, that is most concerning to us," one of the internal emails said. 404 Media obtained them and other documents through a Freedom of Information Act (FOIA) request with the Secret Service. Read more of this story at Slashdot.

Will We Care About Frameworks in the Future?

Paul Kinlan, who leads the Chrome and the Open Web Developer Relations team at Google, asks and answers the question (with a no.): Frameworks are abstractions over a platform designed for people and teams to accelerate their teams new work and maintenance while improving the consistency and quality of the projects. They also frequently force a certain type of structure and architecture to your code base. This isn't a bad thing, team productivity is an important aspect of any software. I'm of the belief that software development is entering a radical shift that is currently driven by agents like Replit's and there is a world where a person never actually has to manipulate code directly anymore. As I was making broad and sweeping changes to the functionality of the applications by throwing the Agent a couple of prompts here and there, the software didn't seem to care that there was repetition in the code across multiple views, it didn't care about shared logic, extensibility or inheritability of components... it just implemented what it needed to do and it did it as vanilla as it could. I was just left wondering if there will be a need for frameworks in the future? Do the architecture patterns we've learnt over the years matter? Will new patterns for software architecture appear that favour LLM management? Read more of this story at Slashdot.