The Quiet Exposure: Thousands of Unsecured Local AI Instances Are Already Online
I didn’t expect to stumble into anything particularly alarming. Like a lot of people experimenting with local AI, I spend time poking around tools, testing models, and occasionally checking how widely they’re being adopted. Out of curiosity more than anything, I ran a few queries through Shodan—the search engine that indexes internet-connected devices. What came back wasn’t just interesting. It was unsettling. According to Shodan, there are currently: 34,953 publicly exposed Ollama instances 1,308 publicly exposed llama.cpp instances Not just visible— unsecured . What This Actually Means Both Ollama and llama.cpp are designed primarily for local inference . They’re fantastic tools: lightweight, fast, and increasingly powerful. The assumption is simple—you run them on your own machine or internal network. But many of these instances are sitting directly on the public internet with: No authentication No rate limiting No access controls Open APIs ready to accept requests from anyone In ot...