We are now three panics into the same story and still haven't learned to tell the difference between what can happen and what will.
Voice cloning works—a scammer can steal your mother's voice and ask your father for five thousand dollars over the phone. It takes seconds.
But we have seen this exact arc before. It resolves in a specific way that has almost nothing to do with how scared we were when it started. In 2017, deepfake video technology emerged with astonishing capability, yet nine years later, deepfake videos have caused exactly zero major crimes, zero election interference. Zero successful blackmail rings operating at scale.
The reason wasn't better detection or law—it was economics. A deepfake video requires significant technical skill and social engineering to deploy effectively. The payoff for most crimes is too small relative to the risk. Before deepfakes came the spoofed caller ID panic. Followed the same pattern: terrifying scale, widespread harm, yet the response was quiet economic incentives and regulatory pressure that made spoofing costly and traceable enough to reduce its utility for large-scale fraud.
This is the part that should keep you awake: not the fraud, which will probably constrain itself through the usual friction of criminal economics. The normalization of voice as a harvestable commodity. The scammer is not the real customer—the real customer is whoever pays for perfect voice synthesis at scale, whether an advertising company, a security agency, or anyone with enough money and a use case for perfect voice impersonation without your knowledge.