I wanted to maybe start making PeerTubr videos, but then I realized I’ve never had to consider my voice as part of my threat model. A consequence that immediately comes to mind is potentially having your voice trained on by AI, but I’m not (currently) in a position where others would find it desirable to do so. Potentially in the future?
I’d like to know how your threat model handles your personal voice. And as a bonus, how would voice modulators help your voice in/prevent your voice from being more flexible in your threat model? Thanks!
Yes, to a degree. In my experience a lot of attackers will call phones and wait for you to say something to get a model of your voice.
When I answer a phone number I do not recognize, I don’t say “Hello” anymore. Instead I remain quiet for 10 seconds. If its a poorly made bot trying to scrape my voice, usually it hangs up within that time.
I am also more open to letting everything I don’t recognize ring through to my voice mail which uses the default response without my voice since these types of attacks started.
For peertube etc, I am unsure what the best solution is, but would be interested in what people here suggest!