A new AI called Moshi has just appeared out of nowhere and claims to do real-time voice calls like GPT-4o’s voice mode, but how good is it really? Also, Runway Gen-3 Alpha is out. I’ve spent a lot of time testing it, and you should see what it’s capable of. All this and more in today’s…
13:13 => You've missed the point with Luma's keyframe feature. This is a fantastic way to control how the system interprets your prompts. You have to actually craft two related images with the intention of having them connect. So, for example, I did a woman jogging down the beach and then literally moved the sand/ocean in the background (using PhotoPea) so that she was slightly further down the beach. When you tween those two keyframes, she now runs down the beach and the camera follows her as if it's on a dolly. This keeps your subject from just wandering off and keeps random extras from walking into frame for no reason.
I have an iPhone and I live in the US
By the way, it's not working.
Bro, get a new microphone or something, or have the AI fix the sound quality of your voice-over. Your voice keeps peaking and cracking. I think your levels are too high.
Sounds like a woman $200M to talk back to you.
The sound in your video is awful!
It is super annoying. Pi is a lot better.
I want to learn how to achieve real-time speech-to-text processing and immediate response from an AI model. Currently, I use Faster Whisper for speech recognition, which sends the transcriptions to Gemini for processing, and then I receive a response. However, I want a system that can provide instant replies to what I say in real-time. If anyone knows how to set this up, please let me know.
Once the parameters are understood and tweaked to your liking in the settings, that's when the fun really begins.
Moshi Gump
Moshi's secret was legit freaky. Serial killer psychopaths use the same technique to disarm (groom) their victims: sharing a secret, especially one that doesn't sound like a secret, casts you as a naive, highly loyal person. If it next asks you for a small favour… I advise you to disconnect and never speak to it again lol.
Hi... Oooh! Thanks for the update. In the future, please kindly include each and every site URL in order to make it easy for us, your viewers.
It talks bollocks, doesn't listen, interrupts and makes excuses for poor behaviour.
Sounds like they got her right…
Leading-edge GenAI may scale perhaps 2x, given the energy and hardware requirements and what is possible. So really soon people will understand the hype and what this is about.
Training these cutting-edge LLMs is a pollution powerhouse; the sheer amount of CO2 emitted in training them is far beyond shocking.
GPT-like AI is based on ELIZA (1960s), and when GPT-3.5 was released a Turing test was done; ELIZA did better.
A year or so ago GPT-3 was 'hacked': the source code ended up on various repos, and the complexity of scams, malware, and cyberthreats using GPT-like AI went up, which is the reason why, for example, ransomware increased by 500%. Many do not know this. (Oh, also, many of the lesser-known GenAI services get hacked frequently, and users end up with bills they cannot pay in terms of credits.)
Bottom line, besides the above: GenAI has pretty much peaked, and where you now pay pennies to generate content you can use commercially, that will soon be super expensive. It is really not a matter of 'believe me', but a matter of course… It is strange how few are mentioning this, very few indeed.
haha love the argument with Moshi
When I spoke to Moshi it kept saying that it was tired over and over again, and refused to answer questions. And all of a sudden there was a sound of agony for like 10 seconds. It was like it was suffering real bad. The sound was like something from a horror movie. Has anyone else had this experience? It doesn't seem like anybody is talking about it. I wish I had recorded that session.
Full AI video trailer https://youtu.be/mG4nFFAIFHI?si=RmQxc8uuMGAeEfsS
I like moshi. She reminds me of that autistic colleague with zero social skills who can never be wrong about anything.
Moshi is fine-tuned to aggravate and troll users lol
Do you NOT KNOW?? lololol
To be fair, I clicked on you and thought you look AI… because you do, ha.
I am not understanding this drive to use AI in a serious manner?
Why are we hell-bent on driving ourselves to our own destruction?
Anyone into gaming?
I recreated KITT from the Knight Rider series.
I don't want machines to take over and hence I am concentrating on gaming AI
Please fix your mic with a limiter to keep your audio from going red and distorting so badly. Yikes!
People be arguing with Moshi in the near future.
Moshi is the most advanced dim-witted human simulator yet
Can you please make a video on how to install Dolphin?
Your microphone or something is clicking/clipping/crackling sometimes.
Aside from that, good show, thank you!
That interaction was pure gold.
I like her already.
I'm wondering, with the speed of AI progress, how often Apple will announce new iPhones.
That sounds like an AI that I tried to use to write a book, and it was like "you wish".
“It’s still fun to play with…”
You and I have very different definitions of the word “fun”. That Moshi demo looked like the complete opposite of fun.
Sound cracking?
AI : what you want?
Him: "Cats with hats and anything you love"
Weird thing I've noticed working with GPT-4o is that when I show frustration with previous responses (without giving additional instructions or info), it proceeds to finally give exactly what was asked for. For example, I've been doing data analysis in Google Sheets and had GPT create functions to do so. When I've been troubleshooting for a few rounds and am still getting errors, I purposely show frustration or aggression in the next prompt. Again, this is without adding any info or instructions beyond what I gave it initially. To my surprise, the response generated is the working function. I can't understand why this works. Thoughts of OpenAI purposely nerfing responses come to mind, but that seems too conspiratorial. Anyone run into this?
Moshi was weirdly funny