Along with OpenAI's ChatGPT, Microsoft's Copilot and Apple Intelligence, Google Gemini is one of the dominant forces in the world of artificial intelligence (AI) and chatbots. Gemini is both the name for Google chatbot and the LLM that powers it, and it's free to use via a web browser, or on your mobile, but there's a paid-for version called Gemini Advanced that has some juicy features, like Gemini Live.
Google Gemini burst onto the scene in February 2024 and immediately made some big waves in the AI world, but it was the release of Gemini Live at the Made for Google event in August 2024 that really put it on the map. Gemini LIve is a version of Gemini that runs on Android phones and enables you to have free flowing conversations about complex topics using your voice instead of having to type on the keyboard. But is Gemini Live enough to defeat Apple's AI-enhanced Siri or the forthcoming ChatGPT Voice Mode? What can it do right now, and what about in the future? And if you want to use Gemini, how exactly do you do that? We’ve taken a deep dive into the world of Gemini to find the answers to all these questions and more. If you’re curious about Google’s latest AI efforts, this is the place to be.
What is Google Gemini?
Gemini is Google’s large language model (LLM). What’s an LLM? It’s the system that underpins the types of AI tools you’ve probably seen and interacted with on the internet. For example, GPT-4 powers ChatGPT-4o, OpenAI’s free chatbot, and ChatGPT Plus, it's paid-for upgrade.
Gemini is more than an AI model, though, as it’s also the new name and identity for it's chatbot, previously known as Bard. Essentially, Google has simplified things by calling both the underlying model and chatbot itself Gemini. Furthermore, there’s a free Gemini app for Android now, and Gemini can replace Google Assistant on your Android phone if you wish. On iOS, Gemini is present within the Google app. There's also a free version of Google Gemini that is accessible via any internet browser.
On top of all this, Google has rebranded its Duet AI service, aimed at businesses, as Gemini for Workspace, with a whole bunch of productivity-related chops on offer.
The final twist is that as well as the basic (free) version of Gemini for consumers, there is also a subscription offering for the AI known as Gemini Advanced. This paid product is based on a more powerful LLM called Gemini Ultra, and those signed up to the Google One AI Premium subscription get extra benefits from using this model, including the ability to use Gemini Live on mobile, a voice-controlled AI experience for Android phones.
To sum up, all Google’s AI properties are now under the Gemini umbrella to simplify things, whether that’s AI for consumers or businesses, and whether accessing Gemini via the web, or the assistant or app on your smartphone.
What can Gemini do?
The short answer to this question is: a lot. But you probably want us to elaborate a bit more than that.
As we’ve just discussed, Gemini is an expansive umbrella for a whole lot of AI features and functionality delivered via different avenues. It has the same generative capabilities as other chatbots, like ChatGPT, so if you tell Gemini where you are going on your next trip, it will be able to help you pack. Or ask it to explain who Socrates was and sit back for a history lesson.
Gemini is a multimodal LLM. In other words, it can deal with various forms of input and output, including text, code, audio, images and videos. That gives it a lot of flexibility to perform a wide range of tasks. Upload an image (or take one with your smartphone) and Gemini can analyze the image and tell you things about it. Paste some code into the Gemini prompt box and ask it to rewrite it and Gemini can do that.
However, there are four separate LLMs that Google has implemented with its AI. They are Gemini Ultra, Pro, Flash and Nano. If you go to the Gemini website and ask the chatbot a question you'll be using Gemini Flash, which is Google's lightweight model, optimized for speed and efficiency. If you are a member of the paid tier via Gemini Advanced then you get access to Gemini Ultra, which is the largest model, and good for highly complex tasks, and Gemini Pro which Google describe as the "best model for general performance across a range of tasks". Finally, the Gemini Nano model is smaller and designed for on-device use. It's used on the new Pixel 9 phones, which have the Gemini Live feature, enabling a voice mode that's like having a real conversation with a human. All the different Gemini LLMs are currently at version 1.5.
And yes, it hasn’t escaped our attention that this simplification of moving everything under the Gemini brand comes with ironic complications and a level of confusion of its own (with Gemini this, that, and the other, as names for the various models and products).
So, what can Gemini actually do? It can answer questions, summarize text, write code, translate and create images (on mobile, but not in a browser running the free Gemini Flex 1.5). Google is developing its own answer to Midjourney in Imagen 3, and we'd imagine it won't be long before that is integrated into Gemini, too. Also, it plugs into other Google services: Gemini is hooked into the likes of Gmail, Google Maps, and YouTube. So, if you ask it for sightseeing recommendations, for example, it will flag them up in Google Maps.
There are some nifty abilities for sure, and those with Android smartphones get even more mileage for free with the Gemini app. As already mentioned, this can replace Google Assistant on an Android device, if you wish, running queries and working its AI magic via other Google services (Maps, search, even Assistant itself, which isn’t actually chucked off your phone as such, and still lurks underneath). But if you are a Gemini Advanced customer, then you get even more integration and the ability to handle much more complex tasks with your voice.
That choice may lead you to wonder: is Gemini a good replacement for Google Assistant? Well, we covered this extensively in our hands-on with Gemini on Android, and there are certainly drawbacks with the AI. Gemini is noticeably slower than Google Assistant, and the AI still has bugs. For instance, it’s able to interact with photos.
That said, the initial Gemini on Android experience was a lot wonkier than it is now. An example that springs to mind is that Gemini was supposed to deal with smart home gadgets, but failed in that task when we tried it initially - but more recently, this feature has come good and now works.
It seems that Google is ironing out the problems with Gemini on mobile pretty swiftly, which is heartening to see, and the results with the Gemini app can be impressive. There are still wrinkles to smooth over in terms of replacing Google Assistant on Android, but that should come in time.
Overall, then, the freebie version does give you a lot to get on with, especially for Android users. However, the paid take on Gemini is actually far more in-depth.
Gemini Ultra (the model powering the Gemini Advanced subscription product) brings a whole range of powerful abilities to the table, such as handling multi-step queries, and providing help with more complex tasks like coding. On a general level it’s also more accurate in terms of pinning down better and more organized answers to queries.
For those who use Google’s productivity apps and have a Google Workspace account, Gemini is available in Google Docs, Gmail, and more.
When was Gemini released?
Google Gemini was released on February 8, with Google confirming that it replaced Bard which was put out to pasture. Gemini was available immediately, in both its free form, and the paid version, Gemini Advanced. Google also began the rollout of the Android app in the US straight away.
The Gemini app is also available for Samsung phones, read our Samsung Galaxy Z Flip 6 review. While iOS availability lagged behind, Gemini is also available on Apple smartphones too (as part of the Google app, that is, in a more limited form compared to Android).
Is Google Gemini free?
The standard version of Google Gemini is free, but it’s more limited than the paid spin on the AI. As we’ve already discussed, the free Gemini AI is based on a simpler model (Gemini 1.5 Flash), whereas those who pay a subscription for Gemini Advanced get a lot more depth in terms of features and capabilities.
How much is Gemini Advanced? Google charges $19.99 (£18.99 / AU$32.99) per month, but you can try it out for a limited time for free, as a one-month trial is currently available. The subscription comes with other benefits, though, as Gemini Advanced is actually part of the Google One AI Premium Plan, which also provides 2TB of cloud storage among other extras.
Given that Google One with 2TB of storage already costs $9.99 / £7.99 / AU$12.49 per month, if you need a cloud storage locker, Gemini Advanced looks a more tempting value proposition.
How do I use Google Gemini?
The way you use Google Gemini depends on the version you’re interested in and the product it has been woven into.
On a PC or laptop, you can head to the Gemini website and use the AI there for free in the typical way you interact with an online chatbot. This will use the Gemini 1.5 Flash model. If you subscribe to Gemini Advanced, you get the full AI experience including sharper responses, more advanced LLMs for dealing with complex tasks and demanding creative needs, and the other benefits we’ve outlined above.
On mobile you can use the Gemini app on your Android phone (or replace Google Assistant with Gemini, as noted). On iOS, you can head to the Google app for Gemini functionality. If you are on a smartphone like the Gogole Pixel 9, and a subscriber to Gemini Advanced, you can use Gemini Live in the Gemini app, which enables you to interact with Gemini using your voice.
Gemini vs GPT-4: what’s the difference?
How does Gemini stack up against GPT-4 in the battle of the large language models?
For one thing, when Gemini was first revealed, Google claimed it’s more advanced than GPT-4. In a blog post, Google showed results from eight text-based benchmarks, with Gemini winning in seven of those tests. Across 10 multimodal benchmarks, Gemini came out on top in every one, according to Google at least.
That would seem to imply that Gemini is the superior system, but it’s not quite so straightforward, of course. GPT-4 came out in March 2023, and Gemini 1.5 Pro came out in February and Gemini 1.5 Flash came out in May 2024. So Gemini has again caught up to the rival AI tool. We don’t know how capable OpenAI’s next version of GPT will be, and there’s a world of nuances in this battle outside of Google’s own limited benchmarking, so it’s hard to say which is truly the better tool at the moment.
As well as that, Google only put its more advanced model, Gemini Ultra, up against GPT-4, and not Gemini Pro. Given the often-slim margins between GPT-4 and Gemini Ultra, it seems likely that OpenAI’s model comes out ahead of Gemini Pro.