Compare tools by category, pricing, use case, team size, and integrations in one place.
Showing 24+ tools
Google's most intelligent multimodal model: advanced reasoning, deep contextual understanding, autonomous agents, visual coding, and Deep Think mode. It performs particularly well
A locally hosted open-source personal AI assistant (Mac/Windows/Linux) with persistent memory and full system access. Control it via WhatsApp, Telegram, Discord, Slack, iMessage, o
A free cloud-based LaTeX workspace that directly integrates GPT-5.2 into scientific writing, with unlimited projects and collaborators. Instantly convert your whiteboard sketches i
Forecast the weather 8 times faster with this Google DeepMind AI model. Hourly resolution, hundreds of possible scenarios, cyclone forecasting, and informed decisions for weather a
This tool allows you to swap faces in high definition on your photos and videos, without adding any watermarks to the final result. Quickly create memes, creative content, or reali
Deepseek's open source LLM with 671 billion parameters specializing in mathematics and reasoning. This model is effective at processing long contexts and performs well in code, log
Reconstruct objects and humans in 3D from a simple photo: SAM 3D Objects creates detailed scenes and meshes, SAM 3D Body estimates complete body/hand/foot poses with SOTA accuracy,
Remove the background from your videos without a green screen or expensive equipment. Replace your background with an image, color, or video for instant professional results
Easily swap faces in photos, GIFs, or videos, with unlimited replacements and no watermarks. You can also generate creative images using AI for your montages
Create infinite interactive worlds that can be explored in real time with the Genie 3 model. It generates dynamic simulations with realistic physics from text or images. Currently
Give complex tasks to an agent that controls your browser, fills out forms, and pulls data from your tabs and sessions to deliver reliable and secure results
Create persistent 3D worlds from a prompt, image, or video, edit scenes in real time, and export meshes or videos for games, movies, or prototypes. Intuitive multimodal interface
An AI architecture with integrated long-term memory, selective updating based on a surprise signal, and context extended beyond 2 million tokens. MIRAS unifies transformers and lin
Simulate the buyer experience on your Shopify store with AI profiles that browse your pages like real customers. You get concrete feedback before rolling out theme changes or marke
Generate and edit your images directly in ChatGPT. This model is up to 4× faster and preserves essential details such as facial features. Benefit from better instruction tracking,
Run your GPU workloads in a confidential environment protected by Intel TDX (with cryptographic proof of integrity). Confidently deploy certified TDX images, encrypted storage, and
A 3D model that generates interactive 4D objects (cars with spinning wheels, automatic scripts) from textual instructions. Tokenization of 3D shapes, generation of shapes/scenes fr
Easily transcribe medical dictations and exchanges between doctors and patients. The system is based on a voice recognition model with 105 million parameters, trained on 5,000 hour
This large multimodal model combines text, vision, and interface interaction in a single system, enabling it to understand screenshots, videos, and documents. It can also reason in
Enjoy professional-level intelligence with extreme speed and efficiency (and at a reduced cost). This model uses 30% fewer tokens and is very good at agentic coding
An open-source AI image generator with 16 billion parameters that excels at displaying text. This model also supports image editing, style transfer, identity preservation, and mult
Control an AI agent that learns, reasons, and plays live in virtual 3D worlds: SIMA2 includes natural language instructions, emojis, transferable concepts, and improves on its own
Generate 1080p videos at 24 frames per second with a maximum duration of 15 seconds, featuring consistent multi-take voiceovers and native video lip-syncing. Videos can be generate
Create cinematic videos in 1080p with natively synchronized audio from text or images. With sophisticated camera movements (long tracking shots, Hitchcock-style zooms), precise lip