A Coding Guide to Build a Multimodal Image Captioning App Using Salesforce BLIP Model, Streamlit, Ngrok, and Hugging Face

In this tutorial, we’ll learn how to build an interactive multimodal image-captioning application using Google’s Colab platform, Salesforce’s powerful BLIP model, and Streamlit for an intuitive web interface. Multimodal models, which combine image and text processing capabilities, have become increasingly important in AI applications, enabling tasks like image captioning, visual question answering, and more. This … Read more

Alibaba Qwen QwQ-32B: Scaled reinforcement learning showcase

The Qwen team at Alibaba has unveiled QwQ-32B, a 32 billion parameter AI model that demonstrates performance rivalling the much larger DeepSeek-R1. This breakthrough highlights the potential of scaling Reinforcement Learning (RL) on robust foundation models. The Qwen team have successfully integrated agent capabilities into the reasoning model, enabling it to think critically, utilise tools, … Read more

The ethics of AI and how they affect you

Having worked with AI since 2018, I’m watching its slow but steady pick-up alongside the unstructured bandwagon-jumping with considerable interest. Now that the initial fear has subsided somewhat about a robotic takeover, discussion about the ethics that will surround the integration of AI into everyday business structures has taken its place.   A whole new range … Read more

Building an Interactive Bilingual (Arabic and English) Chat Interface with Open Source Meraj-Mini by Arcee AI: Leveraging GPU Acceleration, PyTorch, Transformers, Accelerate, BitsAndBytes, and Gradio

In this tutorial, we implement a Bilingual Chat Assistant powered by Arcee’s Meraj-Mini model, which is deployed seamlessly on Google Colab using T4 GPU. This tutorial showcases the capabilities of open-source language models while providing a practical, hands-on experience in deploying state-of-the-art AI solutions within the constraints of free cloud resources. We’ll utilise a powerful … Read more

From Sparse Rewards to Precise Mastery: How DEMO3 is Revolutionizing Robotic Manipulation

Long-horizon robotic manipulation tasks are a serious challenge for reinforcement learning, caused mainly by sparse rewards, high-dimensional action-state spaces, and the challenge of designing useful reward functions. Conventional reinforcement learning is not well-suited to handle efficient exploration since the lack of feedback hinders learning optimal policies. This issue is significant in robotic control tasks of … Read more

Alibaba Researchers Introduce R1-Omni: An Application of Reinforcement Learning with Verifiable Reward (RLVR) to an Omni-Multimodal Large Language Model

Emotion recognition from video involves many nuanced challenges. Models that depend exclusively on either visual or audio signals often miss the intricate interplay between these modalities, leading to misinterpretations of emotional content. A key difficulty is reliably combining visual cues—such as facial expressions or body language—with auditory signals like tone or intonation. Many existing systems … Read more

A Practical Guide to Modern Airflow

Image by Author   Airflow was created to resolve the complexity of managing multiple pipelines and workflows. Before the invention of Airflow, many organizations depended on cron jobs, custom scripts, and other inefficient means when faced with big data generated by millions of users frequently. These solutions became hard to maintain, inflexible, and lacked visibility … Read more

Google AI Releases Gemma 3: Lightweight Multimodal Open Models for Efficient and On‑Device AI

In the field of artificial intelligence, two persistent challenges remain. Many advanced language models require significant computational resources, which limits their use by smaller organizations and individual developers. Additionally, even when these models are available, their latency and size often make them unsuitable for deployment on everyday devices such as laptops or smartphones. There is … Read more

Best 3 internal developer portals of 2025

What is an internal developer portal? An internal developer portal (IDP) is a centralised, self-service platform built in organisations to provide developers with everything they need to develop, deploy, and maintain software. Imagine it as a ‘one-stop shop’ where internal teams can access documentation, APIs, tools, services, best practices, and deployment pipelines. IDPs eliminate reliance … Read more