AI
Explore practical insights, research, and benchmarks on artificial intelligence, including generative AI, large language models, RAG, governance frameworks, MLOps practices, and AI hardware. Gain an understanding of key tools, implementation strategies, and enterprise use cases shaping the AI landscape.
Top 20 Supply Chain AI Tools with Examples
From demand forecasting and inventory optimization to last-mile delivery and supplier negotiations, AI enables supply chain companies to process complex data, respond to disruptions more quickly, and make more informed decisions across global networks.
Top 8 Drug Discovery Software
The drug discovery software market divides into three categories: computational chemistry suites for structure-based design, AI-native platforms for generative chemistry and target identification, and R&D data management systems for ELN, LIMS, synthesis tracking, data analysis, and compound registration. We compared the top 8 drug discovery platforms across features, pricing, and deployment models.
Top 10 Voice Bots: Bland AI, ElevenLabs & PolyAI
A voice bot or voice AI agent listens to the caller, uses speech recognition to convert spoken words into text, applies natural language processing and natural language understanding to identify customer intent, and then returns an answer via text-to-speech.
AI Coding Benchmark: Claude Code vs Cursor
In AI coding, the market has fragmented into two categories: Agentic CLI tools and AI code editors embedded in IDEs. Each claims to automate development. Few comparisons show how they differ under identical workloads.
Embedding Models: OpenAI vs Gemini vs Voyage
We benchmarked 15 English text-embedding models and a BM25 baseline on over 500 manually curated queries across three retrieval domains: legal contracts (CUAD), customer support (IBM TechQA), and healthcare (MedRAG PubMed). Voyage-3.5 ranks first overall. Perplexity Embed V1 0.6b reaches the upper-mid tier at the lowest price point in our benchmark.
Generative AI Ethics: How to Manage Them
Generative AI raises important concerns about how knowledge is shared and trusted. Britannica, for instance, filed a lawsuit against Perplexity, alleging that the company illegally and knowingly copied Britannica’s human-verified content and misused its trademarks without permission. Explore what generative AI ethics concerns are and best practices for managing them. 1.
Audience Simulation: Can LLMs Predict Human Behavior?
In marketing, evaluating how accurately LLMs predict human behavior is crucial for assessing their effectiveness in anticipating audience needs and recognizing the risks of misalignment, ineffective communication, or unintended influence.
Open Source Embedding Models Benchmark for RAG
We benchmarked 14 open-source embedding models, self-hosted on a single H100, across 500+ manually curated retrieval queries spanning legal contracts, customer support tech notes, and medical abstracts. NVIDIA Llama-Embed-Nemotron-8B leads in accuracy. On cost, Google’s EmbeddingGemma-300m runs roughly 4x cheaper than Nemotron at the cost of a small accuracy loss.
LLM Inference Engines: vLLM vs LMDeploy vs SGLang
We benchmarked 3 leading LLM inference engines on NVIDIA H100: vLLM, LMDeploy, and SGLang. Each engine processed identical workloads: 1,000 ShareGPT prompts using Llama 3.1 8B-Instruct to isolate the true performance impact of their architectural choices and optimization strategies.
LCMs: From LLM Tokenization to Concept-level Representation
Large concept models (LCMs), as introduced by Meta in their work on “Large Concept Models,” represent a fundamental shift away from token-based prediction toward concept-level representation.