Posts

#llm #research

How Smart Is AI Compared to Humans? A New Study Puts It to the Test

Oct 15, 2024

A recent study compares generative AI models to human cognitive benchmarks, revealing both strengths and significant weaknesses in AI's intellectual abilities.

#embodiedai #agent

A New Benchmark for Embodied AI: Evaluating LLMs in Decision Making

Oct 14, 2024

New benchmark unifies how we evaluate language models for decision-making in embodied environments, revealing strengths and areas for improvement.

#automation #research

Human-Like Automation Framework for Computer Tasks

Oct 12, 2024

Agent S enables computers to autonomously handle complex tasks in a human-like way, improving efficiency, adaptability, and accessibility for a wide range of GUI interactions.

#agent #development

The Rise of Proactive AI Assistants Enhancing Programmer Productivity

Oct 11, 2024

How proactive AI assistants could reshape programming workflows with increased productivity and smarter collaboration.

#research #agent

Autonomous Digital Agents Are Getting Smarter: A New Method for Evaluation and Refinement

Oct 11, 2024

New research showcases a powerful automated approach to evaluating and improving digital agents, enhancing their capabilities significantly.

#llm #embodiedai

The Intersection of Embodied AI and LLMs: Unveiling New Security Threats

Oct 10, 2024

As LLMs are fine-tuned for embodied AI systems such as autonomous vehicles and robots, new security risks emerge. A framework identifies backdoor attacks that reach success rates of up to 100%, posing significant threats to the safety of these systems.

#llm #research

How Generative AI is Revolutionizing Data Analysis

Oct 9, 2024

AI is making data analysis accessible and efficient, helping anyone perform complex tasks without technical skills. It automates processes, assists in analysis, and ensures reliability.

#development #llm

AI Unlocks Smarter Metrics for Software Teams

Oct 8, 2024

GEMS uses an LLM to generate custom metrics that help identify expertise within software teams, fostering better collaboration and problem-solving.

Why GenAI Will Transform Tasks, But Keep People at the Core

Sep 29, 2024

Indeed's insights on AI and the future of work, covering AI innovations, human-intent recognition, global AI growth, and the rise of industrial robots.

#llm #prompt

Improving AI Reasoning with Program Tracing

Sep 29, 2024

Program Trace Prompting improves AI reasoning by structuring steps like Python code, making them easier to observe, analyze, and debug, while ensuring logical accuracy.

#visual #memory

Enhancing AI Summaries with Visual Workspaces

Sep 28, 2024

A new method uses visual workspaces to help AI create more accurate summaries by letting humans organize data visually before the AI steps in.

#agent #hci

Teaching Robots to Infer Human Intent

Sep 27, 2024

FISER helps robots understand ambiguous instructions by reasoning about human intentions and actions, improving their ability to assist in real-world tasks.
