Taking an alpha-level app to beta with the voice of the user
AI Model Creation Tool Audit
INTRODUCTION
Testing with our users to elevate our app
Shortly after joining WebAI, I proposed a research effort to audit the current state of the app. WebAI Navigator was an application built to let low/no-code users build their own custom AI models. The plan was to walk users through the app, give them a series of tasks based around its core flow, collect feedback, and see where they tripped up while trying to build their first model.





TEAM & ROLES
Elevating an app is a team sport
CURRENT STATE
A no-code AI building tool that transforms ideas into models
When I joined the company, its main product, Navigator, was still in alpha but preparing for a limited beta release. Navigator was designed to let people with no machine learning background create their own custom AI models. To this point, the application had been developed entirely in-house, without any user testing or market feedback.
THE PROBLEM
Helping our users make their first AI model
After onboarding with the team at WebAI, I spoke with our head of product, who highlighted that while the app was getting a fair number of downloads and testers, few users were able to successfully build their first AI model. Our team was tasked with figuring out why users weren't able to get a project set up or get an AI model running.
THE SCRIPT
Getting to know our users and asking the right questions
Before testing, I began preparations by drafting a testing script. The script opened with core demographic questions (location, age, professional title, commonly used tools, AI familiarity, interest in WebAI), and from there I prompted users through the flow from the login screen all the way to getting a model up and running.
To evaluate the app's intuitive design, I intentionally kept the usability prompts open-ended. The script was also structured to allow organic exploration, enabling participants to investigate areas they found particularly interesting, confusing, or delightful. Once I received approval for the testing script, I coordinated with our Growth Product Manager to gather participants for our sessions.
WHO WE SPOKE WITH
Picking the right users
During the script approval process, I collaborated with our Growth Product Manager to define our ideal set of participants. Since engineers were the company's primary target user base, we prioritized participants with an engineering background.
However, I advocated for including one or two non-technical participants to capture an outside perspective on the app. This led us to recruit a group of six participants for our testing sessions.
PARTICIPANTS

THE SESSIONS
Structure and freedom to guide our sessions
Over the course of one week, I conducted 6 interview sessions. We used the latest alpha build of the app to provide participants with the most authentic experience possible. This also had the added benefit of dramatically expediting our timeline and eliminating the need for any prototype work.
We led users through the approved script, recording each session and allowing participants to deviate at appropriate points. Session lengths varied from one to two hours, depending on the depth of participant feedback and their time availability.
SYNTHESIS
Six sessions, forty-nine insights
Following the user interviews, each session was meticulously documented within our research platform. Key takeaways were highlighted and tagged, facilitating efficient analysis. This process yielded 456 distinct highlights, which were then categorized into 49 key insights. Using these insights as a basis, I wrote out 30 actionable design recommendations for the team to consider.
The insights and their corresponding recommendations were then presented to the co-founders, product team, and key engineering members for discussion and feedback before we discussed next steps.
DESIGN RECOMMENDATIONS
Below is a selection from the 30 design recommendations I delivered:
OTHER TOOLS
The daily toolkit of our users
As part of our opening demographic questions, we asked participants which tools they use day to day. Their answers are collected below.
APPLICATIONS
Visual Studio Code
PyCharm
NX Unigraphics
OneNote
Miro
Microsoft Office Suite
Google Workspace (G Suite)
SolidWorks
AutoCAD
Figma
MDM Workbench
Eclipse
Docker
Jupyter Notebook
CODING LANGUAGES
Python
C
C++
DEVELOPMENT LIBRARIES
OpenCV
PyTorch
MDM Workbench
Django
AI TOOLS
Rivet
MLflow
Google Gemini
ChatGPT
Anthropic Claude
Stable Diffusion
Perplexity AI
Mistral AI
Runway
LM Studio
V7
YOLO
SERVICES
Amazon Web Services (AWS)
FAN AI
GitHub
OpenAI APIs
Anthropic APIs
Mistral AI APIs
OPERATING SYSTEMS
Apple macOS
Microsoft Windows
Linux
RESULTS
Insights that moved the needle
After presenting our research to the executive team, a full template feature was prioritized as the focus of our next release.
With templates incorporated into the application, we saw the rate of successfully built projects jump from 7% to over 80%.
The team implemented several quick improvements to the application, including removing Metamask from the login screen, eliminating unnecessary sounds, and automatically displaying results when processes complete.
NEXT STEPS
Building a research driven roadmap
The team also highlighted organization as another point of confusion for users. We ran two separate card sorting exercises, one around template organization and one around elements, to make sure both more accurately mapped to our users' understanding.
Templates significantly improved project success rates, but users still faced challenges when building projects from scratch. In response, the executive team prioritized research into developing a more robust onboarding process and exploring design patterns beyond the canvas interface for AI model building.