Many blind and low vision (BLV) people are excluded from professional roles that may involve visual tasks due to access barriers and persisting stigmas. Advancing generative AI systems can support BLV people through providing contextual and personalized visual descriptions for creation, critique, and consumption. In this workshop paper, we provide design suggestions for how visual descriptions can be better contextualized for multiple professional tasks. We conclude by discussing how these designs can improve autonomy, inclusion, and skill development over time.
Creation, Critique, and Consumption: Exploring Generative AI Descriptions for Supporting Blind and Low Vision Professionals with Visual Tasks
- Paper ID: 2510.08991
- Title: Creation, Critique, and Consumption: Exploring Generative AI Descriptions for Supporting Blind and Low Vision Professionals with Visual Tasks
- Authors: Lucy Jiang, Lotus Zhang, Leah Findlater (University of Washington)
- Classification: cs.HC (Human-Computer Interaction)
- Publication Time/Venue: ASSETS '25 Workshop: AT @ Work, Virtual 2025
- Paper Link: https://arxiv.org/abs/2510.08991
Many blind and low vision (BLV) individuals are excluded from professional roles that may involve visual tasks due to accessibility barriers and persistent bias. Advanced generative AI systems can support BLV individuals by providing contextualized and personalized visual descriptions for creation, critique, and consumption. In this workshop paper, the authors provide design recommendations for better delivering contextualized visual descriptions across multiple professional tasks and discuss how these designs can improve autonomy, inclusivity, and skill development over time.
- Severe Employment Gap: The employment rate of people with disabilities is approximately one-third that of non-disabled individuals, with BLV populations facing particularly acute employment barriers
- Visual Tasks as Workplace Barriers: Modern workplaces involve numerous visually-communicative tasks (such as creating presentations, formatting documents, taking photographs, watching training videos, etc.) that pose significant obstacles for BLV professionals
- Limitations of Traditional Assistive Technology: Existing accessibility solutions are primarily limited to providing basic visual information access rather than enabling full workplace participation
- Rapid advances in generative AI technology create new opportunities for providing contextualized, personalized visual descriptions
- Need to move beyond basic information access to support comprehensive participation of BLV professionals in visual communication tasks
- Technological innovation can break down employment barriers and enhance workplace inclusivity for BLV populations
- Proposes a Design Framework for Specialized Visual Description Systems: Providing contextualized and personalized AI description services tailored to different professional scenarios
- Constructs Two Concrete Application Scenarios: Video production for independent content creators and marketing material creation for large advertising agencies
- Provides Systematic Design Recommendations: Covering visual task support across three dimensions—creation, critique, and consumption
- Articulates Long-term Impact Mechanisms: Analyzing how these designs can improve autonomy, inclusivity, and skill development for BLV professionals
This research focuses on designing generative AI visual description systems to support BLV professionals, encompassing three core task dimensions:
- Creation: Assisting BLV individuals in creating visual content
- Critique: Supporting evaluation and feedback on visual work
- Consumption: Facilitating understanding and processing of visual information
Core Needs Analysis:
- Difficulty identifying visual trends
- Challenges in shot composition and subject positioning
- Need for visual effect verification in post-production editing
AI Description System Design:
- Trend Identification Support: Describing common visual accompaniments to popular audio tracks (gestures, on-screen text, etc.)
- Shooting Process Assistance:
- Ensuring optimal positioning of shooting subjects within the frame
- Providing detailed content descriptions to assist artistic composition
- Editing Process Enhancement:
- Describing video color temperature
- Assessing accuracy of filters and special effects
- Providing artistic information beyond content editing
Core Challenges:
- Complexity of collaborative workflows
- Multi-format content production requirements
- Rapid iteration and real-time collaboration demands
- Strict brand guideline compliance
AI Description System Design:
- Brand Consistency Support:
- Precise brand guideline descriptions
- Accurate color descriptions ensuring brand representation
- Team Collaboration Enhancement:
- High-level overview descriptions (overall visual appearance)
- Object-level descriptions (e.g., sticky note groupings)
- Collaborator cursor position tracking (as visual focus proxy)
- Context-Aware Descriptions: Customizing description content and detail levels based on specific professional task requirements
- Multi-Level Information Architecture: Providing hierarchical visual information from macro to micro perspectives
- Real-Time Collaboration Support: Integrating dynamic visual feedback into team workflows
- Personalized Adaptation: Adjusting description strategies based on user roles and task types
Note: This is a workshop paper that primarily provides design recommendations and conceptual frameworks, without traditional experimental setup and results.
- Analysis based on existing literature regarding challenges faced by BLV content creators
- Reference to existing research on visual editing assistance systems (e.g., Huh et al.'s text-video editing system)
- Integration of related work on digital graphic creation accessibility
- Verifying problem prevalence through literature review
- Analyzing design requirements based on limitations of existing systems
- Drawing design inspiration from successful cases in related fields
- Chang et al.'s EditScribe: Using natural language verification loops to support non-visual image editing for BLV individuals
- Huh et al.'s AVScript: Text-based video editing system integrating visual descriptions and speech
- Zhang et al.'s A11yboard: Research on digital whiteboard accessibility
- Social Media Platform Engagement: BLV creators' daily life sharing and creative economy participation on video platforms
- Accessibility Barrier Research: Difficulties in creating visually appealing content, filter function verification issues, trend-tracking challenges
- Real-Time Collaboration Tools: Improvements in mixed-ability collaboration for text editors and presentation software
- Collaborative Environment Accessibility: Accessibility of visually-oriented collaborative activities (wireframing, whiteboard discussions)
- Redefining Visual Literacy: BLV individuals possess deep visual understanding capabilities; technology should support and enhance rather than assume deficiency
- Systematic Improvement of Workplace Inclusivity: Technological innovation can progressively reduce bias and improve autonomy, inclusivity, and skill development for BLV populations
- Importance of Personalized Descriptions: Different professional scenarios require customized visual description strategies
Drawing on Georgina Kleege's observation: "On average, a completely blind person born blind understands what vision means far better than the average sighted person understands what blindness means."
Expected Effects:
- Enhanced Autonomy: Reducing dependence on others' assistance
- Improved Inclusivity: Promoting more inclusive design practices and workplace culture
- Skill Development: Supporting BLV professionals in demonstrating creative capabilities
- Strong Problem Orientation: Directly addresses core barriers to BLV workplace participation
- Innovative Design Approach: Proposes the concept of contextualized, personalized AI description systems
- High Practical Value: Provides concrete, actionable design recommendations
- Solid Theoretical Foundation: Comprehensive literature references with thorough argumentation
- Significant Social Value: Addresses workplace equality rights for marginalized populations
- Lack of Empirical Validation: As a conceptual paper, it lacks user research and system evaluation
- Insufficient Technical Implementation Details: Limited description of specific technical architecture of AI systems
- Missing Scalability Analysis: Insufficient discussion of design recommendation applicability to other professional scenarios
- Absent Cost-Benefit Analysis: Does not consider practical costs of system development and deployment
- Academic Contribution: Provides new design perspectives for accessibility technology research
- Practical Guidance: Offers specific design guidance for relevant technology developers
- Policy Implications: May influence workplace accessibility policy development
- Social Value: Promotes societal reassessment of BLV individuals' professional capabilities
- Content Creation Industries: Video production, graphic design, marketing creativity, and related fields
- Collaborative Work Environments: Team work scenarios requiring real-time visual collaboration
- Education and Training: Visual skills training and professional development support
- Technology Development: AI-assisted tools and accessible technology product development
- User Research: In-depth understanding of specific needs of BLV professionals across different occupations
- Technical Implementation: Developing prototype systems and verifying technical feasibility
- Effectiveness Evaluation: Designing evaluation metrics to verify system impact on user work efficiency and satisfaction
- Cross-Domain Extension: Exploring applicability of design principles to other professional fields
- Ethical Considerations: Investigating potential bias and privacy issues arising from AI description systems
Summary: This paper proposes an important and forward-looking research direction, providing better workplace support for BLV professionals through generative AI technology. While lacking empirical validation as a conceptual study, its design approach and social value merit further in-depth research and practical application exploration.