this post was submitted on 05 Jun 2024
221 points (100.0% liked)

Technology


New accessibility feature coming to Firefox, an "AI powered" alt-text generator.


"Starting in Firefox 130, we will automatically generate an alt text and let the user validate it. So every time an image is added, we get an array of pixels we pass to the ML engine and a few seconds after, we get a string corresponding to a description of this image (see the code).

...

Our alt text generator is far from perfect, but we want to take an iterative approach and improve it in the open.

...

We are currently working on improving the image-to-text datasets and model with what we’ve described in this blog post..."

[–] brie@beehaw.org 41 points 5 months ago (4 children)

On the one hand, having AI-generated alt-text on the client side would be much better than not having any alt-text at all. On the other hand, the pessimist in me thinks that if it becomes widely available, website makers will feel less of a need to add proper alt-text to their content.

[–] smeg@feddit.uk 26 points 5 months ago

A more optimistic way of looking at it is that this tool makes people more interested in alt-text in general, meaning more tools are developed to make use of it, meaning more web devs bother with it in the first place (either using this tool or manually).

[–] FaceDeer@fedia.io 7 points 5 months ago (1 children)

If they feel less need to add proper alt-text because people's browsers are doing a better job anyway, I don't see why that's a problem. The end result is better alt text.

[–] kbal@fedia.io 9 points 5 months ago* (last edited 5 months ago) (2 children)

I don't think they're likely to do a better job than humans any time soon. We can hope that it won't be extremely misleading too often.

[–] ahal@lemmy.ca 5 points 5 months ago (1 children)

I dunno, I suspect most human alt texts are vague and nondescriptive. I'm sure a human trying their hardest could outwrite an AI alt text, but I'd be pretty shocked if AIs weren't already better than the average alt text.

[–] averyminya@beehaw.org 2 points 5 months ago

Alt text: It's for SEO, isn't it?

  • Marketing
[–] Ilandar@aussie.zone 2 points 5 months ago* (last edited 5 months ago)

I don’t think they’re likely to do a better job than humans any time soon.

Sure, assuming the human is actually putting effort into the task. But we know that able-bodied society is generally, at best, dismissive of the needs of the disabled and, at worst, discriminatory. I very much doubt that the majority of fully sighted humans working in this area are taking the time required to view the problem from the point of view of the visually impaired minority and then putting in the effort required to deliver the best possible solution for them. Not every website is run by some massive company with employees specifically dedicated to this task. For many it will be an afterthought, and that's where AI descriptions will shit all over the lazy human ones. Additionally, alt text contributes to SEO, which means many will be tailoring it to their search ranking instead of the needs of the user.

[–] lud@lemm.ee 4 points 5 months ago

True, but if it genuinely works really well then does it really matter? Seems like the change would be a net positive.

[–] ryannathans@aussie.zone 1 points 5 months ago

Sounds like proton and linux gaming