Technology

37737 readers

711 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

rs5th@beehaw.org

coldredlight@beehaw.org

Los@beehaw.org

SemioticStandard@beehaw.org

TheRtRevKaiser@kbin.social

remington@beehaw.org

121

Astronomers discover technique to spot AI fakes using galaxy-measurement tools (arstechnica.com)

submitted 4 months ago by sabreW4K3@lazysoci.al to c/technology@beehaw.org

25 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] jarfil@beehaw.org 1 points 4 months ago

breaking things down for an audience understanding neither the technical nor artistic aspect...

Not a reason to misrepresent things. Reminds me of the animistic fallacy, if they even understand what's really going on themselves.

As for text, I've seen the MS generator spit out decent text, at least in titles and logos, and some AI art with full legible sentences.

Unless you start off training by feeding the model 3d data (say, voxels) alongside 2d projections

Some time ago already, there was an SD fork with bounded box support, and a ChatGPT preprocessor prompt template to do the layout. Object permanence in this case is as simple as continuing with the lower layer once the upper one is finished, maintaining object continuity in the lower layer. It's reasonable to expect this to go from bounded boxes, to freehand layers for each object. Since an LLM has been shown to be a good preprocessor to set the layout, some more integration between both, with object feedback from the SD to reduce the layer bounding box, would do wonders. Adding an opacity mask could be a bit harder, but sounds doable.

I don't see the need of much higher abstraction to address this issue. Rendering videos of translucent objects, might need it, though.