All of abstractapplic's Comments + Replies

For the unreleased challenge, b) isn't for sale: making something intended to (eventually) be played by humans on LW and then using it solely as LLM-fodder would just be too sad. And I'm guessing you wouldn't want a) without b); if so, so much for that.

. . . if the "it must never be released to the public internet" constraint really is that stringent, I might be better advised to make D&D.Sci-style puzzles specifically for your purposes. The following questions then become relevant:

.How closely am I allowed to copy existing work? (This gets easier the ...

Beth Barnes
1. Even if it has already been published, we're still interested, especially in ones that were only published fairly recently, and/or only have the description of the puzzle rather than the walkthrough online, and/or have only a few copies of the solutions online rather than e.g. 20 public repos with different people's solutions.
2. I think we'd be super interested in you making custom ones! In terms of similarity level, I think it would be something like "it's not way easier for a human to solve it given solutions to similar things they can find online".
3. I imagine we'd be interested in at least 10, as long as they don't all have the same trick or something, and maybe more like 50 if they're pretty diverse? (But I think we'd be at more like $1000 per marginal task at those sorts of numbers.)
4. I don't expect there to be a hard deadline; I expect we'll still want more of these for the next year or two at least. Sooner is better; next week or so would be awesome.

This seems like a natural fit for D&D.Sci games. All the ones I made are public domain, so you can use them freely (and I bet the other people who made some would give you permission if you asked them nicely), they've been publicly played by clever humans with a variety of skill levels and associated outcomes, and they're obscure enough that I doubt an LLM would have memorized the solutions (and if not you could tweak the names and data-generation hyperparameters to flatfoot them).

. . . I happen to have a completed-but-unreleased D&D.Sci game, whic...

Beth Barnes
Interesting! How much would we have to pay you to (a) put it into the task format and document it, etc., as described above, and (b) not publish it anywhere it might make it into training data?