I played around with handling XML documents in java (I generally use java when I want GUIs) and worked up a simple little XML editor and started manually copying and pasting puzzles into an XML document (and replacing images with text art). This turns out to be a very slow process. I got to around thread 1700 of 15000 (580 total puzzles in the file) before deciding there was a better way to approach it (though it was interesting to see all the unsolved puzzles and old puzzles I had forgotten about but liked). Not only this, but I wasn't recording who posted the puzzles, who gave the solutions, when it was posted, the brainden topic number (to easily find the topic on brainden later), etc.
I'm thinking I'll write a bash script to rip the first posts of all topics to a file, then manually prune repeats and other undesirables (should be much faster since I'm not loading web pages, and I won't be loading topic numbers that reach an error page), then manually go through each of them and find the posted solutions and add text art or descriptions in place of images.
I think I'll get at least the following pieces of information using the bash script:
-post date
-topic number
-who posted it
-title (if it exists)
-subforum it came from (ignoring the whole miscellaneous subforum)
-puzzle description (ie, the actual puzzle)
I'll do the following manually (after pruning undesirables):
-add solution (and person who posted it) or mark unsolved
-add tags to tell what kind of puzzle it is
-add Title (if it doesn't have one I'll make one up)
-replace images with text art or descriptions
-fix formatting
Here are some of the tags I'm thinking of using:
As for the viewer/editor, I'll plan to add in the following features:
-Export to text document (puzzles with their solutions or all puzzles first solutions after)
-Import file into current list
-Easily edit anything
-Can add in fields or tags and name them
-Tags as checkboxes
-Separate tab for each field (bigger text areas and don't need to see solution unless you click its tab)
-Ability to list by requested tag(s) and/or rank
Here are the questions I have:
Is there a better way to go about it than this approach?
Am I missing something that could easily be ripped using a script?
Should I not include some of the information listed above or include something else?
Did I miss any tags that would be good to use or do I have a useless one?
What other features should the new editor have?
Any other thoughts about this project?
Do you think this would be useful, or is it a waste of time?
Would anyone be interested in helping once I get to a point where I can divide workload?







