Split article produces lots of junk
From: Jim K.
Sent: May 10, 2012
Subject: Automatic removal junk elements
Repeatedly, when an article is split, there are numerous junk elements created which need to be deleted or dismissed. For example, when a Wikipedia article is split, I often find elements containing only ": [edit section)> ]" as its entire contents. There are also often References sections to be kept but dismissed. This is so time-consuming that it sometimes takes up most of my learning time. Bulk-search for such junk elements still need to be checked before deleted, which also takes much time. An automatic function for deleting or dismissing such elements would solve this significant time-wasting problem.
- future version will have a bit smarter empty page removal
- the problem is with HTML code that is very difficult to judge for its usability, it is better to let user make deletes that to make an error in judgement (e.g. for example, check for edit text could be added as many other similar checks, but those are always dangerous because they could delete texts that other users might wish to keep)
- SuperMemo is trying to speed up all text processing operations, this often involves removing multiple checks for rare cases that take lots of processing time on long texts
- splitting articles into tiny pieces without reading isn't always the best strategy, esp. if it turns out to be "more processing than learning". If a split function produces lots of elements and lots of work without actually contributing to your knowledge, you may need to review your strategy!
If you find universal junk texts that should be removed with a 100% warranty nobody will be hurt/annoyed by the removal, please list them here. Please use HTML source code to provide for best identification of frequently occuring empty splits.
The list of "junk texts" could easily be customized to suit an individual user. However, this falls into highly advanced functions that very few people are likely to ever use.