b) will remove the link to /test.html#top which is usually not desirable
How about values encoded in the URL (ie. /test.html?myvar=1)? As long as it wasn't navigation related (eg. /test.html?page=1), or needed to remain in the URL between pages for some reason, you might want to check for those too.
Now we're getting into regular expression country.久久中文字幕免费高清,三级在线看中文字幕完整版,中文字幕偷乱视频在线