I found a bug...
I believe I've found a bug in Site Import.
When doing imports for some sites that were poorly designed, Site Import runs into issues if certain naming conventions aren't followed.
Case in point: www.whitehurst-ma.com
The previous designer of this site used spaces in the names of the pages. The home page has a link to a single page "the whitehurst advantage.htm", which shows in the browser's URL as "http://www.whitehurst-ma.com/the%20whitehurst%20advantage.htm". When this happens, Site Import is only able to produce this URL: "http://www.whitehurst-ma.com/advantage.htm", which promptly receives a 404 error, page not found.
I know many might not consider this a bug, but when dealing with websites of all kinds and of various degrees of coding quality, it might be nice to at the very least have an option to turn off or on whether Site Import should truncate the links if they contain spaces. I can see situations where it would be nice to have it turned on, but in this particular case, it would be nice to be able to turn it off so that I didn't have to spend 30 minutes to one hour downloading all the pages, images and other media manually. It completely defeats the purpose of having the product when in most cases we are updating poorly designed sites in the first place, so situations like this come up quite often.
Any chance WebAssist can look into making a small fix and upgrade to SiteImport to resolve this particular issue?

 







 
     
     
    







