Web scraping wtf



  • this is a 2 fold wtf.

    1st:

    http://www.codefund.com/ 

     2nd:


      http://www.codefund.com/java-essentials/

    is a clone of this page:

    http://forum.java.sun.com/category.jspa?categoryID=5 

    except the UI is different, and eveyone's username from SDN has an 'a' appended to the end of it.

     

    WTF! 



  • The "lastest" time I saw something so pathetic, it was some dude's VB search engine...

    are are are!
     



  • I hereby resign my elected position at SpectateSwamp and hand it over to these guys.  They can randomly scrape pages and video them better than I ever could.



  • I just made codefund My Homepage.



  • @KattMan said:

    I hereby resign my elected position at SpectateSwamp and hand it over to these guys.  They can randomly scrape pages and video them better than I ever could.

    Yes. You are hereby relieved of your duties. Thanks for your service.



  • My favourite, from the CodeFund front page:

    "Google are our Idol, Do better than google are our aim...

    Made it Easy-to-use are our wish..."



  • @Mel said:

    My favourite, from the CodeFund front page:

    "Google are our Idol, Do better than google are our aim...

    Made it Easy-to-use are our wish..."

    Ahh, good old Engrish. Or is it Engrish only if it's from Japan? Either way, the domain's registered in Beijing, so go figure on scraping+cloning entire sites. <joke>We'd better all get lead-content tests now that we've viewed the copy. That white background looks suspicious... cheap white paint's made with lead.</joke>



  • Holy Cow. The search actually forwards TO GOOGLE!

    What is it with Chinese and only being able to copy other's ideas? This, the Vii, the PS3 knockoff (can't think of name right off hand). For a country that has nigh 1/4 the world's population, there are NO new ideas coming out of that country.

    /rant 



  • I was there a couple of months ago (China, not the web site, obviously) and it turns out it's the way they do business also -- see someone successful, then do EXACTLY the same thing he's doing. Preferably in the exact same location as well. Any western tourist who's taken a stroll on The Bund or East Nanjing Road knows exactly what I mean.



  • @MarcB said:

    @Mel said:

    My favourite, from the CodeFund front page:

    "Google are our Idol, Do better than google are our aim...

    Made it Easy-to-use are our wish..."

    Ahh, good old Engrish. Or is it Engrish only if it's from Japan? Either way, the domain's registered in Beijing, so go figure on scraping+cloning entire sites. <joke>We'd better all get lead-content tests now that we've viewed the copy. That white background looks suspicious... cheap white paint's made with lead.</joke>

    If they're from Sweden (as I gather from the OP), the word you're looking for is "Swinglish." 



  • Ohh, SDN, as in, the "S" stands for "Sun."

    Still, there's a new AT&T smart phone commercial with a dude in a polo who speaks fluent Swinglish. My Swedish friends find this hilarious, as it's how they say their parents try to speak when they're being nice to Americans. 



  • @JamesKilton said:

    Holy Cow. The search actually forwards TO GOOGLE!

    What is it with Chinese and only being able to copy other's ideas? This, the Vii, the PS3 knockoff (can't think of name right off hand). For a country that has nigh 1/4 the world's population, there are NO new ideas coming out of that country.

    /rant 

    They used up their allotment of new ideas about 500 years ago.



  • @JamesKilton said:

    Holy Cow. The search actually forwards TO GOOGLE!

    What is it with Chinese and only being able to copy other's ideas? This, the Vii, the PS3 knockoff (can't think of name right off hand). For a country that has nigh 1/4 the world's population, there are NO new ideas coming out of that country.

    /rant 

    There are plenty of imitators in the US too.  We're just much more efficient about suing the shit out of people who violate our IP over here, so those sorts of people tend not to get quite as far.

    It's not that the Chinese are less original.  Their government just doesn't do much about it.



  • @merreborn said:

    It's not that the Chinese are less original.  Their government just doesn't do much about it.

    Or perhaps more precisely, the Chinese government is really not interested in protecting the revenues of large US and multinational corporations, and sees no reason why they should spend their own money on stopping this. 



  • @clevershark said:

    I was there a couple of months ago (China, not the web site, obviously) and it turns out it's the way they do business also -- see someone successful, then do EXACTLY the same thing he's doing. Preferably in the exact same location as well. Any western tourist who's taken a stroll on The Bund or East Nanjing Road knows exactly what I mean.

    Oh God.  I was there a few months ago too.  It got to the point where I was ready to smack the next person that offered me a watch, bag, photo or any useless light- or noise-producing piece of junk.  My two favorite words in Chinese: "bu yao" ("don't want").  Next time I go I'm thinking about getting it printed on a t-shirt.

     

     



  • I like how when you follow the Linux link at the top of the page, the new title image shows "MsDner Linurx".  Uh, yeah, I love using those "Linurx" distros.

    It doesn't even work.  I tried searching for ubuntu and it led to the google page (of course) showing that there were no hits.  When I tried searching for ubuntu straight from google, it (of course) returned 33 million results.
     



  • @JamesKilton said:

    Holy Cow. The search actually forwards TO GOOGLE!

    It uses google to search it's own site. Which is not a terrible idea on itself. (Note the "sitesearch=codefund.com" in the titlebar when you search)

    Ofcourse, the rest of it... WTH? 



  • @cconroy said:

    My two favorite words in Chinese: "bu yao" ("don't want"). Next time I go I'm thinking about getting it printed on a t-shirt.

    Don't know Chinese myself (just Japanese, and only somewhat), but some quick searching suggests the phrase you're using is "不要" (which Babelfish translates from Chinese as "does not want"; in Japanese, basically means "unnecessary"). Because if you wanted to put it on a shirt, you'd obviously want to do it in a way that they'd be able to properly read.



  • @JamesKilton said:

    Holy Cow. The search actually forwards TO GOOGLE!

    What is it with Chinese and only being able to copy other's ideas? This, the Vii, the PS3 knockoff (can't think of name right off hand). For a country that has nigh 1/4 the world's population, there are NO new ideas coming out of that country.

    /rant 

     Because they're an emerging economy. America was exactly the same, and the equivalent of "made in taiwan" back in the 19th century. It's the growing pains of an economic boom. The market is expanding faster than the government can keep up with it.

     

    Back in the early 20th century, America was the bootleg capital of the world. And don't forget such "wonder drugs" as heroin, cocaine, morphine, etc, that were poorly tested and sold openly despite clear physiological effects.
     



  • money shot:

    http://www.codefund.com/execute/tq/

    endless wtf's



  • if you look at the latest question and submit a helpful answer (i submitted "because you touch yourself at night" in response to "why dont my sessions work anymore?"), it seems to have some kind of arithmetic captcha:

     Question:24 + 2 = <INPUT class=q maxLength=3 size=15 value=26 name=Qc> (Must)

    AMAZING!! Why did nobody else think of this? If there's one thing that distinguishes a human from a machine, it's the ability to do simple math, right?!



  • and the page not found message:

    "page no funded...."

    ...i'm hooked



  • @NerfTW said:

    Back in the early 20th century, America was the bootleg capital of the world. And don't forget such "wonder drugs" as heroin, cocaine, morphine, etc, that were poorly tested and sold openly despite clear physiological effects.
     

    Considering that most modern drugs have a list of side effects that includes "death," I'd say things haven't gotten much better.  In fact, those narcotics you mentioned are actually less addictive than the pain killers we use now.



  • @NerfTW said:

    @JamesKilton said:

    Holy Cow. The search actually forwards TO GOOGLE!

    What is it with Chinese and only being able to copy other's ideas? This, the Vii, the PS3 knockoff (can't think of name right off hand). For a country that has nigh 1/4 the world's population, there are NO new ideas coming out of that country.

    /rant 

     Because they're an emerging economy.

    China, not India. 



  • @petvirus said:


    http://www.codefund.com/java-essentials/

    is a clone of this page:  http://forum.java.sun.com/category.jspa?categoryID=5 

    except the UI is different, and eveyone's username from SDN has an 'a' appended to the end of it.

     WTF! 

    I started reading  some of the threads and came across this one:   http://www.codefund.com/43/how-to-use-serial-port-to-read-data-435073.shtm

     and as I was reading, I really expected it to turn into a "Please e-mail me the codes" post.

     


     



  • @savar said:

    money shot:

    http://www.codefund.com/execute/tq/

    endless wtf's

    Indeed. A quickie from their MSDNER (?): Array of class instances
     



  • @cconroy said:

    My two favorite words in Chinese: "bu yao" ("don't want").  Next time I go I'm thinking about getting it printed on a t-shirt.

     

     

    DO NOT WANT.



  • @Carnildo said:

    DO NOT WANT.

    Sounds like a command.  Don't be wantin or else I'll be illin you before I blast a cap in your ass.

    For those confused, I'm marking THIS post, not carn's as the lamest post of the week.   



  • It looks like these guys are actually crackers.  www.msdner.net shows the same Google rip off page.

     

    Later,



  • @clively said:

    It looks like these guys are actually crackers.

    Do they go as well with salt and ketchup as my hash? 



  • Personally, I like how if you click on a user's name you get the message "sorry, your not vip member..."



    Huh? Someone also pointed out that they expected "http://www.codefund.com/43/how-to-use-serial-port-to-read-data-435073.shtm" to turn into a "please email the codes" thread. It's actually even funnier - read the bottom! When "Codefund" scraped the thread, they missed changing the "This thread has been reported to Sun" reply with the real thread link. Booyah!



    Oh, and msdner.net (their .net version) is scraped from forums.msdn.com.



    The real WTF is of course these forums, where I have to enter every Line Break (<br>) tag myself by hand.



  • @Kyanar said:

    The real WTF is of course these forums, where I have to enter every Line Break (<br>) tag myself by hand.
     

    I found out quite by accident that shift-enter will make the forum editor do a simple line break, and not the complicated "you will start a new paragraph and you will like it, or else" dance. 



  •  @MarcB said:

    I found out quite by accident that shift-enter will make the forum editor do a simple line break, and not the complicated "you will start a new paragraph and you will like it, or else" dance. 

    It's not a dance. It's the normal behaviour for most wysiwyg editors. Return => P.



  • @dhromed said:

     @MarcB said:

    I found out quite by accident that shift-enter will make the forum editor do a simple line break, and not the complicated "you will start a new paragraph and you will like it, or else" dance. 

    It's not a dance. It's the normal behaviour for most wysiwyg editors. Return => P.

    That's the problem - I turned OFF the WYSIWYG editor! That's why I have to line break manually!



  • @Kyanar said:

    @dhromed said:

     @MarcB said:

    I found out quite by accident that shift-enter will make the forum editor do a simple line break, and not the complicated "you will start a new paragraph and you will like it, or else" dance. 

    It's not a dance. It's the normal behaviour for most wysiwyg editors. Return => P.

    That's the problem - I turned OFF the WYSIWYG editor! That's why I have to line break manually!

     


Log in to reply