-
Where is WoW's quest information stored?
I'm building a database of quest information, is there a localization file somewhere I can parse? Or should I just scrape wowhead?
ie: quest text, experience/gold earned, etc.
-
Member
afaik wowhead gets its information from an addon/program and it gets uploaded to their site. Not that this is much of a help but just stating.
Wowhead Client - Wowhead
-
Almost Legendary User
I would recommend scraping wowhead. This way you can make sure that you have old quests that may have been deleted from the game but are still available on wowhead. However I'm not sure if wowhead keeps these, just a theory. Like for example old naxx quests.
Also, I'm not sure how the quests inside the game files would be laid out. If u decide to scrape... Have fun with the regex! ^^ If ur gonna scrape via url searching that is. :P
I really don't have anything interesting to put here anymore.
-
Post Thanks / Like - 1 Thanks
Vengfull (1 members gave Thanks to Thomja for this useful post)
-
Originally Posted by
Thomja
I would recommend scraping wowhead. This way you can make sure that you have old quests that may have been deleted from the game but are still available on wowhead. However I'm not sure if wowhead keeps these, just a theory. Like for example old naxx quests.
Also, I'm not sure how the quests inside the game files would be laid out. If u decide to scrape... Have fun with the regex! ^^ If ur gonna scrape via url searching that is. :P
haha I built an entire wowhead class file for scraping, takes no time at all to do it.
I checked the dbc files for anything, all it had was questID's. I would love to try and find the quest gossip and stats somewhere.
-
Almost Legendary User
Originally Posted by
SniffingPickles
haha I built an entire wowhead class file for scraping, takes no time at all to do it.
I checked the dbc files for anything, all it had was questID's. I would love to try and find the quest gossip and stats somewhere.
I see, maybe I just suck at regex? Atleast I think it's fun ^^ When it works that is...
Out of interest, what software do you use to look through the dbc files?
I really don't have anything interesting to put here anymore.
-
Originally Posted by
Thomja
I see, maybe I just suck at regex? Atleast I think it's fun ^^ When it works that is...
Out of interest, what software do you use to look through the dbc files?
I use xpath, not so much regex, for scraping.
I don' t know the software name because i only downloaded it last night, i can link you when i get home tonight
-
Originally Posted by
Thomja
I see, maybe I just suck at regex? Atleast I think it's fun ^^ When it works that is...
Out of interest, what software do you use to look through the dbc files?
Here you go, MyDBCEditor
mydbceditor
-
Almost Legendary User
Originally Posted by
SniffingPickles
I use xpath, not so much regex, for scraping.
I don' t know the software name because i only downloaded it last night, i can link you when i get home tonight
Haven't heard of xpath, but then again I have only done very very little in php. Maybe another time!
Originally Posted by
SniffingPickles
Thanks!
I really don't have anything interesting to put here anymore.
-
Originally Posted by
Thomja
Haven't heard of xpath, but then again I have only done very very little in php. Maybe another time!
ha, xpath is used a TON in C#/Java as well as a strong presence in Javascript
Basically it maps a page out in XML for selecting data within a specific node
for example, the xpath for your quoting me would read:
Code:
/html[@id='vbulletin_html']/body/div[@id='pageWrapper']/div[@class='pageWrap']/div[@class='pageWrapBody']/div[@class='above_body']/div[@id='contentMain']/div[@class='contentBody']/div[@class='body_wrap']/div[@id='postlist']/div[@class='forumbitBody']/ol[@id='posts']/li[@id='post_3335725']/div[@class='postdetails']/div[@class='postbody']/div[@class='postrow has_after_content']/div[@class='content']/div[@id='post_message_3335725']/blockquote[@class='postcontent restore']/div[@class='bbcode_container'][1]/div[@class='bbcode_quote']/div[@class='quote_container']/div[@class='message']
there are a ton of great chrome extensions for finding it (even chrome's DOM editor has xpath generator natively)
https://chrome.google.com/webstore/d.../support?hl=en