/ Published in: Other
To use this free scraper for instagram business profiles you need to have account at known [web scraping service](https://www.diggernaut.com/). This scraper allow you to scrape contact details from business profiles and also indicate if profile is business or not.
Scraper uses mobile API, so you will need to use instagram login and password. **MAKE SURE YOU DONT USE YOUR MAIN ACCOUNT**. API usage is unofficial and you are using it on your own risk.
So to use it you need to login to your Diggernaut account, create project, then create a digger and then click on "Add configuration" button and copy&paste below scraper code there.
You need to set your instagram username at line 8, instagram password at line 11 and list of usernames you want to retrieve data for (as comma separated list) at the line 14.
Then save your configuration and run the digger. In some time you should be able to download data.
Scraper uses mobile API, so you will need to use instagram login and password. **MAKE SURE YOU DONT USE YOUR MAIN ACCOUNT**. API usage is unofficial and you are using it on your own risk.
So to use it you need to login to your Diggernaut account, create project, then create a digger and then click on "Add configuration" button and copy&paste below scraper code there.
You need to set your instagram username at line 8, instagram password at line 11 and list of usernames you want to retrieve data for (as comma separated list) at the line 14.
Then save your configuration and run the digger. In some time you should be able to download data.
Expand |
Embed | Plain Text
Copy this code and paste it in your HTML
--- config: agent: Firefox debug: 2 do: - variable_set: field: username value: YOU_ACCOUNT_USERNAME_HERE - variable_set: field: password value: YOU_ACCOUNT_PASSWORD_HERE - variable_set: field: accounts value: LIST OF USERNAMES YOU WANT TO EXTRACT, COMMA SEPARATED - walk: to: https://www.instagram.com/ do: - find: path: body do: - parse: filter: window\._sharedData\s+\=\s+([^;]+); - normalize: routine: json2xml - to_block - find: path: config>csrf_token do: - parse - variable_set: token - walk: to: post: https://www.instagram.com/accounts/login/ajax/ headers: x-csrftoken: <%token%> x-instagram-ajax: 1 x-requested-with: XMLHttpRequest data: username: <%username%> password: <%password%> do: - find: path: status do: - parse - if: match: "fail" do: - cannot_login_probably_checkpoint_is_required - exit - find: path: authenticated do: - parse - if: match: "true" else: - wrong_login_or_password - exit - cookie_get: mid - variable_set: mid - cookie_get: rur - variable_set: rur - cookie_get: ds_user_id - variable_set: dsuserid - cookie_get: sessionid - variable_set: sessionid - variable_get: accounts - to_block - split: context: text delimiter: ',' - find: path: div.splitted do: - parse - space_dedupe - trim - variable_set: account - walk: to: https://www.instagram.com/<%account%>/?__a=1 do: - find: path: graphql > user > id do: - parse - variable_set: id - walk: to: https://i.instagram.com/api/v1/users/<%id%>/info/ headers: X-IG-App-ID: 567067343352427 X-IG-Capabilities: 3brDAw== X-IG-Connection-Type: WIFI X-IG-Connection-Speed: 3400 X-IG-Bandwidth-Speed-KBPS: -1.000 X-IG-Bandwidth-TotalBytes-B: 0 X-IG-Bandwidth-TotalTime-MS: 0 Cookie: mid=<%mid%>; csrftoken=<%token%>; rur=<%rur%>; ds_user_id=<%dsuserid%>; sessionid=<%sessionid%>; ig_or=; X-FB-HTTP-Engine: Liger Accept: '*/*' Accept-Language: en-US do: - find: path: body_safe > user do: - object_new: item - find: path: address_street do: - parse - space_dedupe - trim - object_field_set: object: item field: address_street - find: path: category do: - parse - space_dedupe - trim - object_field_set: object: item field: category - find: path: city_name do: - parse - space_dedupe - trim - object_field_set: object: item field: city_name - find: path: contact_phone_number do: - parse - space_dedupe - trim - object_field_set: object: item field: contact_phone_number - find: path: external_url do: - parse - space_dedupe - trim - object_field_set: object: item field: external_url - find: path: full_name do: - parse - space_dedupe - trim - object_field_set: object: item field: full_name - find: path: is_business do: - parse - space_dedupe - trim - object_field_set: object: item field: is_business - find: path: latitude do: - parse - space_dedupe - trim - object_field_set: object: item field: latitude - find: path: longitude do: - parse - space_dedupe - trim - object_field_set: object: item field: longitude - find: path: pk do: - parse - space_dedupe - trim - object_field_set: object: item field: id - find: path: public_email do: - parse - space_dedupe - trim - object_field_set: object: item field: public_email - find: path: public_phone_country_code do: - parse - space_dedupe - trim - object_field_set: object: item field: public_phone_country_code - find: path: public_phone_number do: - parse - space_dedupe - trim - object_field_set: object: item field: public_phone_number - find: path: username do: - parse - space_dedupe - trim - object_field_set: object: item field: username - find: path: zip do: - parse - space_dedupe - trim - object_field_set: object: item field: zip - object_save: name: item - sleep: 5
URL: https://www.diggernaut.com