News contextualization using web scraping techniques and API(s)

Author: 
Prathmesh Achyut Kestikar, Vrushali Karne, Atul Gutal, Akshata Kasliwal and Priya Thakare
Abstract: 

News contextualization gathers different views and articles about a specific news topic from multiple sources. It integrates the textual and pictorial information about the topic found on social networking sites and news sites and displays it in a single place. A search keyword is taken from the user, and the related data is retrieved from news sources and social networking sites using web scraping techniques, APIs, or both. The advantage is that the user does not have to search each source separately: a single search covers the usual, predefined sources and the results appear in one place, which saves opening a new page for every source. The social networking data may also help the user find the views of different people on the topic. For example, if the search keyword is ‘xyz scam’, the server processes this keyword on social networking sites such as Facebook and Twitter and finds the posts and tweets related to ‘xyz scam’. YouTube can also be searched for related videos, and news about ‘xyz scam’ can be provided from reliable news websites. Finally, all of this data, i.e. the Facebook posts, the tweets, the YouTube videos and the news articles, is displayed on a single page with separate sections for posts, tweets, videos and news. This could be implemented by using web scraping techniques to extract data from websites. Some high-profile social networking websites also provide APIs for data extraction, which may make retrieving relevant data easier. In this paper, we explore some web scraping techniques and APIs that can be used for news contextualization.
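
The flow described in the abstract (one keyword, several sources, one page with a section per source) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the names `HeadlineParser`, `scrape_headlines`, `contextualize`, `SAMPLE_NEWS_HTML` and `fake_tweets` are hypothetical, the HTML is made-up sample data, and the tweet source is a stub standing in for a real social network API call. The scraping step uses only Python's standard `html.parser` module and assumes headlines sit in `<h2>` elements.

```python
from html.parser import HTMLParser
from typing import Callable, Dict, List


class HeadlineParser(HTMLParser):
    """Collects the text of each <h2> element that mentions the keyword."""

    def __init__(self, keyword: str) -> None:
        super().__init__()
        self.keyword = keyword.lower()
        self.headlines: List[str] = []
        self._in_h2 = False
        self._buffer: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self._in_h2 = True
            self._buffer = []

    def handle_data(self, data):
        # Text inside nested tags (e.g. <em>) is still part of the headline.
        if self._in_h2:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "h2" and self._in_h2:
            self._in_h2 = False
            text = "".join(self._buffer).strip()
            if self.keyword in text.lower():
                self.headlines.append(text)


def scrape_headlines(html: str, keyword: str) -> List[str]:
    """Web scraping step: pull matching headlines out of raw HTML."""
    parser = HeadlineParser(keyword)
    parser.feed(html)
    parser.close()
    return parser.headlines


# A source is any function that maps a keyword to a list of result strings.
Fetcher = Callable[[str], List[str]]


def contextualize(keyword: str, sources: Dict[str, Fetcher]) -> Dict[str, List[str]]:
    """Query every source once and group the results by section name."""
    return {section: fetch(keyword) for section, fetch in sources.items()}


# Hypothetical sample page; a real system would download this from a news site.
SAMPLE_NEWS_HTML = """
<html><body>
<h2>XYZ scam: regulator opens inquiry</h2>
<h2>Weather update for the weekend</h2>
<h2>Court hears first <em>xyz scam</em> petition</h2>
</body></html>
"""


def fake_tweets(keyword: str) -> List[str]:
    # Stub standing in for an API-based source; no network call is made.
    return [f"Sample tweet mentioning {keyword}"]


sources: Dict[str, Fetcher] = {
    "news": lambda kw: scrape_headlines(SAMPLE_NEWS_HTML, kw),
    "tweets": fake_tweets,
}

page = contextualize("xyz scam", sources)
for section, items in page.items():
    print(section.upper())
    for item in items:
        print("  -", item)
```

Treating every source as a plain function of the keyword keeps scraped sites and API-backed sites interchangeable, so a section for posts or videos could be added by registering one more entry in `sources`.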

Copyright © 2016 International Journal Development Research. All Rights Reserved.