Click here to close now.

Welcome!

Java IoT Authors: AppDynamics Blog, Liz McMillan, Pat Romanski, Elizabeth White, Plutora Blog

Related Topics: Java IoT

Java IoT: Article

Reading Data from the Internet

Lesson 6, Java Basics

To read local file streams, a program has to specify the file's location, i.e. "c:\practice\training.html". The same procedure is valid for reading of the remote files: just open the stream over the network. Java has a class URL that will help you to connect to a remote computer on the Internet.

At first, create an instance of the class URL:

try{
  URL xyz = new URL("http://www.xyz.com:80/training.html");
}
catch(MalformedURLException e){
      e.printStackTrace();
}

The MalformedURLException could be thrown if a non-valid URL has been used, for example missed protocol if you forgot to start URL with http://, extra spaces, etc. The MalformedURLException does not indicate that the remote machine has problems - just check the spelling of the URL.

Creation of the URL object does not establish the connection with the remote machine: you'll still need to open a stream to read it. Usually you have to perform the following steps to read a file from the Internet:

Step 1. Create and instance of the class URL

Step 2. Create an instance of the class URLConnection and open a connection using the URL instance from step 1.

Step 3. Get a reference to an input stream of this object by calling the method URLConnection.getInputStream()

Step 4. Read the data from the stream (use the buffered reader to speed up the process).

Since the streams from the package java.io are being used here for the read/write operations, you'll have to handle I/O exceptions.

The server you are trying to connect to has to be up and running and, in case of using http protocol, the special software (Web Server) has to be "listening to" the port that you specified in the URL instance. By default, Web servers are listening to the port number 80.

The program below reads and prints on the system console the content of the file index.html from yahoo.com. Obviously, to test this program your computer has to be connected to the Internet.

import java.net.*;
import java.io.*;
public class WebSiteReader {
  public static void main(String args[]){
       String nextLine;
       URL url = null;
       URLConnection urlConn = null;
       InputStreamReader  inStream = null;
       BufferedReader buff = null;
       try{
          // Create the URL obect that points
          // at the default file index.html
          url  = new URL("http://www.yahoo.com" );
          urlConn = url.openConnection();
         inStream = new InputStreamReader( 
                           urlConn.getInputStream());
           buff= new BufferedReader(inStream);
        
       // Read and print the lines from index.html
        while (true){
            nextLine =buff.readLine();  
            if (nextLine !=null){
                System.out.println(nextLine); 
            }
            else{
               break;
            } 
        }
     } catch(MalformedURLException e){
       System.out.println("Please check the URL:" + 
                                           e.toString() );
     } catch(IOException  e1){
      System.out.println("Can't read  from the Internet: "+ 
                                          e1.toString() ); 
  }
 }
}

The class WebSiteReader explicitly creates the URLConnection object. Strictly speaking we could get away just with the class URL:

URL url = new URL("http://www.yahoo.com");
InputStream in = url.getInputStream();
Buff= new BufferedReader(new InputStreamReader(in));

The reason why you may consider using the URLConnection class is that it could give you some additional control over the I/O process. For example, by calling its method setDoInput(true) you could allow (or disallow) downloads.

Connecting Through HTTP Proxy Servers

Most of the companies use firewalls for security reasons and their employees reach the Internet through the HTTP proxy server. Check the settings of your Internet browser to find out the host name and port number of the proxy server. If you are using Microsoft Internet Explorer check the menu Internet Options | Connections | LAN Setting. Netscape Navigator has proxy settings under Preferences | Advanced | Proxies.

While Java Applets know parameters of the proxy servers because they live inside the browser that has the proper settings, Java applications should set the parameters of the proxy server, for example:

   System.setProperty("http.proxyHost","xyz.com");
   System.setProperty("http.proxyPort", 8080);

If you do not want to hardcode these value, pass them to the program from the command line:

c:\practice>java -Dhttp.proxyHost=xyz.com  -Dhttp.proxyPort=8080  WebSiteReader

If you run this program without specifying the proxy settings, you'll get the UnknownHostException.

How to Download Files From the Internet

By using the class URL with input streams, we should be able to download practically any file (images, music, binary files) from the unsecured Internet site. The trick is in proper opening of the file stream. Let‚s write the class FileDownload that takes a URL and the file name as a command line arguments and copies remote file on your local disk.

import java.io.DataOutputStream;
import java.io.FileOutputStream;
import java.io.DataInputStream;
import java.net.URL;
import java.net.URLConnection;

class FileDownloader{

  public static void main(String args[]){
    if (args.length!=2){
      System.out.println(
        "Proper Usage: java FileDownloader RemoteFileURL LocalFileName");
      System.exit(0);
    }

  DataInputStream in=null;
  DataOutputStream out=null;
  FileOutputStream fOut=null;

  try{
    URL remoteFile=new URL(args[0]);
    URLConnection fileStream=remoteFile.openConnection();

    // Open the input streams for the remote file 
    fOut=new FileOutputStream(args[1]);

    // Open the output streams for saving this file on disk
    out=new DataOutputStream(fOut);

    in=new DataInputStream(fileStream.getInputStream());

    // Read the remote on save save the file
    int data;
    while((data=in.read())!=-1){
         fOut.write(data);
    }  
    System.out.println("Download of " + args[0] + " is complete." );   
  } catch (Exception e){
     e.printStackTrace();
  } finally {
     try{
       in.close();
       fOut.flush(); 
       fOut.close();      
     } catch(Exception e){e.printStackTrace();}
     
    }
 }
}

To download the Yahoo's main page into c:\temp directory start this program as follows:

java FileDownloader http://www.yahoo.com/index.html c:\\temp\\yahoo.html

The Stock Quote Program

In this section we'll write the program that can read stock market quotes from the Internet. There are many Internet sites providing stock market quotes, and 20 minutes delayed quotes are free. Wall Street companies subscribe for the real-time market data feed. One of the popular Internet sites is Yahoo and the URL for getting stock prices is http://finance.yahoo.com Point your Web browser to this site and get the price quote of any stock symbol. Note the URL of the resulting Web page in your browser. For example, if you've selected the symbol IBM, the URL would look like this:

 http://finance.yahoo.com/q?s=IBM

Right click on this page and select the View Source from the popup menu to see the HTML contents of this page: you'll see lots of HTML tags and the information about the IBM's trading will be buried somewhere deep inside the file:


...
   Last Trade:</TD><TD class=yfnc_tabledata1><BIG><B>95.32</B>
...

The next step is to modify the URL in our class WebSiteReader to print the content of the page about the symbol IBM:

    url  = new URL("http://finance.yahoo.com/q?s=IBM");

You can also store the whole page in a Java String variable instead of printing it. Just change the while loop to look as follows:

        String theWholePage;
        while (txt =buff.readLine() != null ){
             theWholePage=theWholePage + txt;
         }

If you add some smart tokenizing of theWholePage to get rid of all HTML tags and everything but Last Trade value, you can create your own little GUI Stock Quote screen. While this approach is useful to sharpen you tokenizing skills, it may not be the best solution, especially if Yahoo will change the wording of this page. That's why we'll be using another Yahoo's URL that provide stock quotes in a cleaner comma separated values format (CSV).

Here's the URL that should be used for the IBM's symbol:

http://quote.yahoo.com/d/quotes.csv?s=IBM&f=sl1d1t1c1ohgv&e=.csv

This URL would produce a string that looks something like this (the price quotes are not real):

"IBM",95.32,"1/16/2004","5:01pm",+1.30,95.00,95.35,94.71,9305000

The next class StockQuoter prints the price quote for the symbol that is specified as a command line argument.

import java.net.*;
import java.io.*;
import java.util.StringTokenizer;

public class StockQuoter {
       String csvString;
       URL url = null;
       URLConnection urlConn = null;
       InputStreamReader  inStream = null;
       BufferedReader buff = null;

     StockQuoter(String symbol){

       try{
           url  = new              
               URL("http://quote.yahoo.com/d/quotes.csv?s="
                   + symbol + "&f=sl1d1t1c1ohgv&e=.csv" );
           urlConn = url.openConnection();
           inStream = new
               InputStreamReader(urlConn.getInputStream());
           BufferedReader buff= new BufferedReader(inStream);

           // get the quote as a csv string
           csvString =buff.readLine();  

           // parse the csv string
              StringTokenizer tokenizer = new
                          StringTokenizer(csvString, ",");
              String ticker = tokenizer.nextToken();
              String price  = tokenizer.nextToken();
              String tradeDate = tokenizer.nextToken();  
              String tradeTime = tokenizer.nextToken();  

              System.out.println("Symbol: " + ticker + 
                " Price: " + price + " Date: "  + tradeDate 
                + " Time: " + tradeTime);
     } catch(MalformedURLException e){
         System.out.println("Please check the spelling of the URL:" 
         		           + e.toString() );
     } catch(IOException  e1){
      System.out.println("Can't read from the Internet: " + 
                                           e1.toString() ); 
     }
     finally{
         try{
           inStream.close();
           buff.close();   
         }catch(Exception e){
            e.printStackTrace();
         }
     }  
   } 

  public static void main(String args[]){
       if (args.length==0){
          System.out.println(
                     "Sample Usage: java StockQuoter IBM");
          System.exit(0);
       } 
       StockQuoter sq = new StockQuoter(args[0]);
  }

}

To see the latest price of IBM stock start the StockQuoter program as follows:

c:\practice>java StockQuoter IBM

In this lesson I tried to show you that working with the streams over the net may be as simple as dealing with files on your local disk. These days Java runs remote-controlled Mars rovers, and this is not a rocket science anymore - just open a Java stream that points at Mars.

More Stories By Yakov Fain

Yakov Fain is a co-founder of two software companies: Farata Systems and SuranceBay. He authored several technical books and lots of articles on software development. Yakov is Java Champion (https://java-champions.java.net). He leads leads Princeton Java Users Group. Two of Yakov's books will go in print this year: "Enterprise Web Development" (O'Reilly) and "Java For Kids" (No Starch Press).

Comments (19) View Comments

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


Most Recent Comments
Dave Mason 10/18/07 03:00:39 AM EDT

Hello Yakov,
how can I read data in real time from java applet for example:
http://www.saxobank.com/?id=911&Lan=EN&Au=1&Grp=5

nitin 11/27/04 02:34:32 AM EST

hi,
can any one tell me how can i read the parameters from URL that has been passed by someone ie id someone send id=123 then how can i get it,how the whole process work & last but the mostimp i want to do whole thing through core java.
Bye

MIchael Behrens 02/08/04 08:17:43 PM EST

Great Article! Keep''m coming!
There is also a good WebCopy implementation at:
http://www.acme.com/java/software/
It makes all the web references local - It does not work on HTTPS - I wish I had all the know-how & code to make an equivalent HTTPS WebSiteReader, which would be very handing for testing our query pages.
Thanks!

Ferruccio Spagna 01/31/04 10:13:33 AM EST

I am very happy with your code, Yakov.
I tried it and everything works very well.
Thanks Ferruccio

Yakov Fain 01/31/04 08:59:53 AM EST

In this lesson I do this using HTTP GET request by attaching parameters to the URL after a question mark separated with an ampersand.

Fo HTTP POST, after getting the URLConnection stream, do something like this:

urlConn.setRequestProperty("Content-Type", "application/x-www-form-urlencoded");
urlConn.setDoOutput(true);

String myData = "myParam1=" + URLEncoder.encode ("abc") +
"&myParam2=" + URLEncoder.encode ("xyz");

DataOutputStream encodedParams = new DataOutputStream (urlConn.getOutputStream ());

encodedParams.writeBytes (myData);
encodedParams.flush ();
encodedParams.close ();

Ferruccio Spagna 01/31/04 08:29:47 AM EST

Ok. What you say occurs when you have a page with a form. Well known. But my question was: you want to read the stream (see your last article) and perhaps manipulate it. You then use the code:
java.net.URL url = new java.net.URL("http://www.cgi.com");
java.net.URLConnection c = url.openConnection();
java.io.DataInputStream dis = new java.io.DataInputStream(c.getInputStream());
and so on.
How can you give the parameters to the cgi with THESE code lines?

Yakov Fain 01/31/04 08:16:45 AM EST

Well, here''s the sample from my book "The Java Tutorial for the Real World".

1. An HTML form has 2 fields to perform a book search:

Find a book

Enter a word from the book title:

2. The servlet FindBooks runs on the server side, gets the parameters and sends back to the browser a page that reads that this book cost $65:

public class FindBooks extends javax.servlet.http.HttpServlet {

public void doGet(HttpServletRequest req,
HttpServletResponse res)
throws ServletException, IOException {
String title;
PrintWriter out = res.getWriter();
title = req.getParameter("booktitle");
res.setContentType("text/html");

out.println("");

out.println("the book "+title+" costs only $65");

out.println("Please enter your credit card number");

out.println("");
}
}

Ferruccio Spagna 01/31/04 07:33:44 AM EST

Thank you for your answer. I have several good Java books, but I didn''t find how I can read a CGI URL stream passing POST parameters in the calling instruction anywhere. Tell me please in a few words the way, if it is possible.

Yakov Fain 01/30/04 07:38:59 PM EST

Ferruccio,

Java Servlets and JavaServer Pages technologies deal with HTTP Post and Get requests, parameters, etc. Wait for my lesson on servlets, or get a good book if you need to know the answer now.

Ferruccio Spagna 01/30/04 06:10:50 PM EST

I add to my above question: how can I pass the parameter values calling the URL?

Ferruccio Spagna 01/30/04 05:59:24 PM EST

What about reading a CGI URL accepting METHOD=POST parameters?

Yakov Fain 01/29/04 11:27:25 PM EST

Dennis,

You can run on the remote machine one of the following: RMI server, SocketServer, a Servlet,FTP server... that has to create an instance of the class java.io.File that points at your directory. The File.list() will return the list of files in this directory as a String array. Now create instances of File for each of the array elements and call File.lastModify() to check the timestamp. After finding the file with the proper date, send its URL (or a stream reference) to the client for reading.

Dennis Christopher 01/29/04 01:44:43 PM EST

I found the article useful, and above average in clarity.

I am wondering if anyone knows how you set up the
connection to download an (entire) directory of files?
and to check the file dates before doing so?

Josh Davis 01/28/04 12:49:02 PM EST

Yakov,

You should mention that HttpURLConnection may ''hang'' when contacting servers that don''t behave correctly. This can cause a lot of problems in a server side application. The solutions are:
1) Use something other than URL/URLConnection/HttpURLConnection.
2) Set the system properties that control the socket timeouts for the Sun HTTP client.

John Pantone 01/26/04 01:55:42 PM EST

Very useful, clear.

Yakov Fain 01/23/04 09:49:54 PM EST

If the proxy requires authentication, set the following properties:

System.setProperty("http.proxyUser", "JLarkin");
System.setProperty("http.proxyPassword", "YourPassword");

You may also try using the class java.net.Authenticator

John Larkin 01/23/04 11:28:08 AM EST

Our proxy server requires a user_id and password.
I am having trouble finding a method to set these.
Any ideas ? Thanks

Selvan Rajan 01/23/04 10:10:41 AM EST

What would be the case to deal with cookies from some of the web sites? Especially it becomes cumbersome, if there is a redirection involved after setting the cookie.
Any ideas?

Debashish 01/23/04 02:07:51 AM EST

In the following para :
Strictly speaking we could get away just with the class URL:
URL url = new URL("http://www.yahoo.com");
InputStream in = url.getInputStream();
Buff= new BufferedReader(new InputStreamReader(in));

I think the second line of code should be :
InputStream in = url.openStream();

@ThingsExpo Stories
"ciqada is a combined platform of hardware modules and server products that lets people take their existing devices or new devices and lets them be accessible over the Internet for their users," noted Geoff Engelstein of ciqada, a division of Mars International, in this SYS-CON.tv interview at @ThingsExpo, held June 9-11, 2015, at the Javits Center in New York City.
"We have seen the evolution of WebRTC right from the starting point to what it has become today, that people are using in real applications," noted Dr. Natasha Tamaskar, Vice President and Head of Cloud and Mobile Strategy and Ecosystem at GENBAND, in this SYS-CON.tv interview at WebRTC Summit, held June 9-11, 2015, at the Javits Center in New York City.
The 3rd International WebRTC Summit, to be held Nov. 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA, announces that its Call for Papers is now open. Topics include all aspects of improving IT delivery by eliminating waste through automated business models leveraging cloud technologies. WebRTC Summit is co-located with 15th International Cloud Expo, 6th International Big Data Expo, 3rd International DevOps Summit and 2nd Internet of @ThingsExpo. WebRTC (Web-based Real-Time Communication) is an open source project supported by Google, Mozilla and Opera that aims to enable bro...
Internet of Things (IoT) will be a hybrid ecosystem of diverse devices and sensors collaborating with operational and enterprise systems to create the next big application. In their session at @ThingsExpo, Bramh Gupta, founder and CEO of robomq.io, and Fred Yatzeck, principal architect leading product development at robomq.io, discussed how choosing the right middleware and integration strategy from the get-go will enable IoT solution developers to adapt and grow with the industry, while at the same time reduce Time to Market (TTM) by using plug and play capabilities offered by a robust IoT ...
It is one thing to build single industrial IoT applications, but what will it take to build the Smart Cities and truly society-changing applications of the future? The technology won’t be the problem, it will be the number of parties that need to work together and be aligned in their motivation to succeed. In his session at @ThingsExpo, Jason Mondanaro, Director, Product Management at Metanga, discussed how you can plan to cooperate, partner, and form lasting all-star teams to change the world and it starts with business models and monetization strategies.
SYS-CON Events announced today that ProfitBricks, the provider of painless cloud infrastructure, will exhibit at SYS-CON's 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. ProfitBricks is the IaaS provider that offers a painless cloud experience for all IT users, with no learning curve. ProfitBricks boasts flexible cloud servers and networking, an integrated Data Center Designer tool for visual control over the cloud and the best price/performance value available. ProfitBricks was named one of the coolest Clo...
The 4th International Internet of @ThingsExpo, co-located with the 17th International Cloud Expo - to be held November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA - announces that its Call for Papers is open. The Internet of Things (IoT) is the biggest idea since the creation of the Worldwide Web more than
17th Cloud Expo, taking place Nov 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterprises are using some form of XaaS – software, platform, and infrastructure as a service.
"In the IoT space we are helping customers, mostly enterprises and industry verticals where time-to-value is critical, and we help them with the ability to do faster insights and actions using our platform so they can transform their business operations," explained Venkat Eswara, VP of Marketing at Vitria, in this SYS-CON.tv interview at @ThingsExpo, held June 9-11, 2015, at the Javits Center in New York City.
Discussions about cloud computing are evolving into discussions about enterprise IT in general. As enterprises increasingly migrate toward their own unique clouds, new issues such as the use of containers and microservices emerge to keep things interesting. In this Power Panel at 16th Cloud Expo, moderated by Conference Chair Roger Strukhoff, panelists addressed the state of cloud computing today, and what enterprise IT professionals need to know about how the latest topics and trends affect their organization.
WebRTC converts the entire network into a ubiquitous communications cloud thereby connecting anytime, anywhere through any point. In his session at WebRTC Summit,, Mark Castleman, EIR at Bell Labs and Head of Future X Labs, will discuss how the transformational nature of communications is achieved through the democratizing force of WebRTC. WebRTC is doing for voice what HTML did for web content.
To many people, IoT is a buzzword whose value is not understood. Many people think IoT is all about wearables and home automation. In his session at @ThingsExpo, Mike Kavis, Vice President & Principal Cloud Architect at Cloud Technology Partners, discussed some incredible game-changing use cases and how they are transforming industries like agriculture, manufacturing, health care, and smart cities. He will discuss cool technologies like smart dust, robotics, smart labels, and much more. Prepare to be blown away with a glimpse of the future.
Connected things, systems and people can provide information to other things, systems and people and initiate actions for each other that result in new service possibilities. By taking a look at the impact of Internet of Things when it transitions to a highly connected services marketplace we can understand how connecting the right “things” and leveraging the right partners can provide enormous impact to your business’ growth and success. In her general session at @ThingsExpo, Esmeralda Swartz, VP, Marketing Enterprise and Cloud at Ericsson, discussed how this exciting emergence of layers of...
The 17th International Cloud Expo has announced that its Call for Papers is open. 17th International Cloud Expo, to be held November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, brings together Cloud Computing, APM, APIs, Microservices, Security, Big Data, Internet of Things, DevOps and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportunity. Submit your speaking proposal today!
The 5th International DevOps Summit, co-located with 17th International Cloud Expo – being held November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA – announces that its Call for Papers is open. Born out of proven success in agile development, cloud computing, and process automation, DevOps is a macro trend you cannot afford to miss. From showcase success stories from early adopters and web-scale businesses, DevOps is expanding to organizations of all sizes, including the world's largest enterprises – and delivering real results. Among the proven benefits, DevOps is corr...
The Internet of Things is not only adding billions of sensors and billions of terabytes to the Internet. It is also forcing a fundamental change in the way we envision Information Technology. For the first time, more data is being created by devices at the edge of the Internet rather than from centralized systems. What does this mean for today's IT professional? In this Power Panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, panelists addressed this very serious issue of profound change in the industry.
SYS-CON Events announced today that kintone has been named “Bronze Sponsor” of SYS-CON's 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. kintone promotes cloud-based workgroup productivity, transparency and profitability with a seamless collaboration space, build your own business application (BYOA) platform, and workflow automation system.
With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo in Silicon Valley. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be! Internet of @ThingsExpo, taking place Nov 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 17th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal an...
Buzzword alert: Microservices and IoT at a DevOps conference? What could possibly go wrong? In this Power Panel at DevOps Summit, moderated by Jason Bloomberg, the leading expert on architecting agility for the enterprise and president of Intellyx, panelists peeled away the buzz and discuss the important architectural principles behind implementing IoT solutions for the enterprise. As remote IoT devices and sensors become increasingly intelligent, they become part of our distributed cloud environment, and we must architect and code accordingly. At the very least, you'll have no problem fillin...
Explosive growth in connected devices. Enormous amounts of data for collection and analysis. Critical use of data for split-second decision making and actionable information. All three are factors in making the Internet of Things a reality. Yet, any one factor would have an IT organization pondering its infrastructure strategy. How should your organization enhance its IT framework to enable an Internet of Things implementation? In his session at @ThingsExpo, James Kirkland, Red Hat's Chief Architect for the Internet of Things and Intelligent Systems, described how to revolutionize your archit...