Click here to close now.

Welcome!

Java Authors: Pat Romanski, Elizabeth White, Asad Ali, Carmen Gonzalez, Liz McMillan

Related Topics: Java

Java: Article

Java Performance I/O Tuning

Java Performance I/O Tuning

Many Java programs that utilize I/O are excellent candidates for performance tuning. One of the more common problems in Java applications is inefficient I/O. A profile of Java applications and applets that handle significant volumes of data will show significant time spent in I/O routines, implying substantial gains can be had from I/O performance tuning. In fact, the I/O performance issues usually overshadow all other performance issues, making them the first area to concentrate on when tuning performance. Therefore, I/O efficiency should be a high priority for developers looking to optimally increase performance. Unfortunately, optimal reading and writing can be challenging in Java.

Once an application's reliance upon I/O is established and I/O is determined to account for a substantial slice of the applications execution time, performance tuning can be undertaken. The best method for determining the distribution of execution time among methods is to use a profiler. Sunª Javaª WorkShopª software provides an excellent profiler that offers detailed call counts and execution times for each method. System method call statistics can be tabulated as an option. Stream chaining and custom I/O class methods of performance tuning are discussed. An example program is provided that allows the progressive measurement of the progress of the tuning effort. Using the example program that is provided, JavaIOTest.java, and utilizing the techniques described, substantial performance improvements of an order of magnitude can be achieved. Simple stream chaining provides approximately a 91% decrease in execution time from 28,198 milliseconds to 2,510 milliseconds, while a custom BufferedFileReader class cuts performance time by another 75%, over 97% total, to 630 milliseconds for a 250 kilobyte text file on The Sun™ Solaris™ 2.6 operating environment.

Introduction
Java performance is currently a topic of great interest. Performance is usually hotly debated for any relatively new language or operating environment, so this is not surprising. However, Java's reliance upon the availability of sufficient network bandwidth for the downloading of classes shifts the relative benefits of some options for optimization. The reliance on the network penalizes optimization techniques that favor increasing code size in order to provide faster execution. The resulting optimized classes can take longer to download to the client. Of course, server-side Java is not as acutely affected by code size and developers can even consider native code compilers for that case. Based upon anecdotal evidence, most Java development today seems to be concentrated on client-side applets with the result that download times are an important criterion. Java optimization efforts, therefore, need to be well-researched and considered.

Because Java is a relatively new language, optimizing compiler features are less sophisticated than those available for C and C++, leaving room for more "hand-crafting". The "hand" optimization of key sections identified by profilers, such as the profiler available in Sun's Java WorkShop 2.0, can reap substantial benefits.

One of the more common problems in Java applications is inefficient I/O. A profile of Java applications and applets that handle significant volumes of data will show significant time spent in I/O routines, implying substantial gains can be had from I/O performance tuning. In fact, the I/O performance issues usually overshadow all other performance issues, making them the first area to concentrate on when tuning performance. Therefore, I/O efficiency should be a high priority for developers looking to optimally increase performance. Unfortunately, optimal reading and writing can be challenging in Java. Streamlining the use of I/O often results in greater performance gains than all other possible optimizations combined. It is not uncommon to see a speed improvement of at least an order of magnitude using efficient I/O techniques, as this paper and the example program will demonstrate.

This article focuses on the improvement gains possible through careful use of both the existing Java I/O classes and the introduction of a custom file reader, BufferedFileReader. BufferedFileReader is responsible for some of the performance increase of Java WorkShop version 2.0 over version 1.0. An example application is used to read three different file sizes, ranging from 100 kilobytes to 500 kilobytes and the results are compared for various optimizations.

Performance Tuning Through Stream Chaining
As a demonstration of I/O performance tuning, this article will describe the process of tuning a sample program created expressly for this paper: JavaIOTest. JavaIOTest tracks the execution times for several I/O schemes starting with a very basic DataInputStream method and culminating with the use of a custom-buffered, file-reader class, while demonstrating the performance improvements obtained by several program design changes during the tuning effort. The actual execution times are meant to show the relative improvements possible.* The actual execution times will vary widely among the systems used. Readers are cautioned that what is important is the relative improvement on the same system, test-to-test, and that comparisons across operating environments and systems are complex and the results can be specious.

Basic IO: DataInputStream
The I/O method used in this section is a DataInputStream chained to a FileInputStream as shown in Listing 1. This method of reading a file is very common since it is simple, but it is extremely slow. The reason for the poor performance is that the DataInputStream class does no buffering. The resulting reads are done one byte at a time. Several instances of this technique have been found in the JDKª software as well as several "real" Java programs, providing fertile ground for improvement through a tuning regime (see Listing 1).

The results of using the default, basic I/O scheme are as follows. The first section of the example program, JavaIOTest, showed run times of 28,198 milliseconds reading a 250 kilobyte file.*

An Improvement: BufferedInputStream
A simple improvement involves buffering the FileInputStream by interposing a BufferedInputStream in the stream chain. This buffers the data, with the default buffer size of 2048 bytes. Listing 2 illustrates the minor source code change required.

The resulting performance increase for the medium sized file (250 kilobytes) was 91%, from 28,198 milliseconds to 2,510 -- over an order of magnitude with just a simple change.*

The New JDK 1.1 Classes
The foregoing method has provided a substantial performance improvement but has a serious flaw: the readLine() method of DataInputStream does not properly handle Unicode characters. The problem is that the method assumes all characters are one byte in length while Unicode characters are two bytes in length. This method has been deprecated beginning in JDK 1.1. Since deprecated classes are discouraged, the FileReader and BufferedReader classes should be substituted for the classes.

Unfortunately, the scheme to provide for Unicode character localization consists of invoking a locale-dependent converter on the raw bytes to convert them to Java characters, causing an extra copy operation per character. This penalty is offset by other efficiencies in the code. The code change is shown in Listing 3.

The resulting performance increase for the medium file size was 57%, >from 2,510 to 1,092 milliseconds.*

Buffer Size Effects
The buffer size used in buffering schemes is important for performance. As a rule of thumb, bigger is better to a point. In order to examine the impact of the buffer size, a test run was made with a smaller buffer than the default of 8,192 bytes used in the BufferedReader class. Listing 4 shows the code segment using a reduced buffer size of 1,024 bytes.

Depending upon the file's size and platform used for testing, the larger buffer size provided performance improvements ranging from 3 to 13 percent. The use of a large buffer size will improve performance significantly and should be considered unless local memory is restricted.

Summary
Using simple stream-chaining techniques, the execution performance of an I/O bound Java program has been increased an average of 97 percent over using the simple DataInputStream class. This is a substantial improvement for a little extra design work and one that could mean the difference between shipping and re-designing an interactive application.

Tuning with Custom I/O Classes
To this point, tuning has focused on using the core classes distributed with the JDK. With each version of the JDK, more effort seems to be going into tuning critical sections for performance. The improvement in speed of the BufferedReader class over the BufferedInputStream class despite the additional copy per character hints at this. However, if the application needs to read large files, a custom class can be created to further tune performance. The BufferedReader.readLine() method creates an instance of StringBuffer to hold the characters in the line it reads. It then converts the StringBuffer to String, resulting in two more copies per character. The BufferedFileReader class utilizes a modified readLine() method that avoids the extra, double-copy in most cases. It also adds the convenience of creating the FileReader class for the caller. Listing 5 shows the changes required to use this class. The resulting performance increase for the medium file size was 32% overall to 630 milliseconds.*

The BufferedFileReader class is being used in Java WorkShop (package sun.jws.util). The documentation comment in Listing 6 describes the efficiencies added.

Without having to chain together several different classes, as with the standard JDK classes, the example provides a single, efficient class through which a file may be read. It is also more efficient (typically faster) than the fastest JDK classes. Specific optimizations include:
1. More efficiently coded readLine( ) method.
2. Adds open( ) method, so the class can be reused when several files are read in a loop. This avoids repeated allocation and deallocation of buffers.

This class (see Listing 6) contains a self-benchmarking test in its main() method that can be used to measure the exact speedup on a particular system.

Further Tuning
Although the example in Listing 5 is as much as 45 times faster than the example in Listing 1 (and actually comprises fewer lines of code), it is still far from the best that can be done. There are at least two more major optimizations that can be done if still higher performance is required and we are willing to do a little more work.

First, if we look at the first line of the while loop, we see that a new String object is being created for every line of the file being read:

while ((line = in.readLine()) != null) {

This means, for example, that for a 100,000 line file 100,000 String objects would be created. Creating a large number of objects incurs costs in three ways: 1. Time and memory to allocate the space for the objects
2. Time to initialize the objects
3. Time to garbage collect the objects

The problem here is that the I/O buffer is private; the user cannot access it directly. Therefore, BufferedFileReader must create a new String object in order to return the data to the user. Although this follows the conventional assertion that class structures should largely be private in order to control data access, the performance penalty is too high an insurance premium for this case.

To get around this problem, the user must manage the buffer directly without using the BufferedReader or BufferedFileReader convenience classes. This will enable the user to reuse buffers rather than creating a new object each time to hold the data.

Second, strings are inherently less efficient than arrays based upon char. This is because the user must call a method to access each character of a String, whereas the characters can be accessed directly in a char array. Hence, our code example can be made more efficient by avoiding Strings entirely, and using char arrays directly.

Listing 7 shows the code which implements the two optimizations above. It is substantially more lines of code than the previous examples, but tests show it performs as much as 3 times faster than the example in Listing 5.

Performance Tuning Results
The results of running the test program used for this article, JavaIOTest, on text files ranging from 100 kilobytes to 500 kilobytes in size are summarized in the tables found in Appendix One at http://www.sun.com/workshop/java/wp-javaio. The relative performance numbers are more important than the absolute numbers since the system was not isolated nor used exclusively for just the test processes. As the automobile industry states in its disclaimers, "your mileage may vary", readers are again cautioned that what is most important is the relative improvement on the same system and test-to-test. Comparisons across operating environments and systems are complex and the results can be specious.

*Full test results are available at: http://www.sun.com/workshop/java/wp-javaio. See Appendix One.

This article was provided by Engineering, Sun's Authoring and Development Tools Group.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@ThingsExpo Stories
SYS-CON Events announced today that Solgenia will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY, and the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. Solgenia is the global market leader in Cloud Collaboration and Cloud Infrastructure software solutions. Designed to “Bridge the Gap” between Personal and Professional Social, Mobile and Cloud user experiences, our solutions help large and medium-sized organizations dr...
SYS-CON Events announced today that Liaison Technologies, a leading provider of data management and integration cloud services and solutions, has been named "Silver Sponsor" of SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York, NY. Liaison Technologies is a recognized market leader in providing cloud-enabled data integration and data management solutions to break down complex information barriers, enabling enterprises to make smarter decisions, faster.
Connected devices and the Internet of Things are getting significant momentum in 2014. In his session at Internet of @ThingsExpo, Jim Hunter, Chief Scientist & Technology Evangelist at Greenwave Systems, examined three key elements that together will drive mass adoption of the IoT before the end of 2015. The first element is the recent advent of robust open source protocols (like AllJoyn and WebRTC) that facilitate M2M communication. The second is broad availability of flexible, cost-effective storage designed to handle the massive surge in back-end data in a world where timely analytics is e...
SYS-CON Events announced today that Akana, formerly SOA Software, has been named “Bronze Sponsor” of SYS-CON's 16th International Cloud Expo® New York, which will take place June 9-11, 2015, at the Javits Center in New York City, NY. Akana’s comprehensive suite of API Management, API Security, Integrated SOA Governance, and Cloud Integration solutions helps businesses accelerate digital transformation by securely extending their reach across multiple channels – mobile, cloud and Internet of Things. Akana enables enterprises to share data as APIs, connect and integrate applications, drive part...
SYS-CON Events announced today that CommVault has been named “Bronze Sponsor” of SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY, and the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. A singular vision – a belief in a better way to address current and future data management needs – guides CommVault in the development of Singular Information Management® solutions for high-performance data protection, universal availability and sim...
Cloud is not a commodity. And no matter what you call it, computing doesn’t come out of the sky. It comes from physical hardware inside brick and mortar facilities connected by hundreds of miles of networking cable. And no two clouds are built the same way. SoftLayer gives you the highest performing cloud infrastructure available. One platform that takes data centers around the world that are full of the widest range of cloud computing options, and then integrates and automates everything. Join SoftLayer on June 9 at 16th Cloud Expo to learn about IBM Cloud's SoftLayer platform, explore se...
SYS-CON Media announced today that @ThingsExpo Blog launched with 7,788 original stories. @ThingsExpo Blog offers top articles, news stories, and blog posts from the world's well-known experts and guarantees better exposure for its authors than any other publication. @ThingsExpo Blog can be bookmarked. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago.
The 3rd International Internet of @ThingsExpo, co-located with the 16th International Cloud Expo - to be held June 9-11, 2015, at the Javits Center in New York City, NY - announces that its Call for Papers is open. The Internet of Things (IoT) is the biggest idea since the creation of the Worldwide Web more than 20 years ago.
The WebRTC Summit 2014 New York, to be held June 9-11, 2015, at the Javits Center in New York, NY, announces that its Call for Papers is open. Topics include all aspects of improving IT delivery by eliminating waste through automated business models leveraging cloud technologies. WebRTC Summit is co-located with 16th International Cloud Expo, @ThingsExpo, Big Data Expo, and DevOps Summit.
The Internet of Things promises to transform businesses (and lives), but navigating the business and technical path to success can be difficult to understand. In his session at @ThingsExpo, Sean Lorenz, Technical Product Manager for Xively at LogMeIn, demonstrated how to approach creating broadly successful connected customer solutions using real world business transformation studies including New England BioLabs and more.
SYS-CON Media announced today that 9 out of 10 " most read" DevOps articles are published by @DevOpsSummit Blog. Launched in October 2014, @DevOpsSummit Blog offers top articles, news stories, and blog posts from the world's well-known experts and guarantees better exposure for its authors than any other publication. The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is no time to wait for long development cycles that produce softw...
The world's leading Cloud event, Cloud Expo has launched Microservices Journal on the SYS-CON.com portal, featuring over 19,000 original articles, news stories, features, and blog entries. DevOps Journal is focused on this critical enterprise IT topic in the world of cloud computing. Microservices Journal offers top articles, news stories, and blog posts from the world's well-known experts and guarantees better exposure for its authors than any other publication. Follow new article posts on Twitter at @MicroservicesE
SYS-CON Events announced today that Site24x7, the cloud infrastructure monitoring service, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. Site24x7 is a cloud infrastructure monitoring service that helps monitor the uptime and performance of websites, online applications, servers, mobile websites and custom APIs. The monitoring is done from 50+ locations across the world and from various wireless carriers, thus providing a global perspective of the end-user experience. Site24x7 supports monitoring H...
Wearable technology was dominant at this year’s International Consumer Electronics Show (CES) , and MWC was no exception to this trend. New versions of favorites, such as the Samsung Gear (three new products were released: the Gear 2, the Gear 2 Neo and the Gear Fit), shared the limelight with new wearables like Pebble Time Steel (the new premium version of the company’s previously released smartwatch) and the LG Watch Urbane. The most dramatic difference at MWC was an emphasis on presenting wearables as fashion accessories and moving away from the original clunky technology associated with t...
One of the biggest challenges when developing connected devices is identifying user value and delivering it through successful user experiences. In his session at Internet of @ThingsExpo, Mike Kuniavsky, Principal Scientist, Innovation Services at PARC, described an IoT-specific approach to user experience design that combines approaches from interaction design, industrial design and service design to create experiences that go beyond simple connected gadgets to create lasting, multi-device experiences grounded in people's real needs and desires.
SYS-CON Events announced today that SafeLogic has been named “Bag Sponsor” of SYS-CON's 16th International Cloud Expo® New York, which will take place June 9-11, 2015, at the Javits Center in New York City, NY. SafeLogic provides security products for applications in mobile and server/appliance environments. SafeLogic’s flagship product CryptoComply is a FIPS 140-2 validated cryptographic engine designed to secure data on servers, workstations, appliances, mobile devices, and in the Cloud.
The list of ‘new paradigm’ technologies that now surrounds us appears to be at an all time high. From cloud computing and Big Data analytics to Bring Your Own Device (BYOD) and the Internet of Things (IoT), today we have to deal with what the industry likes to call ‘paradigm shifts’ at every level of IT. This is disruption; of course, we understand that – change is almost always disruptive.
Containers and microservices have become topics of intense interest throughout the cloud developer and enterprise IT communities. Accordingly, attendees at the upcoming 16th Cloud Expo at the Javits Center in New York June 9-11 will find fresh new content in a new track called PaaS | Containers & Microservices Containers are not being considered for the first time by the cloud community, but a current era of re-consideration has pushed them to the top of the cloud agenda. With the launch of Docker's initial release in March of 2013, interest was revved up several notches. Then late last...
Can call centers hang up the phones for good? Intuitive Solutions did. WebRTC enabled this contact center provider to eliminate antiquated telephony and desktop phone infrastructure with a pure web-based solution, allowing them to expand beyond brick-and-mortar confines to a home-based agent model. It also ensured scalability and better service for customers, including MUY! Companies, one of the country's largest franchise restaurant companies with 232 Pizza Hut locations. This is one example of WebRTC adoption today, but the potential is limitless when powered by IoT.
@ThingsExpo has been named the Top 5 Most Influential M2M Brand by Onalytica in the ‘Machine to Machine: Top 100 Influencers and Brands.' Onalytica analyzed the online debate on M2M by looking at over 85,000 tweets to provide the most influential individuals and brands that drive the discussion. According to Onalytica the "analysis showed a very engaged community with a lot of interactive tweets. The M2M discussion seems to be more fragmented and driven by some of the major brands present in the M2M space. This really allows some room for influential individuals to create more high value inter...