Welcome!

Java IoT Authors: SmartBear Blog, Elizabeth White, Waqar Ahmad, Pat Romanski, Liz McMillan

Blog Feed Post

BARUG talks highlight R's diverse applications

by Joseph Rickert The seven lightning talks presented to the Bay Area useR Group on Tuesday night were not only really interesting (in some cases downright entertaining) in their own right, but they also illustrated the diversity of R applications, and the extent to which R has become embedded in the corporate world. Two presentations with a whimsical touch were Gaston Sanchez’s talk on Arc Diagrams with R and Ram Narasimhan presentation on comparing the weather of various cities. Gaston showed a statistical text analysis of the movie scripts from three Star War episodes using arc-diagram representations. Gaston did some original work here in creating the arc-diagram plots and showed how to use R’s tm and igraph packages to extract text and compute adjacency matrices. The Star Wars analaysis code and the arc-diagram code are both available.   Ram’s talk was based on his weather data package (V0.3 on CRAN and V0.4 at GitHub) which has become a very useful and popular tool for scraping weather data from airports and weather stations around the world. The following plot shows how various cities rank according to his wife’s personal comfort score. Also have a look at Ram’s Shiny app next time you are wondering whether you should visit San Francisco or Honolulu. Presentations from Sara Brumbaugh on Running R from Excel, Winston Chen on Data Analysis with RStudio and MongoDB, and Cliff Click and Nidhi Mehta on Using H20 with R all made cases for integrating R with other corporate tools. Sara showed how to combine R scripts and Excel VBA code to pass inputs and parameters from a worksheet to a batch process, and back again. She showed several practical examples as well as quite a few virtuoso Excel tricks like storing and R script in a hidden Excel worksheet. Winston’s talk emphasized how R’s visualization capabilities alone are enough to earn it a place in a big-league machine learning shop. The platform stack at Winston’s company, fliptop. is built around Java/Scala, MongoDB/MySql and Python. But with all of that power they still didn’t have a good way to do data visualization with exploratory data analysis. Winston showed some examples with code of how they use RStudio to pull data from MongoDB into an R data frame where they can plot it. Cliff, 0xdata’s CTO, gave a succinct overview of how the H20 JVM can free R from its memory and speed limitations and make it possible to run machine learning algorithms from the R environment on huge data sets. According to Cliff, if you built a 16 node cluster of machines each with 64GB of RAM and all running H20 you could have a terabyte cluster for H20’s in-memory analytics and run logistic regression, gbm, neural nets, random forests and other machine learning algorithms through the R to H20 Interface. Cliff emphasized that H20 implements a "group-by" feature that is very similar to the way plyr’s ddply function making it possible to do R style analyses on big data. Nidhi followed up by running several of the examples that can be found on the 0xdata website. Nidhi showed real grace under pressure, and made the speed of the H20 algorithms seem all the more impressive by running live demos one after the other while the clock on the 12 minute presentation time limit was running out. Finally the two presentations, the first by, Raman Kapur on Managing Enterprise Cyber Risk through Big Data & Analytics, and the second by Giovanni Seni on Intuit’s new Rego package show how R applications can form the foundation of a production system. After providing some background information on the prevalence of information security breaches, Raman talked about how Foundation’s Edge has built Avana, an R based system to model the risk profile of a corporation’s business units. Giovanni gave a brief introduction to the rule based ensemble methods developed by Friedman and Popescu and worked through an example using the Rego package, which is newly available on Github. Giovanni, who has considerable experience with ensemble methods (have a look at the book he wrote with John Elder), said that he favors rule based methods because of their interpretability. He stressed that in addition to building predictive models, data scientists are often seeking insight into how complex systems work. Rule based ensemble models are useful for both purposes, often outperforming tree based classifiers for prediction. A notable  feature of the Rego package is that it has a command-line, batch interface. Here we have an R package that is meant to do the heavy lifting in a production system. key link: BARUG presentations

Read the original blog entry...

More Stories By David Smith

David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire) overseeing the product management of S-PLUS and other statistical and data mining products.<

David smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on twitter at @RevoDavid

@ThingsExpo Stories
You think you know what’s in your data. But do you? Most organizations are now aware of the business intelligence represented by their data. Data science stands to take this to a level you never thought of – literally. The techniques of data science, when used with the capabilities of Big Data technologies, can make connections you had not yet imagined, helping you discover new insights and ask new questions of your data. In his session at @ThingsExpo, Sarbjit Sarkaria, data science team lead ...
The IoT has the potential to create a renaissance of manufacturing in the US and elsewhere. In his session at 18th Cloud Expo, Florent Solt, CTO and chief architect of Netvibes, will discuss how the expected exponential increase in the amount of data that will be processed, transported, stored, and accessed means there will be a huge demand for smart technologies to deliver it. Florent Solt is the CTO and chief architect of Netvibes. Prior to joining Netvibes in 2007, he co-founded Rift Technol...
Join IBM June 8 at 18th Cloud Expo at the Javits Center in New York City, NY, and learn how to innovate like a startup and scale for the enterprise. You need to deliver quality applications faster and cheaper, attract and retain customers with an engaging experience across devices, and seamlessly integrate your enterprise systems. And you can't take 12 months to do it.
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, will discuss how research has demonstrated the value of Machine Learning in delivering next generation analytics to im...
This is not a small hotel event. It is also not a big vendor party where politicians and entertainers are more important than real content. This is Cloud Expo, the world's longest-running conference and exhibition focused on Cloud Computing and all that it entails. If you want serious presentations and valuable insight about Cloud Computing for three straight days, then register now for Cloud Expo.
So, you bought into the current machine learning craze and went on to collect millions/billions of records from this promising new data source. Now, what do you do with them? Too often, the abundance of data quickly turns into an abundance of problems. How do you extract that "magic essence" from your data without falling into the common pitfalls? In her session at @ThingsExpo, Natalia Ponomareva, Software Engineer at Google, will provide tips on how to be successful in large scale machine lear...
IoT device adoption is growing at staggering rates, and with it comes opportunity for developers to meet consumer demand for an ever more connected world. Wireless communication is the key part of the encompassing components of any IoT device. Wireless connectivity enhances the device utility at the expense of ease of use and deployment challenges. Since connectivity is fundamental for IoT device development, engineers must understand how to overcome the hurdles inherent in incorporating multipl...
The paradigm has shifted. A Gartner survey shows that 43% of organizations are using or plan to implement the Internet of Things in 2016. However, not just a handful of companies are still using the old-style ad-hoc trial-and-error ways, unaware of the critical barriers, paint points, traps, and hidden roadblocks. How can you become a winner? In his session at @ThingsExpo, Tony Shan will present a methodical approach to guide the holistic adoption and enablement of IoT implementations. This ov...
We’ve worked with dozens of early adopters across numerous industries and will debunk common misperceptions, which starts with understanding that many of the connected products we’ll use over the next 5 years are already products, they’re just not yet connected. With an IoT product, time-in-market provides much more essential feedback than ever before. Innovation comes from what you do with the data that the connected product provides in order to enhance the customer experience and optimize busi...
The IETF draft standard for M2M certificates is a security solution specifically designed for the demanding needs of IoT/M2M applications. In his session at @ThingsExpo, Brian Romansky, VP of Strategic Technology at TrustPoint Innovation, will explain how M2M certificates can efficiently enable confidentiality, integrity, and authenticity on highly constrained devices.
Artificial Intelligence has the potential to massively disrupt IoT. In his session at 18th Cloud Expo, AJ Abdallat, CEO of Beyond AI, will discuss what the five main drivers are in Artificial Intelligence that could shape the future of the Internet of Things. AJ Abdallat is CEO of Beyond AI. He has over 20 years of management experience in the fields of artificial intelligence, sensors, instruments, devices and software for telecommunications, life sciences, environmental monitoring, process...
SYS-CON Events announced today that Ericsson has been named “Gold Sponsor” of SYS-CON's @ThingsExpo, which will take place on June 7-9, 2016, at the Javits Center in New York, New York. Ericsson is a world leader in the rapidly changing environment of communications technology – providing equipment, software and services to enable transformation through mobility. Some 40 percent of global mobile traffic runs through networks we have supplied. More than 1 billion subscribers around the world re...
SYS-CON Events announced today that Stratoscale, the software company developing the next generation data center operating system, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Stratoscale is revolutionizing the data center with a zero-to-cloud-in-minutes solution. With Stratoscale’s hardware-agnostic, Software Defined Data Center (SDDC) solution to store everything, run anything and scale everywhere...
Angular 2 is a complete re-write of the popular framework AngularJS. Programming in Angular 2 is greatly simplified – now it's a component-based well-performing framework. This immersive one-day workshop at 18th Cloud Expo, led by Yakov Fain, a Java Champion and a co-founder of the IT consultancy Farata Systems and the product company SuranceBay, will provide you with everything you wanted to know about Angular 2.
SYS-CON Events announced today that Men & Mice, the leading global provider of DNS, DHCP and IP address management overlay solutions, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. The Men & Mice Suite overlay solution is already known for its powerful application in heterogeneous operating environments, enabling enterprises to scale without fuss. Building on a solid range of diverse platform support,...
In his session at @ThingsExpo, Chris Klein, CEO and Co-founder of Rachio, will discuss next generation communities that are using IoT to create more sustainable, intelligent communities. One example is Sterling Ranch, a 10,000 home development that – with the help of Siemens – will integrate IoT technology into the community to provide residents with energy and water savings as well as intelligent security. Everything from stop lights to sprinkler systems to building infrastructures will run ef...
You deployed your app with the Bluemix PaaS and it's gaining some serious traction, so it's time to make some tweaks. Did you design your application in a way that it can scale in the cloud? Were you even thinking about the cloud when you built the app? If not, chances are your app is going to break. Check out this webcast to learn various techniques for designing applications that will scale successfully in Bluemix, for the confidence you need to take your apps to the next level and beyond.
Manufacturers are embracing the Industrial Internet the same way consumers are leveraging Fitbits – to improve overall health and wellness. Both can provide consistent measurement, visibility, and suggest performance improvements customized to help reach goals. Fitbit users can view real-time data and make adjustments to increase their activity. In his session at @ThingsExpo, Mark Bernardo Professional Services Leader, Americas, at GE Digital, will discuss how leveraging the Industrial Interne...
Whether your IoT service is connecting cars, homes, appliances, wearable, cameras or other devices, one question hangs in the balance – how do you actually make money from this service? The ability to turn your IoT service into profit requires the ability to create a monetization strategy that is flexible, scalable and working for you in real-time. It must be a transparent, smoothly implemented strategy that all stakeholders – from customers to the board – will be able to understand and comprehe...
Increasing IoT connectivity is forcing enterprises to find elegant solutions to organize and visualize all incoming data from these connected devices with re-configurable dashboard widgets to effectively allow rapid decision-making for everything from immediate actions in tactical situations to strategic analysis and reporting. In his session at 18th Cloud Expo, Shikhir Singh, Senior Developer Relations Manager at Sencha, will discuss how to create HTML5 dashboards that interact with IoT devic...