DataStreams - CSV Reader
Sponsored Post
When developing applications for big companies who had some legacy systems/applications running, and you are required to do the Data Extraction from the legacy systems import to your application. This, sometimes, can be really hard, especially when there are missing data in some rows, or some data extractions are comma delimited, some tab and some with other format like XLS, XML etc. Here’s the definition from their website:
DataStreams is a component library for .Net developers to easily integrate data from various formats like CSV, Excel, and XML into their applications.
Csv Reader is an extremely fast and stable .Net class for stream based parsing of virtually any commonly found delimited data format, CSV files, tab delimited files, etc. It’s usable from C#, VB.Net, ASP.Net, or any other .Net language. It’s compatible with the 1.0, 1.1, and 2.0 .Net frameworks.
The methods are designed for ease of use, while the inner architecture is designed purely for speed and efficiency. Parsing is done using the de facto standard CSV file specifications. It handles quoted fields, delimiters in the data, and even data that spans across multiple lines. This gives you the ability to open csv files, edit csv files, and save csv files all directly from code.
Data can be parsed in less than half of the time of any other parser it has been benchmarked against. Rough benchmarks on a 2 Ghz processor, parsing common comma separated columns is 20 MB, or 390,000 rows with 5 columns each, almost 2,000,000 cells total, of data parsing per second. Virtually no garbage collection is needed from the framework because of object reuse internally while parsing.
All development is thoroughly unit tested to guarantee fewer bugs and to insure proper handling of extreme cases, currently using 195 test cases and 1308 asserts across 4800 lines of test code. Pricing is extremely cheap and licensing is amazingly open. One license is good across all machines enterprise wide.
I think DataStreams is a handy tool to have for application developers, especially for those who use .NET, Java or Mono.
The website provides sample codes to see how the code is used. Documentation is very critical in any kind of software, you can read documentation online or download zipped compiled help documentation. And hey, they provide full Java version for free and .NET demo version as downloads!
Special Features:
- Stream based file parsing allowing for parsing of large files without storing the entire file contents in memory
- Simple reader pattern movement through file
- Simple access to field data by index or by field name
- Internal parsing using arrays instead of strings for fast looping over large files
- Uses built in framework functionality to handle most encoding issues instead of guessing or hardcoding
- Includes a CSV writer to ease the worries while creating CSV data. No more needing to worry about what to replace with what to match up with delimiters, text qualifiers, etc
Website: http://www.csvreader.com/
Comments
Leave a Reply
You must be logged in to post a comment.
