RepeatHost Business Hosting

C# Free Component to Generate PDF - Convert HTML to PDF

The comment you are replying to does not exist.

I was trying different components and methods of generating a PDF dynamically using C# and ASP.NET. There are quite a few pay-for

components with prices ranging between $250 and $1,000+ for a license. These pay-for products do a great job and some of them can

generate very complex PDF documents. But I just needed something that would generate simple PDF documents without too much in the way

of formatting.

There are also quite a few open source projects that provide rudimentary support for PDF creation. Many are pretty limited in their

feature sets. It seems Adobe has really kept it's power position by making sure their PDF format is as complex as possible. There is

one component in particular that I tried that worked quite well for my needs and is completely free. It's called Pdfizer. You can

view the site for this project here:

http://sourceforge.net/projects/pdfizer

It was a little difficult finding an up-to-date code sample. There is a post on Code Project on how to use this component but it was

dated 2004 that you can read here:

http://www.codeproject.com/KB/cs/pdfizer.aspx

Unfortunately, the code shown in this article is out-dated and won't even compile anymore. After some playing around and a few

different tries, I was able to get things working. To get this code to work, first download the latest build from sourceforge.net or

you can download the files here just in case the link for sourceforge.net is unavailable:

Pdfizer.zip

Generate PDF from HTML

First, set a reference in your project to the 3 DLL's that Pdfizer uses. Here are the 3 dll names to set a reference to:

ICSharpCode.SharpZipLib.dll (This component is used to parse the HTML)

itextsharp.dll (This component is used by Pdfizer to create the PDF document)

Pdfizer.dll (This is the main component with the HtmlToPdf object that executes the conversion operations).

Now we can add some code to use this component. Here is the code to generate a PDF from some HTML specified:

// set a path to where you want to write the PDF to.
string sPathToWritePdfTo = @"C:\new_pdf_name.pdf";
 
// build some HTML text to write as a PDF.  You could also 
// read this HTML from a file or other means.
// NOTE: This component doesn't understand CSS or other 
// newer style HTML so you will need to use depricated 
// HTML formatting such as the <font> tag to make it look correct.
System.Text.StringBuilder sbHtml = new System.Text.StringBuilder();
sbHtml.Append("<html>");
sbHtml.Append("<body>");
sbHtml.Append("<font size='14'>My Document Title Line</font>");
sbHtml.Append("<br />");
sbHtml.Append("This is my document text");
sbHtml.Append("</body>");
sbHtml.Append("</html>");
 
// create file stream to PDF file to write to
using (System.IO.Stream stream = new System.IO.FileStream

(sPathToWritePdfTo, System.IO.FileMode.OpenOrCreate))
{
    // create new instance of Pdfizer
    Pdfizer.HtmlToPdfConverter htmlToPdf = new Pdfizer.HtmlToPdfConverter();
    // open stream to write Pdf to to
    htmlToPdf.Open(stream);
    // write the HTML to the component
    htmlToPdf.Run(sbHtml.ToString());
    // close the write operation and complete the PDF file
    htmlToPdf.Close();
}

This component also supports PDF Chapters. You could add a single line of code right before the Run() method to make the HTML specified a single chapter like this:

// open stream to write Pdf to to
htmlToPdf.Open(stream);
 
// add a chapter for this HTML
htmlToPdf.AddChapter("My Chapter Title 1");
 
// write the HTML to the component
htmlToPdf.Run(sbHtml.ToString());

Repeat the AddChapter() and Run() methods for each chapter you want to add and then Close() to commit it to the PDF.

Your PDF should look something like this:

sample pdfizer pdf format

Download PDF using ASP.NET

Once the PDF is created, you can dynamically stream it back to the client browser in ASP.NET on the fly as a file download using code like this:

// clear the http response so nothing else is in the stream so we can just isolate the file bits.
HttpContext.Current.Response.Clear();
// add the HTTP header to tell the browser to accept this as a file.  Also, the friendlypdfname.pdf is the name 
// of the PDF as you want it to appear to the user (regardless of what it is named in your file system).
HttpContext.Current.Response.AddHeader("content-disposition", string.Format("attachment; filename={0}", "friendlypdfname.pdf"));
// tell the browser what type of file this is so it can have a mime type associated with it.
HttpContext.Current.Response.ContentType = "application/pdf";
 
// pass the path that you wrote the file to on your file system as the parameter to WriteFile()

HttpContext.Current.Response.WriteFile(sPathToWritePdfTo);
// end the response and commit the file to the stream
HttpContext.Current.Response.End();

You could put this code in an OnClick() button event or other means that would then stream this new PDF down to your client browser.

Some help required with this

I'm trying to include a html table with this. The table appears fine, but it comes with a wide margin on either side of the page, this is making the table too cramped. How can I make the table spread through the width of the pdf page? Please help!

KD

About the itextsharp version

Hi, this was a very helpful post! However, I already have a version of itextsharp.dll (4.0.2.0), and the one available with this component is version 1.0.4.0. I need to use the 4.0 version in my application, and I need to find a way to get this component to refer the 4.0 dll and not look for the 1.0.4 version. How do I do this? Any help would be welcome.

I get this error now - Could not load file or assembly 'itextsharp, Version=1.0.4.0, Culture=neutral, PublicKeyToken=null' or one of its dependencies. The located assembly's manifest definition does not match the assembly reference. (Exception from HRESULT: 0x80131040)

Thanks
KD

medyum

I am to a great extent impressed with the article I have just read interesting very good

Hi...I am passing html file

Hi...I am passing html file which contains pictures too..Its giving an error that Data at the root level is invalid. Line 1, position 1..what does this mean?

Page Break

How I can make a "Page Break"?
Thank you, Rajko

برامج

Great additions, by the use of codes برامج

I am using following

I am using following code:
StringBuilder s = new StringBuilder();
//test aspx = new test();
StringWriter sw = new StringWriter(s);
HtmlTextWriter writer = new Html32TextWriter(sw);
base.Render(writer);

System.IO.Stream stream = new System.IO.FileStream(@"C:\new_pdf_name6.pdf", FileMode.OpenOrCreate);
Pdfizer.HtmlToPdfConverter htmlToPdf = new Pdfizer.HtmlToPdfConverter();
// open stream to write Pdf to to
htmlToPdf.Open(stream);
// write the HTML to the component
htmlToPdf.Run(s.ToString());
// close the write operation and complete the PDF file
htmlToPdf.Close();

This is giving following error on this statement:
htmlToPdf.Run(s.ToString());

Error is: The remote server returned an error: (503) Server Unavailable.

Please help

can you add your source code

thank you for a quite nice html2pdf machine...
the thing is that i want to add new font styles with CP1250 encoding and it sounds impossible with this old itextsharp.dll library, besides this one is not compatibile with the newest version.
the original code doesn't support [table],[br],[p] and so on...
Michal

Image Support Updated

I think the reason your image support worked is because your image paths look to be local paths.  What most have tried and failed with is using images that are at URL paths on the web.  The component seems to have no functionality to download the images and put them in the PDF.  But it looks like if the images are a local path, it works.  I think that might reconcile the difference.

Image Support WORKS!

I'm not sure why others had a problem with it, however, I was able to use this with images without any problem. Below is the basic HTML that I generated (my images were 800X600).

<html>
<body>
<table height='800' width='600'>
<tr><td><img src='filepath1' /></td></tr>
<tr><td><img src='filepath2' /></td></tr>
<tr><td><img src='filepath3' /></td></tr>
</table>
</body>
</html>

This is a great little free method to get simple PDFs created. I highly recommend it!

Damn and blast that really sucks.....

Damn and blast that really sucks..... Its unfortunate that there isn't a open source Html to Pdf converter that can compete with the commercial solutions with out the ridiculous licensing costs. Thanks for the quick reply none the less.

~3~

hello

very cool realy thanks....evden eve nakliyat
evden eve nakliyat

I hadn't tried images but it looks like

I hadn't tried images but it looks like support is not good. It seems to just ignore your image URL's. Kind of sucks. When I put this together, I only had need for text and formatting. Looks like that is as far as this component goes.

Great article, helped a ton, just one

Great article, helped a ton, just one question, what is the support for images like? I have a document rendered in valid XHTML but by adding images the pdf only contains a single table cell, it doesn't contain any of the HTML I had fed to it! Any ideas?

~3~

has this been solved, have the same problem

Hi, i have the same problem i am dynamically building the html( i have checked and the output is correct however it does not show anything on the pdf, any idea's

Hi, i have the same problem i

Hi, i have the same problem i am dynamically building the html( i have checked and the output is correct however it does not show anything on the pdf, any idea'sMedyum

Simple and good article.

thank you for posting the code and necessary info.... its a great article.
There are so many paid products available. Some of the PDF converters are very close to the original Windows XP cost. Thanks again !!

Getting Error !!

Server Error in '/NewJcs' Application.
'0' is an unexpected token. The expected token is '"' or '''. Line 1, position 791.
Description: An unhandled exception occurred during the execution of the current web request. Please review the stack trace for more information about the error and where it originated in the code.

Exception Details: System.Xml.XmlException: '0' is an unexpected token. The expected token is '"' or '''. Line 1, position 791.

Source Error:

Line 492: htmlToPdf.Open(stream);
Line 493: // write the HTML to the component
Line 494: htmlToPdf.Run(sbHtml);
Line 495: // close the write operation and complete the PDF file
Line 496: htmlToPdf.Close();

Source File: d:\Shared\Project\Under My Documents\NewJcs\App_Code\jwstore.cs Line: 494

Stack Trace:

[XmlException: '0' is an unexpected token. The expected token is '"' or '''. Line 1, position 791.]
System.Xml.XmlTextReaderImpl.Throw(Exception e) +76
System.Xml.XmlTextReaderImpl.Throw(String res, String[] args) +88
System.Xml.XmlTextReaderImpl.ThrowUnexpectedToken(String expectedToken1, String expectedToken2) +104
System.Xml.XmlTextReaderImpl.ParseAttributes() +3978624
System.Xml.XmlTextReaderImpl.ParseElement() +343
System.Xml.XmlTextReaderImpl.ParseElementContent() +121
System.Xml.XmlTextReaderImpl.Read() +45
System.Xml.XmlLoader.LoadNode(Boolean skipOverWhitespace) +58
System.Xml.XmlLoader.LoadDocSequence(XmlDocument parentDoc) +20
System.Xml.XmlLoader.Load(XmlDocument doc, XmlReader reader, Boolean preserveWhitespace) +129
System.Xml.XmlDocument.Load(XmlReader reader) +108
System.Xml.XmlDocument.LoadXml(String xml) +113
Pdfizer.HtmlToPdfConverter.Run(String html) +53
jwstore.html2pdf(String sbHtml, String sPathToWritePdfTo) in d:\Shared\Project\Under My Documents\NewJcs\App_Code\jwstore.cs:494
Secure_confirmpayment.generate_text() in d:\Shared\Project\Under My Documents\NewJcs\secured\confirmpayment1.aspx.cs:551
Secure_confirmpayment.Page_Load(Object sender, EventArgs e) in d:\Shared\Project\Under My Documents\NewJcs\secured\confirmpayment1.aspx.cs:395
System.Web.Util.CalliHelper.EventArgFunctionCaller(IntPtr fp, Object o, Object t, EventArgs e) +14
System.Web.Util.CalliEventHandlerDelegateProxy.Callback(Object sender, EventArgs e) +35
System.Web.UI.Control.OnLoad(EventArgs e) +99
System.Web.UI.Control.LoadRecursive() +50
System.Web.UI.Page.ProcessRequestMain(Boolean includeStagesBeforeAsyncPoint, Boolean includeStagesAfterAsyncPoint) +627

Version Information: Microsoft .NET Framework Version:2.0.50727.3603; ASP.NET Version:2.0.50727.3082