Notes on files: You will have to tweak the files a little if you are not using Weka: In weka, training files can be comma separated values or a proprietary format called ARFF. The files linked in this problem set are in the ARFF format. The ARFF format only adds stuff to an otherwise common comma separated values (CSV) file. Therefore, in order to use these files with other programs you will need to remove the top of the file (which contains ARFF specific instructions) and modify the remaining CSV file accordingly. Please see This Wiki Post for the ARFF format details. The overview should be plenty.
Notes on classifiers: The ID3 algorithm in its pure form (chapter 3 of the book) can only classify instances with nominal attributes. Because the data for this problem contains attributes with continuous values you will use the C4.5 algorithm. For a deeper discussion on these algorithms read section 3.7 on chapter 3. Weka has one classifier, J48, that implements the C4.5 algorithm, so please use that for decision trees. However, the default options turn prunning on. You have to turn it off for (a) at least, and turn it on with a good reason.
Notes on Weka: Weka is a java program, so you have to have java installed. When you run Weka you will have the option of launching a few tools. For this assignment you will be using the Explorer. The Explorer has a pretty intuitive interface. In the first tab you open a file and look at its columns, some statistical properties of the columns and some meta information about the columns. The second tab, "Classify" is where you define your test set and choose a classifier to run. Once you choose a classifier, you can click on the text-box next to the Choose button to obtain information about the classifier and to modify the classifier's options (for example to enable or disable prunning!!!!). Once you run a classifier on the data, a full description and metrics of the classification will appear on the right. You can save that output if you want.