Unable to read bullets 

Hello,
I am converting a pdf file into a text file. In the extracted text file, I am not getting the bullet where ever any text starts with a bullet point. 
I need to know when a bullet exists to be able to do some post processing. However, when I am getting the extracted text it is without the bullet point. 

Below is my code:

```python
def extractTextFromPDF(strDownloadDirectory, fileName, txtFilePath):
        filePathName = strDownloadDirectory + fileName
        pdfFileObj = open(filePathName, 'rb')
        pdfReader = PyPDF2.PdfFileReader(pdfFileObj)
        intPages = pdfReader.getNumPages()
        print(intPages)
        strText = ''
        print(fileName)
        fileName =fileName[0:len(fileName)-4]
        txtFilePath = txtFilePath +fileName  + '.txt'
        target_file = open(txtFilePath, "w" , encoding='utf-8')
        for i in range(0,intPages):
            objPDFObj = pdfReader.getPage(i)
            strText =  objPDFObj.extractText().rstrip()
            strText = " ".join(strText.replace(u"\xa0", " ").strip().split())
            print(strText)
        target_file.write(strText)
        target_file.close()
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to read bullets #230

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Unable to read bullets #230

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions