Quantcast
Channel: Active questions tagged feed - Stack Overflow
Viewing all articles
Browse latest Browse all 547

UnicodeEncodeError: 'utf-8' codec can't encode character '\ud83c' in position 0: surrogates not allowed

$
0
0

I am trying to parse "https://tre.tbe.taleo.net/tre01/ats/servlet/Rss?org=arobpers2&cws=42" but I am getting the error "UnicodeEncodeError: 'utf-8' codec can't encode character '\ud83c' in position 0: surrogates not allowed". I tried looking at other questions with UnicodeEncodeError but this one seems to be different since chardet outputs that the text is ASCII encoded.

import chardet                                                                                                                                                                                                                import feedparser                                                                                                                                                                                                             import requests  url = "https://tre.tbe.taleo.net/tre01/ats/servlet/Rss?org=arobpers2&cws=42"r = requests.get(url)print(chardet.detect(r.text.encode())) # Outputs ASCIIfeed = feedparser.parse(r.text) # Raises UnicodeEncodeError

Viewing all articles
Browse latest Browse all 547

Latest Images

Trending Articles



Latest Images

<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>