
Web Feeds: Turn any website into an RSS feed


Not every website has an RSS feed. Some never did. Some had one years ago and quietly removed it. And some sites have content that updates regularly but was never structured as a feed in the first place: job boards, product listings, event calendars, changelog pages. Until now, if a site didn’t offer RSS, you were out of luck.

Web Feeds is a new feature that creates RSS feeds from any website. Point it at a URL, and NewsBlur analyzes the page structure, identifies the repeating content patterns, and generates extraction rules that turn the page into a live feed. It works on news sites, blogs, job boards, product pages, or really anything with a list of items that changes over time.

This is a huge feature and has been requested for years. I’m so thrilled to finally be able to offer it in a way that I feel comfortable with. Other solutions involved having you select story titles on a re-hosted version of the page, but that approach was clumsy and error-prone. This way, we use LLMs to figure out what the story titles are likely to be, present the variations to you, and then let you decide what’s right. So much better!

How it works

Open the Add + Discover Sites page and click the Web Feed tab. Paste a URL and click Analyze. NewsBlur fetches the page, strips out navigation and boilerplate, and analyzes the HTML structure. Within a few seconds, you’ll see multiple extraction variants, each representing a different content pattern found on the page.

Progress updates stream in real time while the analysis runs. NewsBlur typically finds 3-5 different extraction patterns on a page. The first variant is usually the main content (article list, blog posts, product grid), but sometimes the page has multiple distinct sections worth subscribing to. Each variant shows a label, a description of what it captures, and a preview of 3 extracted stories so you can see exactly what you’d get.

Select the variant that matches what you want to follow, pick a folder, and subscribe. NewsBlur will re-fetch and re-extract the page on a regular schedule, just like any other feed.

Story hints

Sometimes the initial best guess isn’t what you’re looking for. Maybe the page has a blog section and a job listings section, and you want the jobs. Click the Refine button and type a hint like “I’m looking for the job postings.” NewsBlur re-analyzes the page with your hint in mind and reorders the variants to prioritize what you described.

What gets extracted

For each story, NewsBlur extracts whatever it can find: title, link, content snippet, image, author, and date. Not every field will be available on every site, and that’s fine. At minimum you’ll get titles and links. The extraction uses XPath expressions, which means it’s precise and consistent across page refreshes as long as the site’s HTML structure stays the same.
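To make the XPath approach concrete, here is a rough sketch of rule-based story extraction. This is illustrative only, not NewsBlur's actual code: the rules, page, and function names are made up, and Python's standard-library ElementTree supports only a subset of full XPath (a real engine would allow richer expressions for titles, dates, and authors).

```python
# Rough sketch of rule-based story extraction (illustrative only; not
# NewsBlur's actual implementation, which generates real XPath rules
# per site). ElementTree supports a useful subset of XPath.
import xml.etree.ElementTree as ET

def extract_stories(page, rules):
    """Apply per-site extraction rules to a page, returning story dicts."""
    root = ET.fromstring(page)
    stories = []
    for item in root.findall(rules["item"]):
        anchor = item.find(rules["anchor"])
        if anchor is not None and anchor.text:  # titles + links are the minimum
            stories.append({"title": anchor.text.strip(),
                            "link": anchor.get("href")})
    return stories

page = """<html><body>
<ul class="posts">
  <li><a href="/a">First post</a></li>
  <li><a href="/b">Second post</a></li>
</ul>
</body></html>"""

# Hypothetical rules of the kind an analysis pass might emit.
rules = {"item": ".//ul[@class='posts']/li", "anchor": "a"}
print(extract_stories(page, rules))
```

Because the rules are just path expressions against the page structure, the same extraction keeps working across refreshes until the site's HTML changes shape.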

When things change

Websites redesign. HTML structures shift. When NewsBlur detects that the extraction rules have stopped working (after 3 consecutive failures), the feed is flagged as needing re-analysis. You’ll see a feed exception indicator, and you can re-analyze the page with one click to generate updated extraction rules.
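The failure detection described above amounts to a consecutive-failure counter. A hypothetical sketch (class and field names are illustrative; only the three-failure threshold comes from the feature description):

```python
# Hypothetical sketch of consecutive-failure tracking for a web feed.
# Names are illustrative, not NewsBlur's actual schema; only the
# threshold of 3 comes from the feature description.
FAILURE_THRESHOLD = 3

class WebFeed:
    def __init__(self):
        self.consecutive_failures = 0
        self.needs_reanalysis = False

    def record_fetch(self, stories):
        if stories:                      # extraction still works
            self.consecutive_failures = 0
        else:                            # rules matched nothing this fetch
            self.consecutive_failures += 1
            if self.consecutive_failures >= FAILURE_THRESHOLD:
                self.needs_reanalysis = True  # surface a feed exception

feed = WebFeed()
for result in ([], [], []):              # three failed extractions in a row
    feed.record_fetch(result)
print(feed.needs_reanalysis)             # the feed is flagged
```

Resetting the counter on any successful fetch means a one-off outage or temporary page glitch never flags the feed; only a sustained structural change does.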

Use cases

Some examples of sites that work well with Web Feeds:

  • Company blogs without RSS — Many corporate blogs dropped their RSS feeds years ago. Web Feeds brings them back.
  • Job boards — Track new postings on a company’s careers page.
  • Government sites — Follow press releases, meeting agendas, or public notices.
  • Changelog pages — Monitor when a tool or service ships updates.
  • Event listings — Keep tabs on upcoming concerts, conferences, or local events.
  • Product pages — Watch for new arrivals or restocks on stores that don’t offer feeds.

Availability

Web Feeds are available to Premium Archive and Premium Pro subscribers. The ongoing feed fetching and extraction runs on NewsBlur’s servers like any other feed.

If you have feedback or ideas for improvements, please share them on the NewsBlur forum.

samuel
11 days ago
One of the best new features ever. I say that but just wait until I launch the Daily Briefing and story clustering, both coming sooooooon... also I just finished AI prompt classifiers for text and for images, so that's also coming. Hoo boy, lots of good stuff. And Android redesign is nearly complete!
San Francisco
chrismorgan
11 days ago
The feature makes sense, but… could you please give it a different name? https://en.wikipedia.org/wiki/Web_feed
samuel
11 days ago
Web feed is a superset of RSS feed, so it seems quite appropriate
chrismorgan
11 days ago
This is specifically a feature to let you subscribe to sources that *don’t have* a web feed. The name “Web Feed” is accordingly very confusing.
2 public comments
digitalink2008
11 days ago
Samuel you absolute BAMF! This is an amazing feature!
jgbishop
11 days ago
NewsBlur keeps getting better!
Raleigh, NC

Today's only half of the leap year fun


It's that time again, when code written in the past four years shows up in our lives and breaks something. Still, while you're enjoying the clown show of game companies telling people to manually set the clocks on their consoles and people not being able to fill up their cars, keep one thing in mind:

Only half of the fun of a leap year happens on February 29th.

The rest of it happens in ten months, when a bunch more code finds out that it's somehow day 366, and promptly flips out. Thus, instead of preparing to party, those people get to spend the day finding out why their device is being stupid all of a sudden.

So, if you got through today unscathed, but are somehow counting days in the year somewhere, you now have about 305 days to make sure you don't have your own Zune bug buried in your code.
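The Zune bug was exactly this species of day-366 failure: a year-counting loop that makes no progress on the last day of a leap year. A Python rendition of the same faulty logic (with an iteration cap standing in for the real device's hang; the function and cap are illustrative):

```python
# A Python rendition of the Zune-style day-366 bug: convert a count of
# days since the start of 1980 into a year. On day 366 of a leap year,
# no branch decrements the counter, so the loop makes no progress.
# An iteration cap stands in for the real device's hang.
def is_leap_year(year):
    return year % 4 == 0 and (year % 100 != 0 or year % 400 == 0)

def year_from_days(days, start_year=1980, max_iters=1000):
    year = start_year
    for _ in range(max_iters):
        if days <= 365:
            return year
        if is_leap_year(year):
            if days > 366:
                days -= 366
                year += 1
            # days == 366: neither branch fires, so... nothing happens.
        else:
            days -= 365
            year += 1
    raise RuntimeError("stuck: the day-366 case never makes progress")

try:
    year_from_days(366)  # day 366 of leap year 1980
except RuntimeError as e:
    print("hang detected:", e)
```

Every other input terminates normally; only the 366th day of a leap year falls through the cracks, which is why the bug sat dormant until December 31st.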

...

One more random thought on the topic: some of today's kids will be around to see what happens in 2100. That one will be all kinds of fun to see who paid attention to their rules and who just guessed based on a clean division by four.
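For the record, the full Gregorian rule that 2100 will test: years divisible by 4 are leap years, except century years, unless those are divisible by 400. A quick comparison of the real rule with the clean-division-by-four guess (function names are mine):

```python
def is_leap_gregorian(year):
    # Divisible by 4, except century years, which must be divisible by 400.
    return year % 4 == 0 and (year % 100 != 0 or year % 400 == 0)

def is_leap_naive(year):
    # The "just guessed based on a clean division by four" version.
    return year % 4 == 0

# The two rules agree on 2000 (divisible by 400) but split on 2100:
print(is_leap_gregorian(2000), is_leap_naive(2000))  # True True
print(is_leap_gregorian(2100), is_leap_naive(2100))  # False True
```

Code that shipped with the naive check got a free pass in 2000, since the century exception and the 400-year exception cancelled out; 2100 is the first year since 1900 where the shortcut actually bites.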


QEMU Advent Calendar 2018 is coming!


QEMU Advent Calendar is running again this year. Each day from December 1st through 24th a surprise QEMU disk image will be released for your entertainment.

Check out the website on December 1st for the first disk image:

https://www.qemu-advent-calendar.org/2018/

Thomas Huth is organizing QEMU Advent Calendar 2018 with the help of others from the QEMU community. If you want to contribute a disk image, take a look at the call for images email.


Team Chat

2078: He announces that he's finally making the jump from screen+irssi to tmux+weechat.
6 public comments
stsquad
3360 days ago
Truth
Cambridge, UK
hooges
3364 days ago
IRC, never die!
Topeka, KS
emdeesee
3364 days ago
OK, fine. When the galactic singularity becomes an option, I'll consider switching from IRC.
Sherman, TX
beowuff
3364 days ago
Don't worry. It'll still run irc as it's back end, so you can still connect.
Fidtz
3364 days ago
2051, solid 1960's style future prediction there!
Covarr
3364 days ago
What if I'm only still on IRC because the rest of my groups are?
East Helena, MT
alt_text_bot
3364 days ago
2078: He announces that he's finally making the jump from screen+irssi to tmux+weechat.

Chris Wellons: Some Performance Advantages of Lexical Scope


I recently had a discussion with Xah Lee about lexical scope in Emacs Lisp. The topic was why lexical-binding exists at the file level when there was already lexical-let (from cl-lib), prompted by my previous article on JIT byte-code compilation. The specific context is Emacs Lisp, but these concepts apply to language design in general.

Until Emacs 24.1 (June 2012), Elisp had only dynamically scoped variables, a feature common to old Lisp dialects largely by historical accident. While dynamic scope has some selective uses, it’s widely regarded as a mistake for local variables, and virtually no other language has adopted it.

Way back in 1993, Dave Gillespie’s deviously clever lexical-let macro was committed to the cl package, providing a rudimentary form of opt-in lexical scope. The macro walks its body replacing local variable names with guaranteed-unique gensym names: the exact same technique used in macros to create “hygienic” bindings that aren’t visible to the macro body. It essentially “fakes” lexical scope within Elisp’s dynamic scope by preventing variable name collisions.

For example, here’s one of the consequences of dynamic scope.

(defun inner ()
  (setq v :inner))

(defun outer ()
  (let ((v :outer))
    (inner)
    v))

(outer)
;; => :inner

The “local” variable v in outer is visible to its callee, inner, which can access and manipulate it. The meaning of the free variable v in inner depends entirely on the run-time call stack. It might be a global variable, or it might be a local variable for a caller, direct or indirect.

Using lexical-let deconflicts these names, giving the effect of lexical scope.

(defvar v)

(defun lexical-outer ()
  (lexical-let ((v :outer))
    (inner)
    v))

(lexical-outer)
;; => :outer

But there’s more to lexical scope than this. Closures only make sense in the context of lexical scope, and the most useful feature of lexical-let is that lambda expressions evaluate to closures. The macro implements this using a technique called closure conversion. Additional parameters are added to the original lambda function, one for each lexical variable (and not just each closed-over variable), and the whole thing is wrapped in another lambda function that invokes the original lambda function with the additional parameters filled with the closed-over variables: yes, the variables themselves (the gensym symbols), not just their values, which gives pass-by-reference semantics. The last point means different closures can properly close over the same variables, and they can bind new values.

To roughly illustrate how this works, the first lambda expression below, which closes over the lexical variables x and y, would be converted into the latter by lexical-let. The #: is Elisp’s syntax for uninterned symbols. So #:x is a symbol named x, but not the symbol x (see print-gensym).

;; Before conversion:
(lambda ()
  (+ x y))

;; After conversion:
(lambda (&rest args)
  (apply (lambda (x y)
           (+ (symbol-value x)
              (symbol-value y)))
         '#:x '#:y args))

I’ve said on multiple occasions that lexical-binding: t has significant advantages, both in performance and static analysis, and so it should be used for all future Elisp code. The only reason it’s not the default is because it breaks some old (badly written) code. However, lexical-let doesn’t realize any of these advantages! In fact, it has worse performance than straightforward dynamic scope with let.

  1. New symbol objects are allocated and initialized (make-symbol) on each run-time evaluation, one per lexical variable.

  2. Since it’s just faking it, lexical-let still uses dynamic bindings, which are more expensive than lexical bindings. It varies depending on the C compiler that built Emacs, but dynamic variable accesses (opcode varref) take around 30% longer than lexical variable accesses (opcode stack-ref). Assignment is far worse, where dynamic variable assignment (varset) takes 650% longer than lexical variable assignment (stack-set). How I measured all this is a topic for another article.

  3. The “lexical” variables are accessed using symbol-value, a full function call, so they’re even slower than normal dynamic variables.

  4. Because converted lambda expressions are constructed dynamically at run-time within the body of lexical-let, the resulting closure is only partially byte-compiled even if the code as a whole has been byte-compiled. In contrast, lexical-binding: t closures are fully compiled. How this works is worth its own article.

  5. Converted lambda expressions include the additional internal function invocation, making them slower.

While lexical-let is clever, and occasionally useful prior to Emacs 24, it may come at a hefty performance cost if evaluated frequently. There’s no reason to use it anymore.

Constraints on code generation

Another reason to be wary of dynamic scope is that it puts needless constraints on the compiler, preventing a number of important optimization opportunities. For example, consider the following function, bar:

(defun bar ()
  (let ((x 1)
        (y 2))
    (foo)
    (+ x y)))

Byte-compile this function under dynamic scope (lexical-binding: nil) and disassemble it to see what it looks like.

(byte-compile #'bar)
(disassemble #'bar)

That pops up a buffer with the disassembly listing:

0       constant  1
1       constant  2
2       varbind   y
3       varbind   x
4       constant  foo
5       call      0
6       discard
7       varref    x
8       varref    y
9       plus
10      unbind    2
11      return

It’s 12 instructions, 5 of which deal with dynamic bindings. The byte-compiler doesn’t always produce optimal byte-code, but this just so happens to be nearly optimal byte-code. The discard (a very fast instruction) isn’t necessary, but otherwise no further compiler smarts can improve on this. Since the variables x and y are visible to foo, they must be bound before the call and loaded after the call. While generally this function will return 3, the compiler cannot assume so since the result ultimately depends on the behavior of foo. Its hands are tied.

Compare this to the lexical scope version (lexical-binding: t):

0       constant  1
1       constant  2
2       constant  foo
3       call      0
4       discard
5       stack-ref 1
6       stack-ref 1
7       plus
8       return

It’s only 8 instructions, none of which are expensive dynamic variable instructions. And this isn’t even close to the optimal byte-code. In fact, as of Emacs 25.1 the byte-compiler often doesn’t produce the optimal byte-code for lexical scope code and still needs some work. Despite not firing on all cylinders, lexical scope still manages to beat dynamic scope in performance benchmarks.

Here’s the optimal byte-code, should the byte-compiler become smarter someday:

0       constant  foo
1       call      0
2       constant  3
3       return

It’s down to 4 instructions due to computing the math operation at compile time. Emacs’ byte-compiler only has rudimentary constant folding, so it doesn’t notice that x and y are constants and misses this optimization. I speculate this is due to its roots compiling under dynamic scope. Since x and y are no longer exposed to foo, the compiler has the opportunity to optimize them out of existence. I haven’t measured it, but I would expect this to be significantly faster than the dynamic scope version of this function.

Optional dynamic scope

You might be thinking, “What if I really do want x and y to be dynamically bound for foo?” This is often useful. Many of Emacs’ own functions are designed to have certain variables dynamically bound around them. For example, the print family of functions use the global variable standard-output to determine where to send output by default.

(let ((standard-output (current-buffer)))
  (princ "value = ")
  (prin1 value))

Have no fear: With lexical-binding: t you can have your cake and eat it too. Variables declared with defvar, defconst, or defvaralias are marked as “special” with an internal bit flag (declared_special in C). When the compiler detects one of these variables (special-variable-p), it uses a classical dynamic binding.

Declaring both x and y as special restores the original semantics, reverting bar back to its old byte-code definition (next time it’s compiled, that is). But it would be poor form to mark x or y as special: You’d de-optimize all code (compiled after the declaration) anywhere in Emacs that uses these names. As a package author, only do this with the namespace-prefixed variables that belong to you.

The only way to unmark a special variable is with the undocumented function internal-make-var-non-special. I expected makunbound to do this, but as of Emacs 25.1 it does not. This could possibly be considered a bug.

Accidental closures

I’ve said there are absolutely no advantages to lexical-binding: nil. It’s only the default for the sake of backwards compatibility. However, there is one case where lexical-binding: t introduces a subtle issue that would otherwise not exist. Take this code for example (and never mind prin1-to-string for a moment):

;; -*- lexical-binding: t; -*-

(defun function-as-string ()
  (with-temp-buffer
    (prin1 (lambda () :example) (current-buffer))
    (buffer-string)))

This creates and serializes a closure, which is one of Elisp’s unique features. It doesn’t close over any variables, so it should be pretty simple. However, this function will only work correctly under lexical-binding: t when byte-compiled.

(function-as-string)
;; => "(closure ((temp-buffer . #<buffer  *temp*>) t) nil :example)"

The interpreter doesn’t analyze the closure, so it just closes over everything. This includes the hidden variable temp-buffer created by the with-temp-buffer macro, resulting in an abstraction leak. Buffers aren’t readable, so this will signal an error if an attempt is made to read this function back into an s-expression. The byte-compiler fixes this by noticing that temp-buffer isn’t actually closed over, and so it doesn’t include it in the closure, making it work correctly.

Under lexical-binding: nil it works correctly either way:

(function-as-string)
;; -> "(lambda nil :example)"

This may seem contrived — it’s certainly unlikely — but it has come up in practice. Still, it’s no reason to avoid lexical-binding: t.

Use lexical scope in all new code

As I’ve said again and again, always use lexical-binding: t. Use dynamic variables judiciously. And lexical-let is no replacement. It has virtually none of the benefits, performs worse, and it only applies to let, not any of the other places bindings are created: function parameters, dotimes, dolist, and condition-case.


Work

Despite it being imaginary, I already have SUCH a strong opinion on the cord-switch firing incident.
11 public comments
brico
3458 days ago
A curiously bureaucratic and teleological vision from Randall; I look at these and see evolution, not design. The gooseneck lamp goes back well over a century. And the glass tumbler and the wooden table with aprons are even older forms.
Brooklyn, NY
Brstrk
3459 days ago
No mention on any documentation processes. Some heroes just go unsung.
duerig
3459 days ago
He missed designing for manufacturability. "This shape was chosen because it is easy to stamp out of sheet metal." or "These two parts snap together in order to eliminate two screws."
duerig
3459 days ago
Also, I found out how they make plastic soda bottles recently. First, a test-tube shaped piece of plastic is cast. Then before it has cooled, they stick it at the opening of a mold and blow air through it until the sides conform to the shape of the mold. For large bottles (2-liter), they have a rod that automatically pushes the bottom of the tube down deeper into the mold as air is blown in. It is like automated glassblowing with plastic, done countless times over and over for a disposable bottle.
infini
3459 days ago
supernormal design as jasper morrison and naoko fukusawa would call it
Asia, EU, Africa
dukeofwulf
3459 days ago
Related: "I, Pencil." http://www.econlib.org/library/Essays/rdPncl1.html
Covarr
3459 days ago
An artist spent several minutes deciding what objects to put on this table.
East Helena, MT
ossiander
3459 days ago
I could see this.
magicseth
3459 days ago
On point
mburch42
3459 days ago
My life.
reconbot
3459 days ago
Truth behind objects
New York City
alt_text_bot
3459 days ago
Despite it being imaginary, I already have SUCH a strong opinion on the cord-switch firing incident.