user-defined eagerization #8

kurt-o-sys · 2017-07-24T08:32:10Z

implement both multimethods and fn-passing to provide a user-defined eagerization

+ rename manage-with-courage -> manage-eager-conditions + add manage-with function

kurt-o-sys · 2017-07-24T20:02:24Z

Making manage-with-courage public may help people write their own version of manage. I guess adding some examples to the documentation would be nice in that case. However, if the same (alternative) implementation is used often, adding the choice of eagerization to the library makes a lot of sense to me. It prevents everyone from writing the same manage function(s) in their own code base.

As different implementations, there seem to be a few strategies already mentioned:

pr-str - I fail to see why this is a good choice, but the author does, and that's good enough
walk/postwalk - efficient, fast, I like that one (it does some eagerization, but fails in some weird cases, just like pr-str)
identity - no eagerization if you don't use laziness (or don't care about weird behavior when it comes to lazy data structures)

There may be other strategies that make sense, but with these 3, I guess about 99% of the cases will be covered. So one might opt for a cond instead of a multimethod, adding new ways of eagerization to the library as they are requested. However, I don't know why one would 'close' it.

clojureman · 2017-07-24T20:08:31Z

I am a bit afraid of allowing the eagerization to be mutated by the user of the library.
This is because if two libraries each set their own custom eagerization strategy for this lib, they can potentially become incompatible with no warning to the user.

kurt-o-sys · 2017-07-24T20:17:27Z

Well... it's in the manage-function that the user decides which strategy to use, right? If another library, let's say A, uses a certain strategy inside library A, the user of library A shouldn't know or care about this. It's up to the maintainer of library A to make the right decision and document it properly.

However, if library A also adds the special condition system, as a user of A, I'd prefer to add my eagerization of preference. The maintainer of library A doesn't know about the data structures I'm using. I do, however, so it's my call to make the right decision.

Moreover, making manage-with-courage public will result in exactly the same 'potential incompatibility', won't it?

kurt-o-sys · 2017-07-24T20:21:52Z

Oh, you're talking about a global setting? <- #2 (comment)

You certainly misunderstood! I am not in favour of a global setting at all! I am in favour of a setting inside the manage-function. Please, no global settings... At every point I was referring to something like the manage-with and manage-as functions (see pull request)... The former passes functions, the latter is multimethod based. Both allow for something like (def manage (partial manage-with ...)) (as I always intended).

Where does the global setting idea suddenly come from?
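
To make the two per-call options concrete, here is a minimal sketch of how a function-passing manage-with and a multimethod-based manage-as could relate. It is not the code from this pull request: the real manage deals with conditions and handlers, while this stand-in only runs a thunk and eagerizes its result. The thunk-based signatures and the eagerize multimethod name are assumptions made for illustration; only the names manage-with, manage-as and the ::postwalk key come from the thread.

```clojure
(require '[clojure.walk :as walk])

;; Hypothetical stand-in for the library's manage: it only runs `thunk`
;; and applies `eager-fn` to the result before returning it.
(defn manage-with [eager-fn thunk]
  (let [result (thunk)]
    (eager-fn result)
    result))

;; Multimethod variant: the strategy is chosen by keyword, and new
;; strategies can be added with defmethod without touching this code.
(defmulti eagerize (fn [strategy _value] strategy))
(defmethod eagerize ::identity [_ value] value)
(defmethod eagerize ::postwalk [_ value] (walk/postwalk identity value))
(defmethod eagerize ::pr-str   [_ value] (pr-str value) value)

(defn manage-as [strategy thunk]
  (manage-with #(eagerize strategy %) thunk))

;; The strategy is fixed per call site (or per namespace), never globally.
(def manage (partial manage-with #(walk/postwalk identity %)))
```

Because the strategy is an ordinary argument, two libraries that pick different strategies cannot affect each other, which is the point made above against a global setting.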

didibus · 2017-07-26T18:15:05Z

When would a user want to override eagerization? I'm thinking:

1) When pr-str becomes a performance bottleneck in their code.
2) When pr-str fails to eagerize an edge case.

The only need for eagerization is to force lazy code to be executed within the context of manage, to avoid a scenario where a user doesn't realize their code is lazy and not really being managed, since it runs after leaving the manage scope.

So we probably want to offer a faster variant of eagerization which sacrifices correctness, and a version which sacrifices performance for correctness. And then we want to offer one that does not eagerize, if the user wants to be lazy and knows how to handle it, by managing operations inside the lazy construct instead.

* manage - Most correct form of eagerization
* manage! - Not safe in STM and non eagerizing form of manage (name could be different)
* manage-as - Takes an enum of eagerization strategies, can be used to choose between correct, fast or lazy.
* manage-with - Takes an eagerizing fn.

Is this all really necessary? Like how often will (1) and (2) be a concern to most users? If we could find an eagerization strategy that is 100% correct and still fast, we'd only need manage and manage!, as (1) and (2) would be taken care of by manage. We failed to find one. But can't we find one that is as correct as pr-str, yet avoids the overhead of creating strings and coercing values to strings?

Basically I wonder if it wouldn't be better for the library to continually try to improve eagerization both for correctness and performance of manage. And in the rare cases a user isn't satisfied, manage! would be good enough for them to do whatever.

Like for (1), the performance issue. Most of the time bypassing eagerization will actually result in the biggest performance gains. So manage! would be my go-to for performance scenarios. For (2), I'm not even sure it's possible for the user to eagerize what pr-str fails to eagerize. Can we start listing the actual edge cases and see?

TL;DR I think an eagerizing manage and a public non eagerizing manage is all that's needed. Apart from that, we should just look into ways to improve the correctness and the performance of the eagerization over time. In the rare cases a user wants to do anything else, the non eagerizing variant of manage will allow them to do so. But in most scenarios, I think adding more manage variants or strategies makes the library more complicated both in implementation and in ease of use, requiring the user to start understanding why there are so many variants of manage.


kurt-o-sys · 2017-07-26T20:14:19Z

That's exactly what I'm trying to say (I think).
pr-str doesn't guarantee eagerization, and neither does postwalk. I could only see 1 case in which the latter doesn't work, and it's the case I've shown in #2 (comment) (which I call 'embedded laziness' further down).
Again, I don't think pr-str has many advantages over postwalk, and neither guarantees eagerization. That's why I added 3 strategies now, open to adding new ones (using multimethods). There might be a 4th, like 'expensive'. It's pretty easy to document. Basically, it comes down to:

1. Do you use laziness?
- no, use identity
- yes, go to step 2
2. Do you have 'embedded laziness' (or are you not sure)?
- yes - well, expensive eagerization might solve that case, if you don't take care of it yourself
- no - postwalk will do

I don't see a use case for pr-str, as stated already a few times, but some people seem to, so that's why I keep it in the loop :). For an example of when pr-str fails, see the case I've shown above. I'm not sure if there's much difference between pr-str and postwalk - that's why I asked for the use cases where these fail. Again, to me, both fail to guarantee eagerization, so they score about equal on that criterion. postwalk wins when it comes to performance... There's no use case for pr-str I can think of.
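
A small sketch of what postwalk does and does not realize. The example is made up for illustration and is not the case from #2 (comment): it counts how many elements of a lazy seq get realized, first when the seq is nested in Clojure collections, then when it sits inside a plain java.util.ArrayList, which clojure.walk does not descend into.

```clojure
(require '[clojure.walk :as walk])

;; Counts how many elements have actually been realized.
(def realized (atom 0))

(defn counting-seq []
  (map (fn [x] (swap! realized inc) x) [1 2 3]))

;; Laziness nested inside Clojure data: building the map forces nothing,
;; postwalk reaches the seq and forces it via doall.
(def nested {:a [(counting-seq)]})
(def before-walk @realized)
(walk/postwalk identity nested)
(def after-clojure-walk @realized)

;; Laziness held by a Java collection: ArrayList is not a Clojure coll,
;; so postwalk returns it untouched and the seq stays unrealized.
(def hidden (doto (java.util.ArrayList.) (.add (counting-seq))))
(walk/postwalk identity hidden)
(def after-java-walk @realized)
```

Comparing before-walk, after-clojure-walk and after-java-walk shows the counter moving only for the Clojure-structure case, which is the gap the 'expensive' strategy would have to close.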

That being said, I certainly agree there shouldn't be more than 3 (or maybe 4?) variants in the library, and that they should be optimized over time:

- no laziness
- realize laziness (clojure structures only)
- realize everything, incl. laziness embedded in java objects (will need reflection somehow)

I suppose these would cover about 99.9% of the cases. So, coming back to your proposition of 2 strategies, well, I agree - I might go for 1 or maybe 2 more :p. The difference is that one uses different functions, the other the same functions with different parameters.
I only submitted both manage-with and manage-as to show some different strategies. I certainly don't think both are necessary or even a good idea. I just want to be able to use 'no eagerization' (in the case I know I don't use laziness) and 'clojure eagerization' (using postwalk, not pr-str).

didibus · 2017-07-27T06:54:40Z

Ya, I'm not personally seeing value in having more than one eagerization strategy. This lib is for conditional restarts, not eagerization. But I'll let the maintainer make that choice; obviously if others see value in it for them, I'll still be able to use it for my needs.

Given that there's no guaranteed mechanism for eagerization in Clojure, I'd almost prefer that only a non eagerizing manage existed. Make it very explicit to users that they must design for laziness when they are using laziness. As I said before, even Clojure's try/catch block does not handle laziness.
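
The try/catch point can be seen with plain Clojure. A minimal sketch (the function name risky is made up for the example): the lazy seq leaves the try block unrealized, so the exception is thrown later, outside the handler, unless the seq is forced inside it.

```clojure
;; Returns a lazy seq; the division by zero only happens on realization.
(defn risky []
  (map #(/ 1 %) [1 0]))

;; The try block returns the unrealized seq, so nothing is caught here.
(def unforced
  (try (risky)
       (catch ArithmeticException _ :caught)))

;; Realizing it later throws outside the original try.
(def later
  (try (doall unforced)
       (catch ArithmeticException _ :escaped-the-first-try)))

;; Forcing inside the try makes the handler apply.
(def forced
  (try (doall (risky))
       (catch ArithmeticException _ :caught)))
```

Here unforced is still a lazy seq rather than :caught, later ends up as :escaped-the-first-try, and forced is :caught. A non eagerizing manage would behave like the first case, which is why it would need to be documented explicitly.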

kurt-o-sys · 2017-07-27T07:25:08Z

Right, I follow that line of thought... and I fully agree on documenting - I've been stating that already a few times as well. However, if there's only 1 strategy, it shouldn't be pr-str, which is inefficient and doesn't provide any guarantees of eagerization. That was how this discussion started - the pr-str.

It seems better to me to document in which cases eagerization doesn't work (with pr-str, that's a bit vague) and use an efficient eagerization. Or just drop any eagerization and leave it fully to the user. I can live with that as well. But having pr-str as the default and the only strategy doesn't seem like a good idea to me at all. Some people seem to like the pr-str strategy, some obviously don't. Having a more pluggable eagerization, or at least avoiding pr-str, didn't and still doesn't seem like a bad idea to me.

"... I'd almost prefer that only a non eagerizing manage existed."

Right! Avoid pr-str

It has been stated that one can fork and start changing the strategy, but please refer to #2. I don't think that's a good idea either. I'd prefer to have 1 library that fits most uses, rather than a bunch of forks all doing the same thing with only 1 or 2 lines of code difference.

didibus · 2017-07-27T18:00:54Z

I agree, pr-str is hacky. It relies on hoping that each element at level 0 will have an implementation of print which will loop further down one level.

The hacky advantage of pr-str is that some implementations of toString on Java types tend to do that also.

Maybe instead of (constantly nil), you could have:

(cond
  ;; class returns a Class, so compare on its name
  (.startsWith (.getName (class %)) "java.lang") nil
  :else (pr-str %))

This would make it so we still bank on pr-str for java types, but just walk everything that's Clojure or primitive.

In the future, we could even add more concrete java types to the cond and have more optimized walkers for them.

kurt-o-sys · 2017-07-27T18:12:04Z

right... but why not use postwalk for clojure structures? More like:

(cond
  ;; class returns a Class, so compare on its name
  (.startsWith (.getName (class %)) "clojure.lang") (walk/postwalk (constantly nil) %) ;; still no guarantees here! - check my example
  :else (pr-str %))

didibus · 2017-07-29T18:17:18Z

Ya, that can work too. Basically just a hybrid approach, so it uses postwalk when it can, and falls back to pr-str when it can't. So we get speed and we don't regress on coverage.

I was thinking, if you look at the postwalk source, that we can do the same, just where they have :else, we would have it call pr-str, unless it was a primitive from java.lang.

didibus · 2017-07-29T18:23:01Z

(cond
   (list? form) (outer (apply list (map inner form)))
   (instance? clojure.lang.IMapEntry form) (outer (vec (map inner form)))
   (seq? form) (outer (doall (map inner form)))
   (instance? clojure.lang.IRecord form)
     (outer (reduce (fn [r x] (conj r (inner x))) form form))
   (coll? form) (outer (into (empty form) (map inner form)))
   :else (outer
            (cond
              ;; nil has no class; java.lang values need no forcing
              (nil? form) form
              (.startsWith (.getName (class form)) "java.lang") form
              ;; anything else: lean on its print method, keep the value
              :else (do (pr-str form) form))))

Something like that.
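
For reference, the fragment above can be turned into a self-contained function. This is only a sketch of the idea as described in this thread, not the implementation in #9; the names java-lang?, hybrid-walk and eagerize are made up for the example.

```clojure
;; True for values whose class lives under java.lang (String, Long, ...).
(defn- java-lang? [x]
  (and (some? x)
       (.startsWith (.getName ^Class (class x)) "java.lang.")))

;; Same shape as clojure.walk/walk, except for the fallback branch:
;; values the walker cannot descend into are run through pr-str.
(defn hybrid-walk [inner outer form]
  (cond
    (list? form) (outer (apply list (map inner form)))
    (instance? clojure.lang.IMapEntry form) (outer (vec (map inner form)))
    (seq? form) (outer (doall (map inner form)))
    (instance? clojure.lang.IRecord form)
    (outer (reduce (fn [r x] (conj r (inner x))) form form))
    (coll? form) (outer (into (empty form) (map inner form)))
    :else (outer (if (or (nil? form) (java-lang? form))
                   form
                   (do (pr-str form) form)))))

;; Walk the whole structure, forcing lazy seqs along the way.
(defn eagerize [form]
  (hybrid-walk eagerize identity form))
```

With this, (eagerize {:a (map inc [1 2 3])}) forces the nested seq through the walker, while a value such as an atom falls through to pr-str and is only realized as far as its print method goes, so the same lack of guarantees discussed above still applies.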

didibus · 2017-07-31T09:25:34Z

I implemented what I was suggesting: #9

kurt-o-sys · 2017-07-31T09:37:19Z

see #9 :)

kurt-o-sys added 3 commits July 24, 2017 09:38

make eager-fn user-definable

1055889

+ rename manage-with-courage -> manage-eager-conditions + add manage-with function

implement multimethod instead of fn-pass

d606130

implement multimethods and pass-fn

0d55a85

kurt-o-sys mentioned this pull request Jul 24, 2017

Less expensive lazy seq forcing #2

Closed

add ::postwalk eagerization

df0fae2

kurt-o-sys closed this Jul 31, 2017
