So ... macros are fun!! (a bit of rant, maybe a kinda tutorial, and a quick hack)

maegul (he/they) · edit-2 6 months ago

So ... macros are fun!! (a bit of rant, maybe a kinda tutorial, and a quick hack)

Ephera · edit-2 6 months ago

For ~~figuring out how to write macros~~ anyone wanting to learn about more advanced macros beyond macro_rules, I can recommend this: https://github.com/dtolnay/proc-macro-workshop

Basically, you clone that repo, pick one of the projects, uncomment the first test in the respective tests/progress.rs file and read the steps in the respective unit test file. Then you try to implement a macro to fulfill the test.

It should be said that it isn’t spoon-feeding you, you will still need to read actual documentation for macros. But with its test harness, you get a quick feedback loop and it gives at least some pointers for where to start learning.

maegul (he/they) · 6 months ago

Nice!!

It seems that it covers mainly procedural macros, which for those who don’t know are different from what I cover here. They are more involved but more powerful.

Ephera · 6 months ago

Ah, you’re right. I’ve mainly worked through the sorted-chapter and thought the seq!()-macro would be a macro_rules thing, but apparently that’s a proc_macro-thing with TokenStream parsing and such, too. I didn’t even know that’s an option, although it makes perfect sense. 🙃

maegul (he/they) · edit-2 6 months ago

Yea, and proc_macro TokenStream macros definitely seem worthwhile knowing about without necessarily ever wanting to reach for them, at least not often.

Declarative macros though (using macro_rules! as in the top post) surprised me in how straightforward and useful they are. Basically boilerplate machines built right into the language. I’d previously gotten the impression that all macros were like proc_macro.

It’d be interesting to see some challenges with macro_rules!. I’m not sure there’s much scope to challenge people though … they’re pretty simple. But there are some tricks in the system AFAICT I didn’t touch on here.

Multiple alternative patterns can be matched on in a single macro (just like match expressions)
Patterns can match on invalid rust, where the tt syntax type, which stands for “Token Tree” and accepts, I think, any arbitrary series of tokens, can be powerful
A macro can call itself recursively

Together it seems you can put together a pseudo parser, with recursive calls passing in flags or markers to dictate which branch the call goes down. I found this suggestion on users.rust-lang to use a “switch” token along with the above tricks).

Ephera · 6 months ago

Yeah, I’m only looking into proc_macros, because I’m working on a library. In application code, I do think they’re essentially never going to be worth the complexity that they introduce. But in a library, I can deal with the complexity and hopefully my users don’t have to think about it.

Having said that, I actually don’t think proc_macros are insanely complex. There’s a bit of a learning curve to them, particularly the parsing with the syn-crate takes a moment to understand the concepts.
But once you’ve parsed things, you can use the quote-crate to do templating in quite a similar fashion as macro_rules. The thing is just that all the simple cases are covered by the simpler macro_rules, so you just wouldn’t reach for proc_macro most of the time in application code.

maegul (he/they) · 6 months ago

yea, and it would probably be worth just a quick hack to get a feel for it (procedural macros) at least once so you know what you can reach for when the time comes. As you say, it seems involved, but not really that insanely complex … and knowing the bits that make the language “your own” can be really valuable. Cheers for the workshop thing though, definitely worth knowing about!

SorteKanin@feddit.dk · 6 months ago

I’d definitely recommend looking into the uom crate. It uses a different and in my opinion much better approach to unit-tracking than you present here. Instead of storing the unit at runtime in a field, it uses generics to specify the unit.

So rather than having this:

struct Length {
    value: f64,
    unit: <some enum of all length units>,
}

It does (conceptually but simplified) this:

struct Length<Unit>(f64);

This means that you can for instance do trait impls like this:

impl<U1, U2> Add<Length<U1>> for Length<U2> { // Convert the units and add them together }

So for instance if you had a variable a: Length<Millimeter> and b: Length<Kilometer> you could add them together and still get the correct result, because the units will do the conversion during the addition, and all of this is ensured at compile-time. So you don’t even need to track that you’re using the same units for different variables, the units will sort themselves out automatically. This is much safer than trying to keep track of the unit dynamically at runtime.

That’s at least the idea as far as I understand but I haven’t used the uom crate much, just read about it.

maegul (he/they) · 6 months ago

Oh, what I did here was a toy example, or a “shit, something would be better than nothing” approach for basic usage, where such is the value of having units represented in the type system, something can be better than nothing.

Yes uom does seem good and everything you say about it totally makes sense (I hadn’t actually thought that much about automatic conversions for arithmetic!). I haven’t dug into it at all, but it did have me a little concerned that one could run into some situation it doesn’t handle well (eg, does it work with arrays?). dimensioned seemed nice too (especially as its approach didn’t seem to rely on conversions to base units as uom did), though it is likely unfinished or unmaintained … it was encouraging to see a simulation example in its documentation.

If I ever dig into uom more I’ll definitely report back here. Cheers for the recommendation!

SorteKanin@feddit.dk · 6 months ago

does it work with arrays

Not sure what you mean - work with arrays how?

maegul (he/they) · 6 months ago

Actually … yea, I didn’t think about this much and probably misunderstood something from the dimensioned docs … so it’s probably not a thing or not common at all …

But assigning a unit type of some sort to an array of values, not just a single or scalar value. It has its uses, and I’ve seen an application of this before. It could also probably be achieved with some basic wrapping (eg Newtype wrappers in rust).

SorteKanin@feddit.dk · 6 months ago

Right you can always do vec![Length<YourUnitHere>; 5] and then you have a vec of 5 length values with a certain unit (or you could do a simple array but then you can’t change the length). But you won’t be able to have different units in different values in that array/vec, since that would be different types and you can’t have different types in an array/vec. You could have different types in a tuple, but then you can’t vary the length.

These limitations are usually not a problem, especially not for something like units where you can easily convert between them.

maegul (he/they) · 6 months ago

Oh yea … that works too of course!!

But you won’t be able to have different units in different values in that array/vec,

Oh that wasn’t the aim. The idea behind thinking about arrays was more to head in the direction of having unit-types along with numpy style arrays (ndarray being the only crate I know of for such a tool) … so that calculations and arithmetic with scalars and arrays can get pretty seamless, but with the safety of a units system too.

SorteKanin@feddit.dk · 6 months ago

Consider also checking https://pola.rs/ for data frames and multidimensional data and such :)

maegul (he/they) · 6 months ago

For sure!

So ... macros are fun!! (a bit of rant, maybe a kinda tutorial, and a quick hack)

So ... macros are fun!! (a bit of rant, maybe a kinda tutorial, and a quick hack)

Intro

Using a macro

The Elements of writing patterns with “Declarative macros”

Example

Writing the emitted or new code

Example

Application: “The code” before making it more efficient with a macro

The Concept (and a rant)

Implementation (without using a macro)

Defining a macro instead

Implementation of the macro

Usage

Crates for unit systems

`dimnesioned`

uom

F#