multiply in asm

VDC 8x2 Administrator VDC 8x2 Living life Posts: 348	multiply in asm Jul 8, 2014 23:33:36 GMT Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by VDC 8x2 on Jul 8, 2014 23:33:36 GMT What is the easiest way to multiply a number by 80 in machine language?

hydrophilic
Global Moderator

Posts: 794

multiply in asm Jul 9, 2014 7:38:10 GMT

Quote

Post by hydrophilic on Jul 9, 2014 7:38:10 GMT

I would guess 16*5. Like maybe


  LDA #0
  STA high
  LDA low
  STA temp ;save *1
  ASL
  ROL high ;*2
  ASL
  ROL high ;*4
  ;carry is clear
  ADC temp
  BCC +
  INC high ;*5
+ ;now *16
  ASL
  ROL high ;*2
  ASL
  ROL high ;*4
  ASL
  ROL high ;*8
  ASL
  ROL high ;*16
  STA low

I'm kupo for Kupo Nuts!
∇ • hydrophilic ≠ 0

VDC 8x2
Administrator

VDC 8x2

Living life

Posts: 348

multiply in asm Jul 9, 2014 15:11:17 GMT

Quote

Post by VDC 8x2 on Jul 9, 2014 15:11:17 GMT

Thank you for the code example!

I am working on code to stamp a 16 by 8 bit character on a 8x2 graphic screen.

Its going to be at the heart of the graphic engine for the game translation.

Since the old engine puts redefined multicolor characters on the 40 column screen. I am rebuilding low routines to do 80 column graphics instead. The high level routines won't know the difference.

Last Edit: Jul 9, 2014 15:16:44 GMT by VDC 8x2

hydrophilic
Global Moderator

Posts: 794

multiply in asm Jul 11, 2014 19:06:13 GMT

Quote

Post by hydrophilic on Jul 11, 2014 19:06:13 GMT

Well for low-level stuff, where the multiply will presumably called many times, like in a loop (or nested loops), you might want something faster. If you're multiplying an 8-bit number by 80, and you have 2 pages (512 bytes) of RAM available somewhere, then a look-up table might be better. (definately faster)


  ;takes 10 to 12 cycles
  ; .A contains value to multiply
  tay           ;index tables
  lda loTab80,y ;low byte
  ldx hiTab80,y ;high byte

...

loTab80 .byte < 0000, < 0080, < 0160, < 0240, ...
hiTab80 .byte > 0000, > 0080, > 0160, > 0240, ...

For the tables, you would probably want to use .REPT / .FOR / .DO psudo-ops, which vary by assembler.

Because the tables would use 512 bytes, you might want to think of ways to reuse the tables. For example, you could make the tables actually hold value*10 (instead of *80) then you would look up a value in the table and multiply it by 8.

I think *this* table method, or the previous posted code, are rather simple which is what you asked for. There is also generic n-bit multiply routines, and fast-multiply routine, but they are more complex and the fastest (general) multiply method uses 1024 bytes for tables.

Edit

I was just thinking, my code in a prior post only uses the .A register, so pretty good for loops, although a bit slow. The code above (in this post) uses all general-purpose registers. Which means you would need to save and restore the X and Y registers in the general case. But below is an alternate version of the table method that only uses .A, it is a bit slower than the table method using X and Y, but faster than adding code to save and restore X and Y. The main restriction is the tables need to be page-aligned; additional issues are the code can't be in ROM, and it is not thread-safe (both because self-modifying code).


  ;takes 19 or 20 cycles (if 'high' is ZP or not; code not in ZP)
  ; .A contains value to multiply
  sta getLo+1
  sta getHi+1
getHi:
  lda hiTab80
  sta high
getLo:
  lda loTab80

Edit 2

For comparison, it looks like the original post using only .A takes about 64 cycles, assuming all temporaries are in zero page (or 75 cycles if none are ZP).

Last Edit: Jul 11, 2014 19:31:04 GMT by hydrophilic: added comparison time

I'm kupo for Kupo Nuts!
∇ • hydrophilic ≠ 0

wegi
KIM-1 User

Posts: 35

multiply in asm Sept 14, 2017 18:24:58 GMT hydrophilic likes this

Quote

Post by wegi on Sept 14, 2017 18:24:58 GMT

a*b = ((a+b)/2)^2-((a-b)/2)^2

codebase64.org/doku.php?id=base:fast_8bit_multiplication_16bit_product

codebase64.org/doku.php?id=base:seriously_fast_multiplication

fastest way hydrophilic by lookup table

You could be simple do it by 80 step add that is probably easiest (not fastest of course).

C128Man Vic 20 User Posts: 102	multiply in asm Sept 23, 2017 15:11:08 GMT Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by C128Man on Sept 23, 2017 15:11:08 GMT Hi, Is it possible too call BASIC functions, like * or SIN, COS, ....

hydrophilic
Global Moderator

Posts: 794

multiply in asm Oct 26, 2017 13:19:03 GMT

Quote

Post by hydrophilic on Oct 26, 2017 13:19:03 GMT

Nice idea, C128Man, but I think the OP was interested in speed. Calling the BASIC ROM is *much* slower. But if the goal if minimum code bytes, then calling ROM is a good idea -- thank you!

Thanks for the links, wegi ! The a^2-b^2 code was my first thought too, but for a constant (like 80) a custom solution (above) is faster. But thanks for sharing links... that method is superior in the general case!

It's nice to have options

Last Edit: Oct 26, 2017 13:22:34 GMT by hydrophilic

I'm kupo for Kupo Nuts!
∇ • hydrophilic ≠ 0

C128Man
Vic 20 User

Posts: 102

multiply in asm Oct 26, 2017 15:56:15 GMT

Quote

Post by C128Man on Oct 26, 2017 15:56:15 GMT

Oct 26, 2017 13:19:03 GMT hydrophilic said:

Nice idea, C128Man, but I think the OP was interested in speed. Calling the BASIC ROM is *much* slower. But if the goal if minimum code bytes, then calling ROM is a good idea -- thank you!

Hi,

Do you have a link to use the BASIC ROM?

hydrophilic
Global Moderator

Posts: 794

multiply in asm Oct 31, 2017 11:15:42 GMT

Quote

Post by hydrophilic on Oct 31, 2017 11:15:42 GMT

I don't have a specific link... I wanted to publish an eBook about the C128 ROM years ago, but the technology at the time was too primitive (and now I'm busy with other things). So I can only suggest looking at some books on BombJack, like the Commodore 128 Programmers Reference Guide (128PRG) written by Commodore or perhaps Commodore BASIC 7.0 Internals.

In short, there is a set of JMP codes into BASIC ROM located in the $af00 page of ROM. These routines can do thinks like math, graphics, and running programs. Obviously you are interested in the math... Anyway, hope this helps!

By the way, the math routines are generally easy to use, but they are VERY difficult to debug, if you are using the built-in Machine Language MONITOR. This is because the MONITOR uses the floating-point-accumulator (FAC) for internal/temporary use. If you want to use the MONITOR while testing the BASIC ROM math code, then you should copy the results of FAC into a safe spot of RAM.

I'm kupo for Kupo Nuts!
∇ • hydrophilic ≠ 0

Post by VDC 8x2 on Jul 8, 2014 23:33:36 GMT

Post by hydrophilic on Jul 9, 2014 7:38:10 GMT

Post by VDC 8x2 on Jul 9, 2014 15:11:17 GMT

Post by hydrophilic on Jul 11, 2014 19:06:13 GMT

Post by wegi on Sept 14, 2017 18:24:58 GMT

Post by C128Man on Sept 23, 2017 15:11:08 GMT

Post by hydrophilic on Oct 26, 2017 13:19:03 GMT

Post by C128Man on Oct 26, 2017 15:56:15 GMT

Post by hydrophilic on Oct 31, 2017 11:15:42 GMT

Quick Reply